the difference between structured and unstructured data (reproduced)
Information can be divided into two broad categories in the society. A class of information can be represented by data or a unified structure, which we call structured data, such as numbers and symbols, while another type of information cannot be represented in numbers or in a unified structure, such as text, images, sounds, web pages, etc., which we call unstructured data. Structured data belongs to unstructured data and is a special case of unstructured data.
Defined:
Structured data: The data, stored in the database, can be used to logically express the data of the implementation of the two-dimensional table structure.
Unstructured data: Includes all forms of Office documents, text, pictures, XML, HTML, various reports, images and audio/video information, and so on.
semi-structured data: The data between fully structured data (such as relational databases, data in object-oriented databases) and completely unstructured data (such as sounds, image files, etc.), and HTML documents belong to semi-structured data. It is generally self-describing, the structure and content of the data mixed together, there is no obvious distinction.
Example:
Data Model:
Structured data: Two-dimensional tables (relational)
Semi-structured data: trees, graphs
Unstructured data: None
Rmdbs data models include: Mesh data model, hierarchical data model, relational
Other:
Structured data: First structure, then data
Semi-structured data: first with data, then structure
Reference articles:
1, http://blog.csdn.net/utnewbear/article/details/8267653
2, http://renyangok.iteye.com/blog/45459
Reproduced
Information can be divided into two broad categories in the society. A class of information can be represented by data or a unified structure, which we call structured data, such as numbers and symbols, while another type of information cannot be represented in numbers or in a unified structure, such as text, images, sounds, web pages, etc., which we call unstructured data. Structured data belongs to unstructured data and is a special case of unstructured data.
Defined:
Structured data: The data, stored in the database, can be used to logically express the data of the implementation of the two-dimensional table structure.
Unstructured data: Includes all forms of Office documents, text, pictures, XML, HTML, various reports, images and audio/video information, and so on.
semi-structured data: The data between fully structured data (such as relational databases, data in object-oriented databases) and completely unstructured data (such as sounds, image files, etc.), and HTML documents belong to semi-structured data. It is generally self-describing, the structure and content of the data mixed together, there is no obvious distinction.
Example:
Data Model:
Structured data: Two-dimensional tables (relational)
Semi-structured data: trees, graphs
Unstructured data: None
Rmdbs data models include: Mesh data model, hierarchical data model, relational
Other:
Structured data: First structure, then data
Semi-structured data: first with data, then structure
Reference articles:
1, http://blog.csdn.net/utnewbear/article/details/8267653
2, http://renyangok.iteye.com/blog/45459