Types of Data
There are two types of data:
- with data model: structured, know what to expect
- without data model: structured, have to impose some structure, harder to know what to expect
With Data Model
Structured
- Relations in a Relational Databases
- Relational Data Model
Semi-structured
- XML, HTML, JSON
- Also have some Data Model: Semi-Structured Data Model
- Data Model can be expressed via constrains with schemas: DTD or XML Schema
No Data Model
Unstructured
- Free text: need Text Mining and NLP techniques
- Audio, Video
- Use Information Retrieval to index and structure unstructured data - especially text
Machine-generated data
- Sensors,
- Internet of Things