• Semi-structured data: Textual data files with a discernible pattern that enables parsing (such as Extensible Markup Language [XML] data files that are self-describing and defined by an XML schema). See Figure 1.5.
• Quasi-structured data: Textual data with erratic formats that can be put into a usable form only with effort, tools, and time (for instance, web clickstream data that may contain inconsistencies in data values and formats).
• Unstructured data: Data that has no inherent structure, which may include text documents, PDFs, images, and video. See Figure 1.7.
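The practical difference between the semi-structured and quasi-structured categories above can be illustrated with a minimal Python sketch. The XML record, the clickstream lines, and the names `extract_path` and `PATH_PATTERN` are hypothetical and chosen only for illustration; they do not come from the figures referenced in the text.

```python
import re
import xml.etree.ElementTree as ET

# Semi-structured: a self-describing XML record (hypothetical sample).
# The tags name each field, so a standard parser recovers the structure directly.
xml_text = "<book><title>Data Basics</title><year>2015</year></book>"
root = ET.fromstring(xml_text)
book = {child.tag: child.text for child in root}

# Quasi-structured: clickstream lines with inconsistent formats (hypothetical sample).
# No schema exists, so a hand-written pattern is needed to pull out one field.
clicks = [
    "2015-01-04 10:02:11 GET /home",
    "01/04/2015 10:02 /search?q=xml",
]

# Assumption: the requested path is the last whitespace-separated token
# and starts with a slash.
PATH_PATTERN = re.compile(r"\s(/\S*)$")


def extract_path(line):
    """Return the trailing URL path of a clickstream line, or None if absent."""
    match = PATH_PATTERN.search(line)
    return match.group(1) if match else None


paths = [extract_path(line) for line in clicks]

print(book)
print(paths)
```

The XML record needs no custom logic because its markup describes it, whereas the clickstream lines require a pattern that encodes an assumption about where the path sits. Unstructured data such as images or video offers no comparable pattern to parse, so it is left out of the sketch.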