Introduction to Linked Data and Its Lifecycle on the Web - Reasoning Web

Databases Reference

In-Depth Information

Structure of this chapter. This chapter aims to explain the foundations of Linked Data

and introducing the di

ff

erent aspects of the Linked Data lifecycle by highlighting a par-

ticular approach and providing references to related work and further reading. We start

by briefly explaining the principles underlying the Linked Data paradigm in Section 2.

The first aspect of the Linked Data lifecycle is the extraction of information from un-

structured, semi-structured and structured sources and their representation according to

the RDF data model (Section 3). We present the user friendly authoring and manual

revision aspect of Linked Data with the example of Semantic Wikis in Section 4. The

interlinking aspect is tackled in Section 5 and gives an overview on the LIMES frame-

work. We describe how the instance data published and commonly found on the Data

Web can be enriched with higher level structures in Section 6. We present an overview

of the various data quality dimensions and metrics along with currently existing tools

for data quality assessment of Linked Data in Section 7. Due to space limitations we

omit a detailed discussion of the evolution as well as search, browsing and exploration

aspects of the Linked Data lifecycle in this chapter. The chapter is concluded by sev-

eral sections on promising applications of Linked Data and semantic technologies, in

particular Open Governmental Data, Semantic Business Intelligence and Statistical and

Economic Data.

Overall, this is an updated version of a similar lecture given at Reasoning Web Sum-

mer School 2011 [13].

2

The Linked Data Paradigm

In this section we introduce the basic principles of Linked Data. The section is partially

based on the Section 2 from [54]. The term Linked Data refers to a set of best practices

for publishing and interlinking structured data on the Web. These best practices were

introduced by Tim Berners-Lee in his Web architecture note Linked Data 1

and have

become known as the Linked Data principles. These principles are:

- Use URIs as names for things.

- Use HTTP URIs so that people can look up those names.

- When someone looks up a URI, provide useful information, using the standards

(RDF, SPARQL).

- Include links to other URIs, so that they can discover more things.

The basic idea of Linked Data is to apply the general architecture of the World Wide

Web [71] to the task of sharing structured data on global scale. The Document Web

is built on the idea of setting hyperlinks between Web documents that may reside on

di

erent Web servers. It is built on a small set of simple standards: Uniform Resource

Identifiers (URIs) and their extension Internationalized Resource Identifiers (IRIs) as

globally unique identification mechanism [21], the Hypertext Transfer Protocol (HTTP)

as universal access mechanism [44], and the Hypertext Markup Language (HTML) as

a widely used content format [65]. Linked Data builds directly on Web architecture and

applies this architecture to the task of sharing data on global scale.

ff

1 http://www.w3.org/DesignIssues/LinkedData.html

Reasoning Web

Search WWH ::

Custom Search

Home