The Semantic Web, linked data and open data

Source: Internet
Author: User

Back in 2001 Tim Berners-Lee and his collaborators published a seminal article (groundbreaking paper) called "the Semantic Web" in which they presented their idea of"A new form of web content that is meaningful to computers [and] will unleash a revolution of new possibilities". In the last few years, the idea has gained traction and technologies have become available to build parts of this vision. unfortunately, getting started is not so easy, because there are using concepts with slightly varying names and minute differences in their meaning and several technologies with cryptic names, so let's start with some definitions.

First up is the termSemantic Web. The Semantic Web describes the vision that machines will be able to understand the meaning ("semantics") of information on the Internet, and be able to"Perform tasks automatically and locate related information on behalf of the user"(Wikipedia). What is important to understand, is that this term describes an amalgam of concepts and technologies (similar to the" Web 2.0 ") and not a single technology.

One semantic ical concept that is part of the semantic web vision isLinked Data, Which describes"A Method of publishing structured data, so that it can be interlinked and become more useful"(Wikipedia ). the above-mentioned example shows the power of this: instead of giving our software a meaningless (at least to a machine) string as an input, we give it an object with an uri (Zürich) and define this object as being of type, amongst others, populated place.

The meaning of "a populated place" in this case is clearly defined, so that others can look up what it meansExactlyAnd also use this definition themselves. this way, if someone uses "a populated place", everyone talks about the same thing. also, if we take a look at the definition of label, it says that it is "a human-readable name for the subject ".

The description of "a populated place" is part ofVocabularyThat has been defined in an ontology. what's interesting is that this ontology can be defined by anyone. this allows for the creation of ontologies for special areas of interest, such as the "friend of a friend (foaf)" or the "hcard" vocabularies, which were created by individuals or small groups and have proven useful to their community. because of the distributedness Of These ontologies, they can be formed bottom-up and save us from creating the one global ontology, which wocould be a gargantuan task.

Linked Data by itself doesn't have to be publicly available data, it can just as well be used in private, so we need one more definition:Open Data. It describes "a philosophy and practice requiring that certain data be freely available to everyone, without restrictions from copyright, patents or other mechanic ISMs of control" (Wikipedia ). this is similar in spirit to other movements like open source software, and there is work being done to create licenses that clarify the usage terms of the data (e.g. open definition and the open data commons ).

At last, to describe data that is openAndLinked, there's the combination of the two,Linked Open Data. This is the data we, as visualization creators, want, because it has clear license terms and is easily linkable with other data sets. to put these terms in relation to each other, I created the following graphic; in the world of all data, only the blue areas are open to the public, with the dark blue being openAndLinked.

Democratic Governments have always had to make the data they produce transparent to their citizens, however, just do so using proprietary file formats like excel, machine-unfriendly events like PDFs, or "hide" the data by distributing it over into government sites and thus making it (unintentionally) hard to find. this is all open data, because people can look at and use it.

Luckily, there is this new trend to make dataReallyOpen, not just legally and as a matter of form. sites like data.gov have started to provide open data as a central, searchable catalog, often with the option of accessing the data through APIS, which makes it a lot easier to consume the data, as it doesn' t have to be transformed, combined and prepared for a program to use. with this central catalog in place, they have now been able to go a step further and start transforming this data into a huge linked open data set, that is accessible to everyone. the graphic below shows the size of the linked Open Data web at the end of 2010: each bubble is a website that you can access through linked open data technologies in similar ways that you wowould normally access a database.

To get some perspective on these different ways of publishing data, berners-Lee suggested a 5-star system to describe the accessibility quality of data sets to emphasize that "the Semantic Web isn' t just about putting data on the Web ", but doing so in ways that allow machines to understand the meaning of the data. the lidrc lab has taken Berners-Lee's proposal and prepared it using examples and annotations. go and have a look at the linked open data star scheme by example, it's a good read.

What the system does not take into account, however, isQualityOf the data itself. as with everything on the Internet, remember that even if you get your hands on a well-published linked open data set, it may be incomplete, taken out of context or badly curated. bad content in, bad content out does still apply. this problem is especially acute for linked open data at the moment, because everyone is just starting out with creating the ontologies and links and there is no way to do this overnight, so incompleteness will probably prevail for a while.

 

 

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.