Enterprise Linked Data Integration

Management summary

How can linked data be used as a more agile and flexible methodology for enterprise data integration?

Addressed problem

Putting all the information that describes a business object, such as a product, a customer or a certain technology, in one place can significantly ease the lives of many people. Unfortunately, the automatic integration of data from various sources can require tremendous effort. Data in enterprises is typically organised in such a way that it remains locked up in its database. Knowledge workers are forced to collect information manually from a series of data silos and to put the pieces together like a puzzle in order to create the basis for a decision-making process. Data integration projects are most often built upon yet another inflexible data structure, so the numerous amendments or additions made to the structure or the semantics of an information component cannot be reflected properly by the integration layer. The result is a landscape of data silos that are scarcely connected to each other. Intelligent linkages happen only in the course of ad hoc processes that are not readily comprehensible.

Our solution approach

Web data, as well as data in enterprises, is characterised by great structural diversity and frequent change. This poses a major challenge for applications based on that data. We address this problem by using a flexible data model that supports the integration of heterogeneous and volatile data. For data integration purposes we make use of linked data technologies, which rely on graph-based models. This makes it possible to extend the schema incrementally with additional properties and constraints. Linked data is based on open standards, which helps to make the effort future-proof.
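To illustrate what incremental schema extension means in a graph-based model, here is a minimal sketch in plain Python that represents a graph as a set of subject-predicate-object triples. The URIs, the property names and the helper `properties_of` are hypothetical and chosen for illustration only; a real deployment would use an RDF triple store rather than an in-memory set.

```python
# A graph is modelled as a set of (subject, predicate, object) triples.
# All URIs and property names below are hypothetical examples.
Triple = tuple[str, str, str]

graph: set[Triple] = {
    ("http://example.org/product/42", "http://example.org/schema/name", "Widget"),
    ("http://example.org/product/42", "http://example.org/schema/price", "19.90"),
}


def properties_of(g: set[Triple], subject: str) -> dict[str, list[str]]:
    """Collect all property values known for one subject."""
    result: dict[str, list[str]] = {}
    for s, p, o in sorted(g):
        if s == subject:
            result.setdefault(p, []).append(o)
    return result


# Extending the "schema" later requires no migration step: a new property
# is simply another triple added to the same graph.
graph.add(
    (
        "http://example.org/product/42",
        "http://example.org/schema/supplier",
        "http://example.org/company/7",
    )
)

description = properties_of(graph, "http://example.org/product/42")
```

The point of the sketch is that no table definition has to change when the `supplier` property appears; existing triples and existing consumers are unaffected by the addition.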

Results

  • 360° views of specific business objects (‘topic pages’) such as products, companies, technologies, etc.
  • Reports based on queries, some of them complex, that can only be answered by combining data from various sources
  • Mashups of unstructured data (e.g. business news, social media) and structured data (e.g. statistics, legacy data)
  • Mashups of data from the web (e.g. open government data) and internal data sources
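As a rough sketch of how a ‘topic page’ could be assembled, the following plain-Python example merges triples from two sources that refer to the same business object by a shared URI. The source names (`internal_db`, `news_feed`), the URIs, the property names and the function `topic_page` are all assumptions made for illustration; they do not describe any particular product.

```python
# Two hypothetical sources that describe business objects by URI.
PRODUCT = "http://example.org/product/42"

internal_db = {
    (PRODUCT, "name", "Widget"),
    (PRODUCT, "unitsSold", "1200"),
}
news_feed = {
    (PRODUCT, "mentionedIn", "http://example.org/news/1"),
    ("http://example.org/product/99", "mentionedIn", "http://example.org/news/2"),
}


def topic_page(subject, *sources):
    """Merge all sources and keep only the statements about one subject."""
    merged = set().union(*sources)
    page = {}
    for s, p, o in sorted(merged):
        if s == subject:
            page.setdefault(p, []).append(o)
    return page


page = topic_page(PRODUCT, internal_db, news_feed)
```

Because both sources use the same identifier for the product, merging is a plain set union; the statement about the other product in `news_feed` is simply ignored for this page.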

Methods, technologies and standards used

  • Linked data stack
  • Semantic Web standards (RDF, SKOS, SPARQL, etc.)
  • Linked data alignment
  • Linked data manager
  • PoolParty Semantic Integrator
  • PoolParty Extractor
  • Large-scale RDF triple stores (e.g. Virtuoso)

See also

Linked Open Data
Recommender Systems
Semantic Content Management
Vocabulary Management