A rule-based transducer forquerying incompletely aligned datasets

Ana I. Torre-Bastida, Jesús Bermúdez, Arantza Illarramendi

Research output: Contribution to journalArticlepeer-review

Abstract

A growing number of Linked Open Data sources (from diverse provenances and about different domains) that can be freely browsed and searched to find and extract useful information have been made available. However, access to them is difficult for different reasons. This study addresses access issues concerning heterogeneity. It is common for datasets to describe the same or overlapping domains while using different vocabularies. Our study presents a transducer that transforms a SPARQL query suitably expressed in terms of the vocabularies used in a source dataset into another SPARQL query suitably expressed for a target dataset involving different vocabularies. The transformation is based on existing alignments between terms in different datasets. Whenever the transducer is unable to produce a semantically equivalent query because of the scarcity of term alignments, the transducer produces a semantic approximation of the query to avoid returning the empty answer to the user. Transformation across datasets is achieved through the management of a wide range of transformation rules. The feasibility of our proposal has been validated with a prototype implementation that processes queries that appear in well-known benchmarks and SPARQL endpoint logs. Results of the experiments show that the system is quite effective in achieving adequate transformations.

Original languageEnglish
Article number23
JournalACM Transactions on the Web
Volume12
Issue number4
DOIs
Publication statusPublished - Sept 2018

Keywords

  • Linked open data
  • Query transformation
  • RDF
  • Semantic web
  • SPARQL

Fingerprint

Dive into the research topics of 'A rule-based transducer forquerying incompletely aligned datasets'. Together they form a unique fingerprint.

Cite this