Intelligent SPARQL endpoints: Optimizing execution performance by automatic query relaxation and queue scheduling

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

The Web of Data is widely considered as one of the major global repositories populated with countless interconnected and structured data prompting these linked datasets to be continuously and sharply increasing. In this context the so-called SPARQL Protocol and RDF Query Language is commonly used to retrieve and manage stored data by means of SPARQL endpoints, a query processing service especially designed to get access to these databases. Nevertheless, due to the large amount of data tackled by such endpoints and their structural complexity, these services usually suffer from severe performance issues, including inadmissible processing times. This work aims at overcoming this noted inefficiency by designing a distributed parallel system architecture that improves the performance of SPARQL endpoints by incorporating two functionalities: (1) a queuing system to avoid bottlenecks during the execution of SPARQL queries; and (2) an intelligent relaxation of the queries submitted to the endpoint at hand whenever the relaxation itself and the consequently lowered complexity of the query are beneficial for the overall performance of the system. To this end the system relies on a two-fold optimization criterion: the minimization of the query running time, as predicted by a supervised learning model; and the maximization of the quality of the results of the query as quantified by a measure of similarity. These two conflicting optimization criteria are efficiently balanced by two bi-objective heuristic algorithms sequentially executed over groups of SPARQL queries. The approach is validated on a prototype and several experiments that evince the applicability of the proposed scheme.

Original languageEnglish
Title of host publicationAlgorithms and Architectures for Parallel Processing - 16th International Conference, ICA3PP 2016, Proceedings
EditorsJesus Carretero, Koji Nakano, Ryan K.L. Ko, Peter Mueller, Javier Garcia-Blas
PublisherSpringer Verlag
Pages3-17
Number of pages15
ISBN (Print)9783319495828
DOIs
Publication statusPublished - 2016
Event16th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2016 - Granada, Spain
Duration: 14 Dec 201616 Dec 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10048 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2016
Country/TerritorySpain
CityGranada
Period14/12/1616/12/16

Keywords

  • Linked open data
  • Multiobjective optimization
  • Ontology management
  • Query rewriting
  • SPARQL

Fingerprint

Dive into the research topics of 'Intelligent SPARQL endpoints: Optimizing execution performance by automatic query relaxation and queue scheduling'. Together they form a unique fingerprint.

Cite this