RDF Chain Query Optimization in a Distributed Environment

Alexander Hogenboom, E Niewenhuijse, M Jansen, Flavius Frasincar, Damir Vandic

Research output: Chapter/Conference proceedingConference proceedingAcademicpeer-review

2 Citations (Scopus)

Abstract

In order to efficiently disclose the ever-growing amount of distributed RDF data in Semantic Web environments, RDF query engines must optimize the join order of partial query results. Existing methods include two-phase optimization (2PO), a genetic algorithm (GA), and ant colony optimization (ACO), which have mostly been evaluated on a single source. We adapt these methods to a distributed setting and evaluate the effects of distinct join methods, i.e., nested-loop join, bind join, and AGJoin. When optimizing RDF chain queries combining real-world data from 34 different SPARQL endpoints, the ACO method produces the best results in the least amount of time for most chain queries consisting of up to about ten joins. For larger chain queries, each of our considered algorithms may have its benefits, depending on the join method used. When using the least naive join method, i.e., AGJoin, a GA approach produces solutions of a competitive quality in significantly less time than both ACO and 2PO.
Original languageEnglish
Title of host publication30th ACM/SIGAPP Symposium on Applied Computing (SAC 2015)
PublisherACM
Pages353-359
Number of pages7
DOIs
Publication statusPublished - 13 Apr 2015

Research programs

  • EUR ESE 32

Fingerprint

Dive into the research topics of 'RDF Chain Query Optimization in a Distributed Environment'. Together they form a unique fingerprint.

Cite this