Storage balancing in P2P based distributed RDF data stores

Maximiliano Osorio, Carlos Buil-Aranda

Research output: Contribution to conference › Paper

Abstract

Centralized RDF repositories have been designed to support RDF data storage and retrieval. However, they suffer from the traditional limitations of centralized approaches, namely limited scalability and fault tolerance, especially in a Web scenario. Peer-to-Peer (P2P) networks can provide the scalability, fault tolerance, and robustness that current solutions for local RDF storage lack. A common strategy in state-of-the-art P2P RDF data stores is to store each triple at three locations, so that it can be found by a look-up on its subject, predicate, or object identifier. One major issue with this strategy is the lack of load balancing, since term occurrences in triples are not uniformly distributed. Consequently, this leads to an unbalanced distribution of query processing load and an unfair storage load across the network. To solve this problem, we propose a new scheme that splits the data held by overloaded nodes across neighboring nodes. We propose the use of a Prefix Hash Table consisting in XXX to access such data. We provide an empirical evaluation of our approach and compare it with other state-of-the-art systems for storage balancing, showing the feasibility of our approach.
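
The threefold indexing strategy described in the abstract, and the imbalance it causes, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the ring size `NUM_NODES`, the helper names `node_for`, `place_triple`, and `storage_load`, the use of SHA-1 modulo the node count as a stand-in for a DHT look-up, and the sample triples are all assumptions made for illustration.

```python
import hashlib
from collections import Counter

NUM_NODES = 8  # hypothetical number of peers in the network


def node_for(term: str, num_nodes: int = NUM_NODES) -> int:
    """Map a term to a node id by hashing it (a stand-in for a DHT look-up)."""
    digest = hashlib.sha1(term.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_nodes


def place_triple(triple, num_nodes: int = NUM_NODES):
    """Return the node ids for the three copies of a triple:
    one keyed by subject, one by predicate, one by object."""
    return [node_for(term, num_nodes) for term in triple]


def storage_load(triples, num_nodes: int = NUM_NODES) -> Counter:
    """Count how many triple copies each node ends up storing."""
    load = Counter({node: 0 for node in range(num_nodes)})
    for triple in triples:
        for node in place_triple(triple, num_nodes):
            load[node] += 1
    return load


# Illustrative data: many distinct subjects sharing one predicate and one object.
triples = [(f"ex:person{i}", "rdf:type", "foaf:Person") for i in range(100)]
load = storage_load(triples)

for node in sorted(load):
    print(f"node {node}: {load[node]} copies")
```

Because every sample triple shares the predicate `rdf:type` and the object `foaf:Person`, the nodes responsible for those two keys each receive a copy of all 100 triples, while the subject-keyed copies are spread over the ring. This is the kind of skew that a scheme for splitting overloaded nodes' data across neighbors is meant to address.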
Original language: English
Publication status: Published - 1 Jan 2017
Event: CEUR Workshop Proceedings
Duration: 1 Jan 2019 → …

Cite this

Osorio, M., & Buil-Aranda, C. (2017). Storage balancing in P2P based distributed RDF data stores. Paper presented at CEUR Workshop Proceedings.