SAFE: SPARQL Federation over RDF Data Cubes with Access Control

Khan, Yasar; Saleem, Muhammad; Mehdi, Muntazir; Hogan, Aidan; Mehmood, Qaiser; Rebholz-Schuhmann, Dietrich; Sahay, Ratnesh

Author	dc.contributor.author	Khan, Yasar
Author	dc.contributor.author	Saleem, Muhammad
Author	dc.contributor.author	Mehdi, Muntazir
Author	dc.contributor.author	Hogan, Aidan
Author	dc.contributor.author	Mehmood, Qaiser
Author	dc.contributor.author	Rebholz-Schuhmann, Dietrich
Author	dc.contributor.author	Sahay, Ratnesh
Admission date	dc.date.accessioned	2019-05-29T13:10:18Z
Available date	dc.date.available	2019-05-29T13:10:18Z
Publication date	dc.date.issued	2017
Cita de ítem	dc.identifier.citation	Journal of Biomedical Semantics, Volumen 8, Issue 1, 2017
Identifier	dc.identifier.issn	20411480
Identifier	dc.identifier.other	10.1186/s13326-017-0112-6
Identifier	dc.identifier.uri	https://repositorio.uchile.cl/handle/2250/168791
Abstract	dc.description.abstract	Background: Several query federation engines have been proposed for accessing public Linked Open Data sources. However, in many domains, resources are sensitive and access to these resources is tightly controlled by stakeholders; consequently, privacy is a major concern when federating queries over such datasets. In the Healthcare and Life Sciences (HCLS) domain real-world datasets contain sensitive statistical information: strict ownership is granted to individuals working in hospitals, research labs, clinical trial organisers, etc. Therefore, the legal and ethical concerns on (i) preserving the anonymity of patients (or clinical subjects); and (ii) respecting data ownership through access control; are key challenges faced by the data analytics community working within the HCLS domain. Likewise statistical data play a key role in the domain, where the RDF Data Cube Vocabulary has been proposed as a standard format to enable the exchange of such data. However, to the best of our knowledge, no existing approach has looked to optimise federated queries over such statistical data. Results: We present SAFE: a query federation engine that enables policy-aware access to sensitive statistical datasets represented as RDF data cubes. SAFE is designed specifically to query statistical RDF data cubes in a distributed setting, where access control is coupled with source selection, user profiles and their access rights. SAFE proposes a join-aware source selection method that avoids wasteful requests to irrelevant and unauthorised data sources. In order to preserve anonymity and enforce stricter access control, SAFE's indexing system does not hold any data instances-it stores only predicates and endpoints. The resulting data summary has a significantly lower index generation time and size compared to existing engines, which allows for faster updates when sources change. Conclusions: We validate the performance of the system with experiments over real-world datasets provided by three clinical organisations as well as legacy linked datasets. We show that SAFE enables granular graph-level access control over distributed clinical RDF data cubes and efficiently reduces the source selection and overall query execution time when compared with general-purpose SPARQL query federation engines in the targeted setting.
Lenguage	dc.language.iso	en
Publisher	dc.publisher	BioMed Central
Type of license	dc.rights	Attribution-NonCommercial-NoDerivs 3.0 Chile
Link to License	dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/cl/
Source	dc.source	Journal of Biomedical Semantics
Keywords	dc.subject	Data access policy
Keywords	dc.subject	Healthcare and life sciences
Keywords	dc.subject	Linked Data
Keywords	dc.subject	SPARQL query federation
Título	dc.title	SAFE: SPARQL Federation over RDF Data Cubes with Access Control
Document type	dc.type	Artículo de revista
Cataloguer	uchile.catalogador	laj
Indexation	uchile.index	Artículo de publicación SCOPUS
uchile.cosecha	uchile.cosecha	SI

Files in this item

Name:: SAFE.pdf
Size:: 3.135Mb
Format:: PDF

This item appears in the following Collection(s)

Artículos de revistas
Artículos de revistas

Show simple item record

Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivs 3.0 Chile