Tech Report

Extracting URI Patterns from SPARQL Endpoints

Understanding the structure of identifiers in a particular dataset is critical for users/applications that want to use such a dataset, and connect to it. This is especially true in Linked Data where, while benefiting from having the structure of URIs, identifiers are also designed according to specific conventions, which are rarely made explicit and documented. In this paper, we present an automatic method to extract such URI patterns which is based on adapting formal concept analysis techniques to the mining of string patterns. The result is a tool that can generate, in a few minutes, the documentation of the URI patterns employed in a SPARQL endpoint by the instances of each class in the corresponding datasets. We evaluate the approach through demonstrating its performance and efficiency on several endpoints of various origins.

ID: kmi-15-01

Date: 2015

Author(s): Mathieu d'Aquin,Alessandro Adamou,Enrico Daga,Nicolas Jay

Resources:
Download PDF

View By

Other Publications

Latest Seminar
Microsoft Research Cambridge

Actions and their Consequences: Implicit Interactions with Machine Learned Knowledge Bases

More Details

CONTACT US

Knowledge Media Institute
The Open University
Walton Hall
Milton Keynes
MK7 6AA
United Kingdom

Tel: +44 (0)1908 653800

Fax: +44 (0)1908 653169

Email: KMi Support

COMMENT

If you have any comments, suggestions or general feedback regarding our website, please email us at the address below.

Email: KMi Development Team