Tech Report

Extracting URI Patterns from SPARQL Endpoints

Understanding the structure of identifiers in a particular dataset is critical for users/applications that want to use such a dataset, and connect to it. This is especially true in Linked Data where, while benefiting from having the structure of URIs, identifiers are also designed according to specific conventions, which are rarely made explicit and documented. In this paper, we present an automatic method to extract such URI patterns which is based on adapting formal concept analysis techniques to the mining of string patterns. The result is a tool that can generate, in a few minutes, the documentation of the URI patterns employed in a SPARQL endpoint by the instances of each class in the corresponding datasets. We evaluate the approach through demonstrating its performance and efficiency on several endpoints of various origins.

ID: kmi-15-01

Date: 2015

Author(s): Mathieu d'Aquin,Alessandro Adamou,Enrico Daga,Nicolas Jay

Download PDF

View By

Other Publications

Latest Seminar
Mr Antonello Meloni
Department of Mathematics and Computer Science, University of Cagliari, IT

Large Language Models for Scientific Question Answering: an Extensive Analysis of the SciQA Benchmark

Watch the live webcast


Knowledge Media Institute
The Open University
Walton Hall
Milton Keynes
United Kingdom

Tel: +44 (0)1908 653800

Fax: +44 (0)1908 653169

Email: KMi Support


If you have any comments, suggestions or general feedback regarding our website, please email us at the address below.

Email: KMi Development Team