Full Seminar Details

Prof. Alan Rector

University of Manchester

Prof. Alan Rector
Experiments in understanding and QA of a very large Ontology
This event took place on Thursday 23 September 2010 at 12:00

SNOMED-CT is a very large (450,000 concept) terminology based on a subset of description logic. Until recently, it was published only in "classified" form in a set of distribution tables. Although everybody knows the hierarchies contain many anomalies, it has been almost impossible to comment on them. Recently they have published the "stated form" and a script for transforming it into OWL. At the same time a group of hospitals has published a list of the most commonly used codes for "problems" - the Core Problem List Subset. Using the module extraction mechanism in the OWL API, and the subset as a signature, a module can be extracted from the stated form which is guaranteed to be sufficient to classify it in the same way as it would be classified in the full SNOMED, but in an ontology of only 35,000 concepts. The new out SNOROCKET (an optimised EL++ classifer) classifies the subset in about 30 seconds making possible iterative exploration and modification.

Using this subset we have begun to develop methods to explore the core subset in combination with two projects. We have begun by taking common key concepts of importance for users and looking up the hierarchies to see how they were classified, then looking for analogies to any problems found. We call the method "analysis by repair". Issues discovered range from simple omissions to gross errors in the ontology schemas for anatomy. Only a few are evident locally without classification.

We have found the Protege Inferred class hierarchy the best screening tool for looking up hierarchies and the OWLViz tool the best definitive tool. Usually, but not always, a complex tangled upwards hierarchy indicates problems. We are just starting to explore the OPPL to find patterns. Performing the task on a large scale requires improved tools.

While this sub-project focuses on an ontology used for terminology, the context is that we wish to use such terminologies as just one small piece of a much larger programme of hybrid ontology based architecture that clearly distinguishes domain ontologies, such as SNOMED, from ontologies describing the use of information from the data structures for that information and that use a variety of reasoning techniques.

(Due to unforeseen circumstances we were unable to record or webcast this event, we apologise to those who were otherwise unable to attend this event in person)

Watch the webcast replay >>

Jobs

Research Asst / Assoc - Text and Data Mining

Knowledge Media Institute (KMi)
29,799 - 38,833 (Grades AC1 / AC2)
Based in Milton Keynes
Temporary contract until 31 December 2018

WE ACCEPT APPLICATIONS FROM CITIZENS GLOBALLY The Knowledge Media Institute (KMi) is a distinct research unit within the Faculty of Science, Technology, Engineering and Mathematics (STEM) at the Open University. KMi is looking for a Web Developer...

Senior Research Fellow x 2

Knowledge Media Institute (KMi)
50,618 - 56,950 (Grade AC4)
Based in Milton Keynes
Permanent Position

WE ACCEPT APPLICATIONS FROM CITIZENS GLOBALLY The Knowledge Media Institute (KMi) is one of the top research centres in the world in the area of knowledge and media technologies, and we offer a creative and flexible working environment. The...

CONTACT US

Knowledge Media Institute
The Open University
Walton Hall
Milton Keynes
MK7 6AA
United Kingdom

Tel: +44 (0)1908 653800

Fax: +44 (0)1908 653169

Email: KMi Support

COMMENT

If you have any comments, suggestions or general feedback regarding our website, please email us at the address below.

Email: KMi Development Team