KMi Publications

Tech Reports

Tech Report kmi-08-08 Abstract


The Use of Ontologies for Improving Image Retrieval and Annotation
Techreport ID: kmi-08-08
Date: 2008
Author(s): Ainhoa Llorente Coto
Download PDF

Nowadays, digital photography is a common technology for capturing and archiving images due to the falling price of storage devices and the wide availability of digital cameras. Without efficient retrieval methods the search of images in large collections is becoming a painstaking work. Most of the traditional image search engines rely on keyword-based annotations because they lack the ability to examine image content. However, “a picture is worth a thousand words”, this means that up to a thousand words can be needed to describe the content depicted in a picture. This research proposes the use of highly structured annotations called ontologies to improve efficiency in image retrieval as well as to overcome the semantic gap that remains between user expectations and system retrieval capabilities. This work focuses on automated image annotation which is the process of creating a model that assigns visual terms to images because manual annotation is a time consuming and inefficient task. Up to now, most of the automated image annotation systems are based on a combination of image analysis and statistical machine learning techniques. The main objective of this research is to evaluate whether the underlying information contained in an ontology created from the vocabulary of terms used for the annotation could be effectively used together with the extracted visual information in order to produce more accurate annotations.
 
KMi Publications
 

Multimedia and Information Systems is...


Multimedia and Information Systems
Our research is centred around the theme of Multimedia Information Retrieval, ie, Video Search Engines, Image Databases, Spoken Document Retrieval, Music Retrieval, Query Languages and Query Mediation.

We focus on content-based information retrieval over a wide range of data spanning form unstructured text and unlabelled images over spoken documents and music to videos. This encompasses the modelling of human perception of relevance and similarity, the learning from user actions and the up-to-date presentation of information. Currently we are building a research version of an integrated multimedia information retrieval system MIR to be used as a research prototype. We aim for a system that understands the user's information need and successfully links it to the appropriate information sources, be it a report or a TV news clip. This work is guided by the vision that an automated knowledge extraction system ultimately empowers people making efficient use of information sources without the burden of filing data into specialised databases.

Visit the MMIS website