Tech Reports
Tech Report kmi-05-14 Abstract
Extracting Domain Ontologies with CORDER
Techreport ID: kmi-05-14
Date: 2005
Author(s): Camilo Thorne, Jianhan Zhu, Victoria Uren
The CORDER web mining engine developed at the Knowledge Media Institute computes a lexical coocurrence network out of websites - a binary relation R. A natural extension of CORDER would be that of learning an ontology. However, our work shows that coocurrence proves insufficient to discover concepts and conceptual taxonomies (i.e. very simple ontologies) out of this network. To tackle this problem two unsupervised learning methods were studied based, on the one hand, on set similarity (and thus on a set-based representation of the data) and, on the other hand, on cosine similarity (and thus on a vector-space representation of the data). The underlying idea being that of taking into account, for the clustering, as features, their related coocurring entities (and thus the indirect links among the entities), as suggested, for instance, by O. Ferret. For the purposes of this study, we restricted ourselves to (solely) research areas. The most promising results in our experiments were given by the vector-space representation. To validate the results we used the ACM classification of computer science research areas as our gold standard.
Future Internet
KnowledgeManagementMultimedia &
Information SystemsNarrative
HypermediaNew Media SystemsSemantic Web &
Knowledge ServicesSocial Software
Social Software is...

Interacting with other people not only forms the core of human social and psychological experience, but also lies at the centre of what makes the internet such a rich, powerful and exciting collection of knowledge media. We are especially interested in what happens when such interactions take place on a very large scale -- not only because we work regularly with tens of thousands of distance learners at the Open University, but also because it is evident that being part of a crowd in real life possesses a certain 'buzz' of its own, and poses a natural challenge. Different nuances emerge in different user contexts, so we choose to investigate the contexts of work, learning and play to better understand the trade-offs involved in designing effective large-scale social software for multiple purposes.
Check out these Hot Social Software Projects:
List all Social Software Projects
Check out these Hot Social Software Technologies:
List all Social Software Technologies
List all Social Software Projects
Check out these Hot Social Software Technologies:
List all Social Software Technologies



