Full Seminar Details
Evgeny Shadchnev
Imperial College London, Department of Computing

This event took place on Friday 24 August 2007 at 12:00
Lucene, a state-of-the-art open source information retrieval library, is an efficient solution for indexing and searching textual data. However, some Lucene usage scenarios require handling geographically augmented data, that is, text documents that contain geographical coordinates (e.g. Wikipedia pages about cities). Such data is best searched using spatial access methods, such as R-trees, provided that the number of unique documents is large enough to benefit from this approach. This talk describes an extension to Lucene that speeds up searches over geographically augmented data.