KMi Seminars
Inferring the Structure of a Tennis Game Using Multimodal Information
This event took place on Wednesday 13 June 2012 at 11:30

 
Dr. Qiang Huang School of Computing Sciences - University of East Anglia


Our ambitious long-term goal is to understand multimodal interaction between humans and we use a sports game, tennis, as a starting-point. In tennis, the goals of interactions are clearly defined and the interaction is subject to clear rules. As such, the game can be effectively analysed in terms of sequences of “events”. Our work focuses on the retrieval of these sequences from audio and visual information, and moves beyond low-level information classification or clustering of features to inferring the low-level structure of the game, a task which we believe could also be accomplished by an intelligent human who had no previous exposure to the game of tennis. The process of segmenting the stream of events present in the game is somewhat akin to a child learning how to segment a stream of speech into a sequence of words: the child notices that some phonetic sequences tend to re-occur, and that there are patterns of co-occurrence across different sequences. In this spirit, we will use a variable-length multigram model (VLMM) to search for regular occurring patterns of match events that are detected and inferred using multimodal information and constitute the basic “units” in a tennis match.



 
KMi Seminars
 

Multimedia and Information Systems is...


Multimedia and Information Systems
Our research is centred around the theme of Multimedia Information Retrieval, ie, Video Search Engines, Image Databases, Spoken Document Retrieval, Music Retrieval, Query Languages and Query Mediation.

We focus on content-based information retrieval over a wide range of data spanning form unstructured text and unlabelled images over spoken documents and music to videos. This encompasses the modelling of human perception of relevance and similarity, the learning from user actions and the up-to-date presentation of information. Currently we are building a research version of an integrated multimedia information retrieval system MIR to be used as a research prototype. We aim for a system that understands the user's information need and successfully links it to the appropriate information sources, be it a report or a TV news clip. This work is guided by the vision that an automated knowledge extraction system ultimately empowers people making efficient use of information sources without the burden of filing data into specialised databases.

Visit the MMIS website