KMi Publications

Tech Reports

Tech Report kmi-12-02 Abstract


Sentiment Analysis of Microblogs
Techreport ID: kmi-12-02
Date: 2012
Author(s): Hassan Saif
Download PDF

In the past years, we have witnessed an increased interest in microblogs as a hot research topic in the domain of sentiment analysis and opinion mining. Through platforms like Twitter and Facebook, millions of status updates and tweet messages, which reflect people’s opinions and attitudes, are created and sent every day. This has recently brought great potentials and created unlimited opportunities where companies can detect the level of satisfaction or intensity of complaints about certain products and services and policy makers and politicians are able to detect the public opinions about their policies or political issues. Sentiment analysis of microblogs faces several major challenges due to the unique characteristics possessed by microblogging services. One challenge is data sparsity. This is because microblogs contain a large number of irregular and ill-formed words due to the length limit. Another challenge is open-domain where users can post about any topic. This forces building sentiment classifier that work independently of the studied domain. Another serious challenge is data dynamics and evolution as microblogs are produced continuously by a large and uncontrolled number of users. This poses very strict constraints where microblogging data should be processed and analysed in real-time. This report summarises the previous work in microblog sentiment analysis and discusses the major challenges that are yet to be overcome. It then presents my pilot work that has been undertaken so far in which I proposed a novel feature-approach to addressed the data sparsity problem of tweets data. The future plan for the remaining two years is given at the end of the report.
 
KMi Publications Event | SSSW 2013, The 10th Summer School on Ontology Engineering and the Semantic Web Journal | 25 years of knowledge acquisition
 

Multimedia and Information Systems is...


Multimedia and Information Systems
Our research is centred around the theme of Multimedia Information Retrieval, ie, Video Search Engines, Image Databases, Spoken Document Retrieval, Music Retrieval, Query Languages and Query Mediation.

We focus on content-based information retrieval over a wide range of data spanning form unstructured text and unlabelled images over spoken documents and music to videos. This encompasses the modelling of human perception of relevance and similarity, the learning from user actions and the up-to-date presentation of information. Currently we are building a research version of an integrated multimedia information retrieval system MIR to be used as a research prototype. We aim for a system that understands the user's information need and successfully links it to the appropriate information sources, be it a report or a TV news clip. This work is guided by the vision that an automated knowledge extraction system ultimately empowers people making efficient use of information sources without the burden of filing data into specialised databases.

Visit the MMIS website