Making sense of microposts (#MSM2013) concept extraction challenge

Amparo Elizabeth Cano Basave, Andrea Varga, Matthew Rowe, Milan Stankovic, Aba-Sah Dadzie

Research output: Chapter in Book/Published conference outputConference publication


Microposts are small fragments of social media content that have been published using a lightweight paradigm (e.g. Tweets, Facebook likes, foursquare check-ins). Microposts have been used for a variety of applications (e.g., sentiment analysis, opinion mining, trend analysis), by gleaning useful information, often using third-party concept extraction tools. There has been very large uptake of such tools in the last few years, along with the creation and adoption of new methods for concept extraction. However, the evaluation of such efforts has been largely consigned to document corpora (e.g. news articles), questioning the suitability of concept extraction tools and methods for Micropost data. This report describes the Making Sense of Microposts Workshop (#MSM2013) Concept Extraction Challenge, hosted in conjunction with the 2013 World Wide Web conference (WWW'13). The Challenge dataset comprised a manually annotated training corpus of Microposts and an unlabelled test corpus. Participants were set the task of engineering a concept extraction system for a defined set of concepts. Out of a total of 22 complete submissions 13 were accepted for presentation at the workshop; the submissions covered methods ranging from sequence mining algorithms for attribute extraction to part-of-speech tagging for Micropost cleaning and rule-based and discriminative models for token classification. In this report we describe the evaluation process and explain the performance of different approaches in different contexts.

Original languageEnglish
Title of host publication#MSM2013 : concept extraction challenge at Making Sense of Microposts 2013
Subtitle of host publicationProceedings of the concept extraction challenge at the workshop on 'Making Sense of Microposts' co-located with the 22nd international World Wide Web conference (WWW'13)
EditorsAmparo E. Cano, Matthew Rowe, Milan Stankovic, Aba-Sah Dadzie
Number of pages15
Publication statusPublished - 2013
EventMaking sense of microposts - Rio de Janeiro, Brazil
Duration: 13 May 2013 → …

Publication series

NameCEUR workshop proceedings
ISSN (Print)1613-0073


WorkshopMaking sense of microposts
CityRio de Janeiro
Period13/05/13 → …
OtherCo-located with the 22nd International World Wide Web Conference (WWW 2013)

Bibliographical note

Cano Basave, AE, Varga, A, Rowe, M, Stankovic, M & Dadzie, A-S: Making sense of microposts (#MSM2013) concept extraction challenge. Proc. of the workshop on 'Making Sense of Microposts' co-located with the 22nd international World Wide Web conference (WWW'13), Rio de Janeiro, Brazil, 13 May,


Dive into the research topics of 'Making sense of microposts (#MSM2013) concept extraction challenge'. Together they form a unique fingerprint.

Cite this