Abstract
As microblog services such as Twitter become a fast and convenient communication approach, identification of trendy topics in microblog services has great academic and business value. However detecting trendy topics is very challenging due to huge number of users and short-text posts in microblog diffusion networks. In this paper we introduce a trendy topics detection system under computation and communication resource constraints. In stark contrast to retrieving and processing the whole microblog contents, we develop an idea of selecting a small set of microblog users and processing their posts to achieve an overall acceptable trendy topic coverage, without exceeding resource budget for detection. We formulate the selection operation of these subset users as mixed-integer optimization problems, and develop heuristic algorithms to compute their approximate solutions. The proposed system is evaluated with real-time test data retrieved from Sina Weibo, the dominant microblog service provider in China. It's shown that by monitoring 500 out of 1.6 million microblog users and tracking their microposts (about 15,000 daily) with our system, nearly 65% trendy topics can be detected, while on average 5 hours earlier before they appear in Sina Weibo official trends.
Original language | English |
---|---|
Title of host publication | IEEE International Conference on Communications, ICC |
Publisher | IEEE |
Pages | 1194-1200 |
Number of pages | 7 |
ISBN (Print) | 978-1-4673-6432-4 |
DOIs | |
Publication status | Published - 9 Sept 2015 |
Event | IEEE International Conference on Communications - London, United Kingdom Duration: 8 Jun 2015 → 12 Jun 2015 |
Conference
Conference | IEEE International Conference on Communications |
---|---|
Abbreviated title | ICC 2015 |
Country/Territory | United Kingdom |
City | London |
Period | 8/06/15 → 12/06/15 |