Topic extraction from microblog posts using conversation structures

Jing Li, Ming Liao, Wei Gao, Yulan He, Kam-Fai Wong

Research output: Chapter in Book/Report/Conference proceeding › Conference publication

Abstract

Conventional topic models are ineffective for topic extraction from microblog messages, since the lack of structure and context among the posts yields poor message-level word co-occurrence patterns. In this work, we organize microblog posts as conversation trees based on reposting and replying relations, which enrich context information to alleviate data sparseness. Our model generates words according to topic dependencies derived from the conversation structures. Specifically, we differentiate messages as leader messages, which initiate key aspects of previously focused topics or shift the focus to different topics, and follower messages, which do not introduce any new information but simply echo topics from the messages that they repost or reply to. Our model captures the different extents to which leader and follower messages may contain the key topical words, and thus further enhances the quality of the induced topics. The results of thorough experiments demonstrate the effectiveness of our proposed model.
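
As a rough illustration of the first step the abstract describes (grouping posts into conversation trees by their repost/reply relations), the following is a minimal sketch and not the authors' implementation. The input format, a list of hypothetical `(post_id, parent_id)` pairs, and the function name `build_conversation_trees` are assumptions made for this example; the leader/follower distinction is inferred by the model itself and is not attempted here.

```python
from collections import defaultdict


def build_conversation_trees(posts):
    """Group posts into conversation trees.

    `posts` is a list of (post_id, parent_id) pairs (an assumed format).
    `parent_id` is None for a post that starts a conversation; otherwise
    it is the id of the post being reposted or replied to.
    """
    children = defaultdict(list)  # parent id -> ids of direct replies/reposts
    roots = []                    # ids of posts that start a conversation
    for post_id, parent_id in posts:
        if parent_id is None:
            roots.append(post_id)
        else:
            children[parent_id].append(post_id)

    def subtree(node):
        # Recursively attach each post's replies beneath it.
        return {"id": node, "replies": [subtree(c) for c in children[node]]}

    return [subtree(root) for root in roots]


# Hypothetical toy data: two conversations, the first with nested replies.
posts = [("a", None), ("b", "a"), ("c", "a"), ("d", "b"), ("e", None)]
trees = build_conversation_trees(posts)
```

Each resulting tree collects a root post together with everything that reposts or replies to it, which is the enriched context the abstract refers to.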
Original language: English
Title of host publication: The 54th Annual Meeting of the Association for Computational Linguistics
Subtitle of host publication: proceedings of the conference
Publisher: Association for Computational Linguistics
Pages: 2114-2123
Number of pages: 10
Volume: 4
ISBN (Electronic): 978-1-5108-2758-5
Publication status: Published - 15 Aug 2016
Event: 54th Annual Meeting of the Association for Computational Linguistics - Humboldt University, Berlin, Germany
Duration: 7 Aug 2016 to 12 Aug 2016

Meeting

Meeting: 54th Annual Meeting of the Association for Computational Linguistics
Abbreviated title: ACL 2016
Country: Germany
City: Berlin
Period: 7/08/16 to 12/08/16



  • Research Output

    • 1 Conference contribution

    Jointly event extraction and visualization on Twitter via probabilistic modelling

    Zhou, D., Gao, T. & He, Y., 15 Aug 2016, The 54th Annual Meeting of the Association for Computational Linguistics: proceedings of the conference. Association for Computational Linguistics, Vol. 1. p. 269-278 10 p.

    Research output: Chapter in Book/Report/Conference proceeding › Conference publication

    Open Access
  • Cite this

    Li, J., Liao, M., Gao, W., He, Y., & Wong, K-F. (2016). Topic extraction from microblog posts using conversation structures. In The 54th Annual Meeting of the Association for Computational Linguistics: proceedings of the conference (Vol. 4, pp. 2114-2123). Association for Computational Linguistics. http://www.aclweb.org/anthology/P/P16/P16-1199.pdf