Unsupervised storyline extraction from news articles

Deyu Zhou, Haiyang Xu, Xin-Yu Dai, Yulan He

Research output: Chapter in Book/Published conference output > Conference publication

Abstract

Storyline extraction from news streams aims to extract events under a certain news topic and reveal how those events evolve over time. It requires algorithms capable of accurately extracting events from news articles published in different time periods and linking these extracted events into coherent stories. The two tasks are often solved separately, which can lead to error propagation. Existing unified approaches often treat events as topics, ignoring their structured representations. In this paper, we propose a non-parametric generative model to extract structured representations and evolution patterns of storylines simultaneously. In the model, each storyline is modelled as a joint distribution over locations, organizations, persons, keywords and a set of topics. We further combine this model with the Chinese restaurant process so that the number of storylines can be determined automatically without human intervention. Moreover, a per-token Metropolis-Hastings sampler based on light latent Dirichlet allocation is employed to reduce sampling complexity. The proposed model has been evaluated on three news corpora, and the experimental results show that it outperforms several baseline approaches.
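As a rough illustration of how a Chinese restaurant process lets the number of clusters grow with the data rather than being fixed in advance, the sketch below draws cluster assignments from the CRP prior alone. It is not the model proposed in the paper: the function name `crp_assign`, the concentration parameter `alpha`, and the use of plain integer labels as stand-ins for storylines are all assumptions made for illustration, and no likelihood over locations, organizations, persons, keywords or topics is included.

```python
import random
from typing import List, Optional


def crp_assign(num_items: int, alpha: float, seed: Optional[int] = None) -> List[int]:
    """Draw cluster labels for num_items items from a CRP prior (hypothetical helper).

    Item n joins existing cluster k with probability proportional to the
    number of items already in k, or opens a new cluster with probability
    proportional to alpha (alpha must be positive).
    """
    rng = random.Random(seed)
    counts: List[int] = []       # counts[k] = items currently in cluster k
    assignments: List[int] = []  # assignments[n] = cluster label of item n
    for _ in range(num_items):
        # One weight per existing cluster, plus alpha for a brand-new one.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)     # open a new cluster
        else:
            counts[k] += 1       # join an existing cluster
        assignments.append(k)
    return assignments


if __name__ == "__main__":
    labels = crp_assign(num_items=50, alpha=1.0, seed=0)
    print("clusters used:", len(set(labels)))
```

Larger values of `alpha` make new clusters more likely, so the expected number of clusters grows with both `alpha` and the number of items; in a full model these prior weights would be multiplied by a data likelihood before sampling.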

Original language: English
Title of host publication: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16)
Place of Publication: Palo Alto, CA (US)
Publisher: AAAI
Pages: 3014-3020
Number of pages: 7
ISBN (Electronic): 978-1-57735-771-1
ISBN (Print): 978-1-57735-770-4
Publication status: Published - 15 Jul 2016
Event: 25th International Joint Conference on Artificial Intelligence: IJCAI-16 - New York, United States
Duration: 9 Jul 2016 - 15 Jul 2016

Conference

Conference: 25th International Joint Conference on Artificial Intelligence
Country/Territory: United States
City: New York
Period: 9/07/16 - 15/07/16

  • Intersubjectivity and sentiment: from language to knowledge

    Gui, L., Xu, R., He, Y., Lu, Q. & Wei, Z., 15 Jul 2016, Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16). Palo Alto, CA (US): AAAI, p. 2789-2795 7 p.

    Research output: Chapter in Book/Published conference output > Conference publication

    Open Access