Unsupervised storyline extraction from news articles

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution






Storyline extraction from news streams aims to extract the events under a given news topic and reveal how those events evolve over time. It requires algorithms that can accurately extract events from news articles published in different time periods and link the extracted events into coherent stories. The two tasks are often solved separately, which can lead to error propagation. Existing unified approaches often treat events as topics, ignoring their structured representations. In this paper, we propose a non-parametric generative model that extracts structured representations and evolution patterns of storylines simultaneously. In the model, each storyline is modelled as a joint distribution over locations, organizations, persons, keywords and a set of topics. We further combine this model with the Chinese restaurant process so that the number of storylines is determined automatically, without human intervention. Moreover, a per-token Metropolis-Hastings sampler based on light latent Dirichlet allocation is employed to reduce sampling complexity. The proposed model has been evaluated on three news corpora, and the experimental results show that it outperforms several baseline approaches.
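As an illustration of how a Chinese restaurant process lets the number of clusters grow with the data rather than being fixed in advance, the following is a minimal sketch of the prior alone. It is not the paper's model or sampler: the function name `crp_assign`, the concentration parameter `alpha`, and the use of plain integer items in place of news articles are all assumptions made for this example.

```python
import random


def crp_assign(num_items, alpha, seed=0):
    """Seat items one at a time following a Chinese restaurant process.

    Hypothetical helper for illustration only. Item n (0-based) joins an
    existing cluster k with probability counts[k] / (n + alpha) and opens
    a new cluster with probability alpha / (n + alpha).
    """
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    rng = random.Random(seed)
    counts = []       # counts[k] = number of items currently in cluster k
    assignments = []  # assignments[n] = cluster index chosen for item n
    for _ in range(num_items):
        # Unnormalised weights: existing cluster sizes, then alpha for a new one.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)   # open a new cluster
        else:
            counts[k] += 1     # join an existing cluster
        assignments.append(k)
    return assignments, counts


if __name__ == "__main__":
    assignments, counts = crp_assign(num_items=50, alpha=1.0)
    print("clusters used:", len(counts))
    print("cluster sizes:", counts)
```

Larger values of `alpha` make new clusters more likely. In a full model, these prior weights would be multiplied by the likelihood of each item under each cluster before sampling.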



Publication date: 15 Jul 2016
Publication title: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16)
Place of Publication: Palo Alto, CA (US)
Number of pages: 7
ISBN (Electronic): 978-1-57735-771-1
ISBN (Print): 978-1-57735-770-4
Original language: English
Event: 25th International Joint Conference on Artificial Intelligence: IJCAI-16 - New York, United States
Duration: 9 Jul 2016 to 15 Jul 2016


Conference: 25th International Joint Conference on Artificial Intelligence
Country: United States
City: New York

Bibliographic note


Employable Graduates; Exploitable Research
