Topical Phrase Extraction from Clinical Reports by Incorporating both Local and Global Context

Gabriele Pergola, Yulan He, David Lowe

Research output: Chapter in Book/Published conference outputConference publication


Making sense of words often requires to simultaneously
examine the surrounding context of a term as well as the
global themes characterizing the overall corpus. Several
topic models have already exploited word embeddings
to recognize local context, however, it has been weakly
combined with the global context during the topic inference.
This paper proposes to extract topical phrases
corroborating the word embedding information with the
global context detected by Latent Semantic Analysis,
and then combine them by means of the Polya urn ´
model. To highlight the effectiveness of this combined
approach the model was assessed analyzing clinical reports,
a challenging scenario characterized by technical
jargon and a limited word statistics available. Results
show it outperforms the state-of-the-art approaches in
terms of both topic coherence and computational cost.
Original languageEnglish
Title of host publicationThe 2nd AAAI Workshop on Health Intelligence
Number of pages8
Publication statusE-pub ahead of print - 20 Jun 2018
Event2018 Workshop on Health Intelligence (W3PHIAI 2018) - New Orleans, United States
Duration: 2 Feb 20183 Feb 2018


Conference2018 Workshop on Health Intelligence (W3PHIAI 2018)
Country/TerritoryUnited States
CityNew Orleans

Bibliographical note

© 2018, Association for the Advancement of Artificial
Intelligence ( All rights reserved.


Dive into the research topics of 'Topical Phrase Extraction from Clinical Reports by Incorporating both Local and Global Context'. Together they form a unique fingerprint.

Cite this