Abstract
This paper proposes an innovative graph-based text summarization model for generic single and multi-document summarization. The approach involves four unique processing stages: parsing sentences semantically using Semantic Role Labeling (SRL), grouping semantic arguments while matching semantic roles to Wikipedia concepts, constructing a weighted semantic graph for each document and linking its sentences (nodes) through the semantic relatedness of the Wikipedia concepts. An iterative ranking algorithm is then applied to the document graphs to extract the most important sentences deemed as the summary. The empirical evaluation of the proposed summarization model on a standard dataset from the Document Understanding Conference (DUC) showed the effectiveness of the approach which outperformed the baseline comparators in terms of ROUGE scores.
Original language | English |
---|---|
Title of host publication | Proceedings - 2016 IEEE 2nd International Conference on Big Data Computing Service and Applications, BigDataService 2016 |
Publisher | IEEE |
Pages | 117-120 |
Number of pages | 4 |
ISBN (Electronic) | 9781509022519 |
DOIs | |
Publication status | Published - 19 May 2016 |
Event | 2nd IEEE International Conference on Big Data Computing Service and Applications, BigDataService 2016 - Oxford, United Kingdom Duration: 29 Mar 2016 → 1 Apr 2016 |
Conference
Conference | 2nd IEEE International Conference on Big Data Computing Service and Applications, BigDataService 2016 |
---|---|
Country/Territory | United Kingdom |
City | Oxford |
Period | 29/03/16 → 1/04/16 |
Keywords
- iterative ranking algorithm
- Semantic Role Labelling
- Text summarization
- Wikipedia concepts