What Size of Language Unit Is More Appropriate for Text Summarization?

Mengyun Cao, Hai Zhuge

Research output: Chapter in Book/Published conference outputConference publication

Abstract

Extractive text summarization is to find the important sentences from texts and concatenates these sentences as a summary. However, sentences selected according to ranking rules are usually not coherent. Is a larger language unit such as a group of sentences or a paragraph more appropriate to be selected for summarization? This paper is to answer this question. Investigating the summarization algorithm based on ranking semantic link networks of texts, we find the following three results: 1) comparing with the summaries composed by sentences, the summaries composed by larger language units have similar ROUGE scores but have better readability; 2) using a group of sentences is more effective than using sentence and paragraph; and, 3) the quality of summaries composed by group becomes better when the average length of the source texts increases.
Original languageEnglish
Title of host publicationProceedings - 2018 14th International Conference on Semantics, Knowledge and Grids, SKG 2018
PublisherIEEE
Pages196-202
Number of pages7
ISBN (Electronic)978-1-7281-0441-6
ISBN (Print)978-1-7281-0442-3
DOIs
Publication statusPublished - 2 May 2019
Event2018 14th International Conference on Semantics, Knowledge and Grids (SKG) - Guangzhou, China
Duration: 12 Sept 201814 Sept 2018

Publication series

Name2018 14th International Conference on Semantics, Knowledge and Grids (SKG)
PublisherIEEE
ISSN (Electronic)2325-0623

Conference

Conference2018 14th International Conference on Semantics, Knowledge and Grids (SKG)
Period12/09/1814/09/18

Keywords

  • ranking
  • semantic link network
  • text summarization

Fingerprint

Dive into the research topics of 'What Size of Language Unit Is More Appropriate for Text Summarization?'. Together they form a unique fingerprint.

Cite this