What Size of Language Unit Is More Appropriate for Text Summarization?

Mengyun Cao; Hai Zhuge

doi:10.1109/SKG.2018.00036

Computer Science Research Group

Research output: Chapter in Book/Published conference output › Conference publication

Abstract

Extractive text summarization finds the important sentences in a text and concatenates them into a summary. However, sentences selected according to ranking rules are usually not coherent. Is a larger language unit, such as a group of sentences or a paragraph, more appropriate to select for summarization? This paper answers this question. Investigating the summarization algorithm based on ranking semantic link networks of texts, we obtain three results: 1) compared with summaries composed of sentences, summaries composed of larger language units have similar ROUGE scores but better readability; 2) using a group of sentences is more effective than using a single sentence or a paragraph; and 3) the quality of summaries composed of sentence groups improves as the average length of the source texts increases.

Original language	English
Title of host publication	Proceedings - 2018 14th International Conference on Semantics, Knowledge and Grids, SKG 2018
Publisher	IEEE
Pages	196-202
Number of pages	7
ISBN (Electronic)	978-1-7281-0441-6
ISBN (Print)	978-1-7281-0442-3
DOIs	https://doi.org/10.1109/SKG.2018.00036
Publication status	Published - 2 May 2019
Event	2018 14th International Conference on Semantics, Knowledge and Grids (SKG), Guangzhou, China
Duration	12 Sept 2018 → 14 Sept 2018

Publication series

Name	2018 14th International Conference on Semantics, Knowledge and Grids (SKG)
Publisher	IEEE
ISSN (Electronic)	2325-0623

Conference

Conference	2018 14th International Conference on Semantics, Knowledge and Grids (SKG)
Period	12/09/18 → 14/09/18

Keywords

ranking
semantic link network
text summarization

Cite this

@inproceedings{ded8fdcb8017408098fa363f7255db4a,
  title = "What Size of Language Unit Is More Appropriate for Text Summarization?",
  abstract = "Extractive text summarization finds the important sentences in a text and concatenates them into a summary. However, sentences selected according to ranking rules are usually not coherent. Is a larger language unit, such as a group of sentences or a paragraph, more appropriate to select for summarization? This paper answers this question. Investigating the summarization algorithm based on ranking semantic link networks of texts, we obtain three results: 1) compared with summaries composed of sentences, summaries composed of larger language units have similar ROUGE scores but better readability; 2) using a group of sentences is more effective than using a single sentence or a paragraph; and 3) the quality of summaries composed of sentence groups improves as the average length of the source texts increases.",
  keywords = "ranking, semantic link network, text summarization",
  author = "Mengyun Cao and Hai Zhuge",
  year = "2019",
  month = may,
  day = "2",
  doi = "10.1109/SKG.2018.00036",
  language = "English",
  isbn = "978-1-7281-0442-3",
  series = "2018 14th International Conference on Semantics, Knowledge and Grids (SKG)",
  publisher = "IEEE",
  pages = "196--202",
  booktitle = "Proceedings - 2018 14th International Conference on Semantics, Knowledge and Grids, SKG 2018",
  address = "United States",
  note = "2018 14th International Conference on Semantics, Knowledge and Grids (SKG) ; Conference date: 12-09-2018 Through 14-09-2018",
}

Cao, M & Zhuge, H 2019, What Size of Language Unit Is More Appropriate for Text Summarization? in Proceedings - 2018 14th International Conference on Semantics, Knowledge and Grids, SKG 2018, 8703948, 2018 14th International Conference on Semantics, Knowledge and Grids (SKG), IEEE, pp. 196-202, 2018 14th International Conference on Semantics, Knowledge and Grids (SKG), 12/09/18. https://doi.org/10.1109/SKG.2018.00036

What Size of Language Unit Is More Appropriate for Text Summarization? / Cao, Mengyun; Zhuge, Hai.
Proceedings - 2018 14th International Conference on Semantics, Knowledge and Grids, SKG 2018. IEEE, 2019. p. 196-202 8703948 (2018 14th International Conference on Semantics, Knowledge and Grids (SKG)).
