TY - GEN
T1 - Learning task specific distributed paragraph representations using a 2-tier convolutional neural network
AU - Chen, Tao
AU - Xu, Ruifeng
AU - He, Yulan
AU - Wang, Xuan
PY - 2015/11/12
Y1 - 2015/11/12
N2 - We introduce a 2-tier convolutional neural network model for learning distributed paragraph representations for a specific task (e.g., paragraph- or short-document-level sentiment analysis and text topic categorization). We decompose paragraph semantics into three cascaded constituents: word representation, sentence composition and document composition. Specifically, we learn distributed word representations with a continuous bag-of-words model from a large unstructured text corpus. Then, using these word representations as pre-trained vectors, distributed task-specific sentence representations are learned from a sentence-level corpus with task-specific labels by the first tier of our model. Using these sentence representations as inputs, distributed paragraph representations are learned from a paragraph-level corpus by the second tier of our model. The model is evaluated on the DBpedia ontology classification dataset and the Amazon review dataset. Empirical results show the effectiveness of the proposed model for learning distributed paragraph representations.
AB - We introduce a 2-tier convolutional neural network model for learning distributed paragraph representations for a specific task (e.g., paragraph- or short-document-level sentiment analysis and text topic categorization). We decompose paragraph semantics into three cascaded constituents: word representation, sentence composition and document composition. Specifically, we learn distributed word representations with a continuous bag-of-words model from a large unstructured text corpus. Then, using these word representations as pre-trained vectors, distributed task-specific sentence representations are learned from a sentence-level corpus with task-specific labels by the first tier of our model. Using these sentence representations as inputs, distributed paragraph representations are learned from a paragraph-level corpus by the second tier of our model. The model is evaluated on the DBpedia ontology classification dataset and the Amazon review dataset. Empirical results show the effectiveness of the proposed model for learning distributed paragraph representations.
KW - convolutional neural network
KW - distributed representation
KW - natural language processing
UR - http://www.scopus.com/inward/record.url?scp=84952778703&partnerID=8YFLogxK
U2 - 10.1007/978-3-319-26532-2_51
DO - 10.1007/978-3-319-26532-2_51
M3 - Conference publication
AN - SCOPUS:84952778703
SN - 978-3-319-26531-5
T3 - Lecture Notes in Computer Science
SP - 467
EP - 475
BT - Neural Information Processing
PB - Springer
CY - Cham (CH)
T2 - 22nd International Conference on Neural Information Processing
Y2 - 9 November 2015 through 12 November 2015
ER -