A gloss composition and context clustering based distributed word sense representation model

Tao Chen; Ruifeng Xu; Yulan He; Xuan Wang

doi:10.3390/e17096007

Tao Chen, Ruifeng Xu^*, Yulan He, Xuan Wang

^*Corresponding author for this work

Computer Science Research Group

Research output: Contribution to journal › Article › peer-review

Abstract

In recent years, there has been increasing interest in learning distributed representations of word senses. Traditional context clustering based models usually require careful tuning of model parameters and typically perform worse on infrequent word senses. This paper presents a novel approach which addresses these limitations by first initializing the word sense embeddings through learning sentence-level embeddings from WordNet glosses using a convolutional neural network. The initialized word sense embeddings are then used by a context clustering based model to generate the distributed representations of word senses. Our learned representations outperform the publicly available embeddings on half of the metrics in the word similarity task and on 6 out of 13 subtasks in the analogical reasoning task, and give the best overall accuracy in the word sense effect classification task, which shows the effectiveness of our proposed distributed representation learning model.
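
The two-stage idea in the abstract (initialize sense vectors from glosses, then refine them by clustering contexts) can be illustrated with a minimal sketch. This is not the authors' implementation: the paper composes glosses with a convolutional neural network, whereas the sketch swaps in a plain average of word vectors. The toy vectors, the paraphrased glosses, the sense labels, and the function names `init_senses` and `assign_and_update` are all assumptions made up for illustration.

```python
import math

def mean(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

# Toy 2-d word vectors (invented values, not trained embeddings).
WORD_VECS = {
    "money": [1.0, 0.0], "deposit": [0.9, 0.1], "financial": [1.0, 0.1],
    "river": [0.0, 1.0], "water": [0.1, 0.9], "slope": [0.0, 0.8],
}

# Hypothetical glosses for two senses of "bank" (paraphrased, not WordNet text).
GLOSSES = {
    "bank#finance": ["financial", "deposit", "money"],
    "bank#river": ["slope", "river", "water"],
}

def init_senses(glosses, word_vecs):
    """Stage 1 stand-in: compose each gloss by averaging its word vectors.
    (The paper uses a CNN here; averaging is a simplification.)"""
    return {s: mean([word_vecs[w] for w in words]) for s, words in glosses.items()}

def assign_and_update(senses, context_words, word_vecs, lr=0.1):
    """Stage 2 stand-in: assign an occurrence to the nearest sense by cosine
    similarity, then move that sense vector slightly toward the context."""
    known = [word_vecs[w] for w in context_words if w in word_vecs]
    if not known:
        raise ValueError("no context word has a vector")
    ctx = mean(known)
    best = max(senses, key=lambda s: cosine(senses[s], ctx))
    senses[best] = [(1 - lr) * a + lr * b for a, b in zip(senses[best], ctx)]
    return best

senses = init_senses(GLOSSES, WORD_VECS)
print(assign_and_update(senses, ["river", "water"], WORD_VECS))
```

Starting the clusters from gloss-derived vectors, rather than from random points, gives every sense a meaningful starting position even when few contexts of that sense are observed, which is the motivation the abstract gives for handling infrequent senses.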

Original language	English
Pages (from-to)	6007-6024
Number of pages	18
Journal	Entropy
Volume	17
Issue number	9
DOIs	https://doi.org/10.3390/e17096007
Publication status	Published - 27 Aug 2015

Bibliographical note

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Keywords

distributed representation
lexical semantic compositionality
natural language processing
word sense disambiguation

Access to Document

10.3390/e17096007

Gloss composition and context clustering based distributed word sense representation model
Final published version, 391 KB

http://www.mdpi.com/1099-4300/17/9/6007

Cite this

@article{0c88804009b74ddaa2f63edc45f2c5d2,
title = "A gloss composition and context clustering based distributed word sense representation model",
abstract = "In recent years, there has been increasing interest in learning distributed representations of word senses. Traditional context clustering based models usually require careful tuning of model parameters and typically perform worse on infrequent word senses. This paper presents a novel approach which addresses these limitations by first initializing the word sense embeddings through learning sentence-level embeddings from WordNet glosses using a convolutional neural network. The initialized word sense embeddings are then used by a context clustering based model to generate the distributed representations of word senses. Our learned representations outperform the publicly available embeddings on half of the metrics in the word similarity task and on 6 out of 13 subtasks in the analogical reasoning task, and give the best overall accuracy in the word sense effect classification task, which shows the effectiveness of our proposed distributed representation learning model.",
keywords = "distributed representation, lexical semantic compositionality, natural language processing, word sense disambiguation",
author = "Tao Chen and Ruifeng Xu and Yulan He and Xuan Wang",
note = "This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.",
year = "2015",
month = aug,
day = "27",
doi = "10.3390/e17096007",
language = "English",
volume = "17",
number = "9",
pages = "6007--6024",
journal = "Entropy",
issn = "1099-4300",
publisher = "MDPI AG",
}

TY  - JOUR
T1  - A gloss composition and context clustering based distributed word sense representation model
AU  - Chen, Tao
AU  - Xu, Ruifeng
AU  - He, Yulan
AU  - Wang, Xuan
N1  - This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
PY  - 2015/8/27
Y1  - 2015/8/27
N2  - In recent years, there has been increasing interest in learning distributed representations of word senses. Traditional context clustering based models usually require careful tuning of model parameters and typically perform worse on infrequent word senses. This paper presents a novel approach which addresses these limitations by first initializing the word sense embeddings through learning sentence-level embeddings from WordNet glosses using a convolutional neural network. The initialized word sense embeddings are then used by a context clustering based model to generate the distributed representations of word senses. Our learned representations outperform the publicly available embeddings on half of the metrics in the word similarity task and on 6 out of 13 subtasks in the analogical reasoning task, and give the best overall accuracy in the word sense effect classification task, which shows the effectiveness of our proposed distributed representation learning model.
AB  - In recent years, there has been increasing interest in learning distributed representations of word senses. Traditional context clustering based models usually require careful tuning of model parameters and typically perform worse on infrequent word senses. This paper presents a novel approach which addresses these limitations by first initializing the word sense embeddings through learning sentence-level embeddings from WordNet glosses using a convolutional neural network. The initialized word sense embeddings are then used by a context clustering based model to generate the distributed representations of word senses. Our learned representations outperform the publicly available embeddings on half of the metrics in the word similarity task and on 6 out of 13 subtasks in the analogical reasoning task, and give the best overall accuracy in the word sense effect classification task, which shows the effectiveness of our proposed distributed representation learning model.
KW  - distributed representation
KW  - lexical semantic compositionality
KW  - natural language processing
KW  - word sense disambiguation
UR  - http://www.scopus.com/inward/record.url?scp=84945115074&partnerID=8YFLogxK
U2  - 10.3390/e17096007
DO  - 10.3390/e17096007
M3  - Article
AN  - SCOPUS:84945115074
SN  - 1099-4300
VL  - 17
SP  - 6007
EP  - 6024
JO  - Entropy
JF  - Entropy
IS  - 9
ER  -