AWESSOME: An Unsupervised Sentiment Intensity Scoring Framework Using Neural Word Embeddings

Amal Htait; Leif Azzopardi

doi:10.1007/978-3-030-72240-1_56

AWESSOME: An Unsupervised Sentiment Intensity Scoring Framework Using Neural Word Embeddings

Amal Htait, Leif Azzopardi

Research output: Chapter in Book/Published conference output › Chapter

Abstract

Sentiment analysis (SA) is the key element for a variety of opinion and attitude mining tasks. While various unsupervised SA tools already exist, a central problem is that they are lexicon-based where the lexicons used are limited, leading to a vocabulary mismatch. In this paper, we present an unsupervised word embedding-based sentiment scoring framework for sentiment intensity scoring (SIS). The framework generalizes and combines past works so that pre-existing lexicons (e.g. VADER, LabMT) and word embeddings (e.g. BERT, RoBERTa) can be used to address this problem, with no require training, and while providing fine grained SIS of words and phrases. The framework is scalable and extensible, so that custom lexicons or word embeddings can be used to core methods, and to even create new corpus specific lexicons without the need for extensive supervised learning and retraining. The Python 3 toolkit is open source, freely available from GitHub (https://github. com/cumulative-revelations/awessome) and can be directly installed via pip install awessome.

Original language	Undefined/Unknown
Title of host publication	European Conference on Information Retrieval
Subtitle of host publication	Advances in Information Retrieval
Publisher	Springer
Pages	509-513
Number of pages	5
DOIs	https://doi.org/10.1007/978-3-030-72240-1_56
Publication status	Published - 30 Mar 2021
Event	43rd European Conference on IR Research - Virtual - online Duration: 28 Mar 2021 → 1 Apr 2021

Publication series

Name	Lecture Notes in Computer Science (LNCS)
Publisher	Springer
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	43rd European Conference on IR Research
Abbreviated title	ECIR 2021
Period	28/03/21 → 1/04/21

Access to Document

10.1007/978-3-030-72240-1_56

Cite this

@inbook{2b16c21760c24e479d399457f4b43b39,

title = "AWESSOME: An Unsupervised Sentiment Intensity Scoring Framework Using Neural Word Embeddings",

abstract = "Sentiment analysis (SA) is the key element for a variety of opinion and attitude mining tasks. While various unsupervised SA tools already exist, a central problem is that they are lexicon-based where the lexicons used are limited, leading to a vocabulary mismatch. In this paper, we present an unsupervised word embedding-based sentiment scoring framework for sentiment intensity scoring (SIS). The framework generalizes and combines past works so that pre-existing lexicons (e.g. VADER, LabMT) and word embeddings (e.g. BERT, RoBERTa) can be used to address this problem, with no require training, and while providing fine grained SIS of words and phrases. The framework is scalable and extensible, so that custom lexicons or word embeddings can be used to core methods, and to even create new corpus specific lexicons without the need for extensive supervised learning and retraining. The Python 3 toolkit is open source, freely available from GitHub (https://github. com/cumulative-revelations/awessome) and can be directly installed via pip install awessome.",

author = "Amal Htait and Leif Azzopardi",

year = "2021",

month = mar,

day = "30",

doi = "10.1007/978-3-030-72240-1_56",

language = "Undefined/Unknown",

series = "Lecture Notes in Computer Science (LNCS)",

publisher = "Springer",

pages = "509--513",

booktitle = "European Conference on Information Retrieval",

address = "Germany",

note = "43rd European Conference on IR Research ; Conference date: 28-03-2021 Through 01-04-2021",

}

TY - CHAP

T1 - AWESSOME: An Unsupervised Sentiment Intensity Scoring Framework Using Neural Word Embeddings

AU - Htait, Amal

AU - Azzopardi, Leif

PY - 2021/3/30

Y1 - 2021/3/30

N2 - Sentiment analysis (SA) is the key element for a variety of opinion and attitude mining tasks. While various unsupervised SA tools already exist, a central problem is that they are lexicon-based where the lexicons used are limited, leading to a vocabulary mismatch. In this paper, we present an unsupervised word embedding-based sentiment scoring framework for sentiment intensity scoring (SIS). The framework generalizes and combines past works so that pre-existing lexicons (e.g. VADER, LabMT) and word embeddings (e.g. BERT, RoBERTa) can be used to address this problem, with no require training, and while providing fine grained SIS of words and phrases. The framework is scalable and extensible, so that custom lexicons or word embeddings can be used to core methods, and to even create new corpus specific lexicons without the need for extensive supervised learning and retraining. The Python 3 toolkit is open source, freely available from GitHub (https://github. com/cumulative-revelations/awessome) and can be directly installed via pip install awessome.

AB - Sentiment analysis (SA) is the key element for a variety of opinion and attitude mining tasks. While various unsupervised SA tools already exist, a central problem is that they are lexicon-based where the lexicons used are limited, leading to a vocabulary mismatch. In this paper, we present an unsupervised word embedding-based sentiment scoring framework for sentiment intensity scoring (SIS). The framework generalizes and combines past works so that pre-existing lexicons (e.g. VADER, LabMT) and word embeddings (e.g. BERT, RoBERTa) can be used to address this problem, with no require training, and while providing fine grained SIS of words and phrases. The framework is scalable and extensible, so that custom lexicons or word embeddings can be used to core methods, and to even create new corpus specific lexicons without the need for extensive supervised learning and retraining. The Python 3 toolkit is open source, freely available from GitHub (https://github. com/cumulative-revelations/awessome) and can be directly installed via pip install awessome.

UR - https://doi.org/10.1007/978-3-030-72240-1_56

U2 - 10.1007/978-3-030-72240-1_56

DO - 10.1007/978-3-030-72240-1_56

M3 - Chapter

T3 - Lecture Notes in Computer Science (LNCS)

SP - 509

EP - 513

BT - European Conference on Information Retrieval

PB - Springer

T2 - 43rd European Conference on IR Research

Y2 - 28 March 2021 through 1 April 2021

ER -

AWESSOME: An Unsupervised Sentiment Intensity Scoring Framework Using Neural Word Embeddings

Abstract

Publication series

Conference

Access to Document

Other files and links

Cite this