Sentiment Intensity Prediction using Neural Word Embeddings

Amal Htait; Leif Azzopardi

doi:10.1145/3471158.3472254

Sentiment Intensity Prediction using Neural Word Embeddings

Amal Htait, Leif Azzopardi

Research output: Chapter in Book/Published conference output › Conference publication

Abstract

Sentiment analysis is central to the process of mining opinions and attitudes from online texts. While much attention has been paid to the sentiment classification problem, much less work has tried to tackle the problem of predicting the intensity of the sentiment. The go to method is VADER --- an unsupervised lexicon based approach to scoring sentiment. However, such approaches are limited because of the vocabulary mismatch problem. In this paper, we present in detail and evaluate our AWESSOME framework (A Word Embedding Sentiment Scorer Of Many Emotions) for sentiment intensity scoring, that capitalizes on pre-existing lexicons, does not require training and provides fine grained and accurate sentiment intensity scores of words, phrases and text. In our experiments, we used seven Sentiment Collections to evaluate the proposed approach, against lexicon based approaches (e.g., VADER), and supervised methods such as deep learning based approaches (e.g., SentiBERT). The results show that despite not surpassing supervised approaches, the AWESSOME unsupervised approach significantly outperforms existing lexicon approaches and therefore provides a simple and effective approach for sentiment analysis. The AWESSOME framework can be flexibly adapted to cater for different seed lexicons and different neural word embeddings models in order to produce corpus specific lexicons -- without the need for extensive supervised learning or retraining.

Original language	English
Title of host publication	ICTIR '21 : Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval
Publisher	ACM
Pages	93-102
ISBN (Print)	9781450386111
DOIs	https://doi.org/10.1145/3471158.3472254
Publication status	Published - 31 Aug 2021

Bibliographical note

Copyright © 2021, Association for Computing Machinery. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ICTIR '21: Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval, https://doi.org/10.1145/3471158.3472254.

Access to Document

10.1145/3471158.3472254

Htait_Azzopardi_ICTIR_2021_Sentiment_intensity_prediction_using_neural_word_embeddings
Copyright © 2021, Association for Computing Machinery. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ICTIR '21: Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval, https://doi.org/10.1145/3471158.3472254.
Accepted author manuscript, 2.21 MB

Cite this

@inproceedings{4be627f063b649c993e7c6edc62ce820,

title = "Sentiment Intensity Prediction using Neural Word Embeddings",

abstract = "Sentiment analysis is central to the process of mining opinions and attitudes from online texts. While much attention has been paid to the sentiment classification problem, much less work has tried to tackle the problem of predicting the intensity of the sentiment. The go to method is VADER --- an unsupervised lexicon based approach to scoring sentiment. However, such approaches are limited because of the vocabulary mismatch problem. In this paper, we present in detail and evaluate our AWESSOME framework (A Word Embedding Sentiment Scorer Of Many Emotions) for sentiment intensity scoring, that capitalizes on pre-existing lexicons, does not require training and provides fine grained and accurate sentiment intensity scores of words, phrases and text. In our experiments, we used seven Sentiment Collections to evaluate the proposed approach, against lexicon based approaches (e.g., VADER), and supervised methods such as deep learning based approaches (e.g., SentiBERT). The results show that despite not surpassing supervised approaches, the AWESSOME unsupervised approach significantly outperforms existing lexicon approaches and therefore provides a simple and effective approach for sentiment analysis. The AWESSOME framework can be flexibly adapted to cater for different seed lexicons and different neural word embeddings models in order to produce corpus specific lexicons -- without the need for extensive supervised learning or retraining.",

author = "Amal Htait and Leif Azzopardi",

note = "Copyright {\textcopyright} 2021, Association for Computing Machinery. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ICTIR '21: Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval, https://doi.org/10.1145/3471158.3472254.",

year = "2021",

month = aug,

day = "31",

doi = "10.1145/3471158.3472254",

language = "English",

isbn = "9781450386111",

pages = "93--102",

booktitle = "ICTIR '21 : Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval",

publisher = "ACM",

address = "United States",

}

TY - GEN

T1 - Sentiment Intensity Prediction using Neural Word Embeddings

AU - Htait, Amal

AU - Azzopardi, Leif

N1 - Copyright © 2021, Association for Computing Machinery. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ICTIR '21: Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval, https://doi.org/10.1145/3471158.3472254.

PY - 2021/8/31

Y1 - 2021/8/31

N2 - Sentiment analysis is central to the process of mining opinions and attitudes from online texts. While much attention has been paid to the sentiment classification problem, much less work has tried to tackle the problem of predicting the intensity of the sentiment. The go to method is VADER --- an unsupervised lexicon based approach to scoring sentiment. However, such approaches are limited because of the vocabulary mismatch problem. In this paper, we present in detail and evaluate our AWESSOME framework (A Word Embedding Sentiment Scorer Of Many Emotions) for sentiment intensity scoring, that capitalizes on pre-existing lexicons, does not require training and provides fine grained and accurate sentiment intensity scores of words, phrases and text. In our experiments, we used seven Sentiment Collections to evaluate the proposed approach, against lexicon based approaches (e.g., VADER), and supervised methods such as deep learning based approaches (e.g., SentiBERT). The results show that despite not surpassing supervised approaches, the AWESSOME unsupervised approach significantly outperforms existing lexicon approaches and therefore provides a simple and effective approach for sentiment analysis. The AWESSOME framework can be flexibly adapted to cater for different seed lexicons and different neural word embeddings models in order to produce corpus specific lexicons -- without the need for extensive supervised learning or retraining.

AB - Sentiment analysis is central to the process of mining opinions and attitudes from online texts. While much attention has been paid to the sentiment classification problem, much less work has tried to tackle the problem of predicting the intensity of the sentiment. The go to method is VADER --- an unsupervised lexicon based approach to scoring sentiment. However, such approaches are limited because of the vocabulary mismatch problem. In this paper, we present in detail and evaluate our AWESSOME framework (A Word Embedding Sentiment Scorer Of Many Emotions) for sentiment intensity scoring, that capitalizes on pre-existing lexicons, does not require training and provides fine grained and accurate sentiment intensity scores of words, phrases and text. In our experiments, we used seven Sentiment Collections to evaluate the proposed approach, against lexicon based approaches (e.g., VADER), and supervised methods such as deep learning based approaches (e.g., SentiBERT). The results show that despite not surpassing supervised approaches, the AWESSOME unsupervised approach significantly outperforms existing lexicon approaches and therefore provides a simple and effective approach for sentiment analysis. The AWESSOME framework can be flexibly adapted to cater for different seed lexicons and different neural word embeddings models in order to produce corpus specific lexicons -- without the need for extensive supervised learning or retraining.

UR - https://pureportal.strath.ac.uk/en/publications/3c857820-e7c6-4b94-a67e-e7e5654d42c2

UR - https://dl.acm.org/doi/10.1145/3471158.3472254

U2 - 10.1145/3471158.3472254

DO - 10.1145/3471158.3472254

M3 - Conference publication

SN - 9781450386111

SP - 93

EP - 102

BT - ICTIR '21 : Proceedings of the 2021 ACM SIGIR International Conference on Theory of Information Retrieval

PB - ACM

ER -

Sentiment Intensity Prediction using Neural Word Embeddings

Abstract

Bibliographical note

Access to Document

Other files and links

Fingerprint

Cite this