Contextual semantics for sentiment analysis of Twitter

Hassan Saif; Yulan He; Miriam Fernández; Harith Alani

doi:10.1016/j.ipm.2015.01.005

Contextual semantics for sentiment analysis of Twitter

Hassan Saif^*, Yulan He, Miriam Fernández, Harith Alani

^*Corresponding author for this work

Computer Science Research Group

Research output: Contribution to journal › Article › peer-review

Abstract

Sentiment analysis on Twitter has attracted much attention recently due to its wide applications in both, commercial and public sectors. In this paper we present SentiCircles, a lexicon-based approach for sentiment analysis on Twitter. Different from typical lexicon-based approaches, which offer a fixed and static prior sentiment polarities of words regardless of their context, SentiCircles takes into account the co-occurrence patterns of words in different contexts in tweets to capture their semantics and update their pre-assigned strength and polarity in sentiment lexicons accordingly. Our approach allows for the detection of sentiment at both entity-level and tweet-level. We evaluate our proposed approach on three Twitter datasets using three different sentiment lexicons to derive word prior sentiments. Results show that our approach significantly outperforms the baselines in accuracy and F-measure for entity-level subjectivity (neutral vs. polar) and polarity (positive vs. negative) detections. For tweet-level sentiment detection, our approach performs better than the state-of-the-art SentiStrength by 4-5% in accuracy in two datasets, but falls marginally behind by 1% in F-measure in the third dataset.

Original language	English
Pages (from-to)	5-19
Number of pages	19
Journal	Information Processing and Management
Volume	52
Issue number	1
Early online date	7 Mar 2015
DOIs	https://doi.org/10.1016/j.ipm.2015.01.005
Publication status	Published - Jan 2016

Bibliographical note

© 2015, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/

Funding: EU-FP7 project SENSE4US (Grant No. 611242); Shenzhen International Cooperation Research Funding (Grant No. GJHZ20120613110641217).

Keywords

contextual semantics
sentiment analysis
Twitter

Access to Document

10.1016/j.ipm.2015.01.005

Contextual semantics for sentiment analysis of Twitter
© 2015, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/
Accepted author manuscript, 611 KBLicence: CC BY-NC-ND 3.0

Cite this

@article{c4bee73d5d674f12bbb0c4852f02f9a5,

title = "Contextual semantics for sentiment analysis of Twitter",

abstract = "Sentiment analysis on Twitter has attracted much attention recently due to its wide applications in both, commercial and public sectors. In this paper we present SentiCircles, a lexicon-based approach for sentiment analysis on Twitter. Different from typical lexicon-based approaches, which offer a fixed and static prior sentiment polarities of words regardless of their context, SentiCircles takes into account the co-occurrence patterns of words in different contexts in tweets to capture their semantics and update their pre-assigned strength and polarity in sentiment lexicons accordingly. Our approach allows for the detection of sentiment at both entity-level and tweet-level. We evaluate our proposed approach on three Twitter datasets using three different sentiment lexicons to derive word prior sentiments. Results show that our approach significantly outperforms the baselines in accuracy and F-measure for entity-level subjectivity (neutral vs. polar) and polarity (positive vs. negative) detections. For tweet-level sentiment detection, our approach performs better than the state-of-the-art SentiStrength by 4-5% in accuracy in two datasets, but falls marginally behind by 1% in F-measure in the third dataset.",

keywords = "contextual semantics, sentiment analysis, Twitter",

author = "Hassan Saif and Yulan He and Miriam Fern{\'a}ndez and Harith Alani",

note = "{\textcopyright} 2015, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/ Funding: EU-FP7 project SENSE4US (Grant No. 611242); Shenzhen International Cooperation Research Funding (Grant No. GJHZ20120613110641217).",

year = "2016",

month = jan,

doi = "10.1016/j.ipm.2015.01.005",

language = "English",

volume = "52",

pages = "5--19",

journal = "Information Processing and Management",

issn = "0306-4573",

publisher = "Elsevier",

number = "1",

}

TY - JOUR

T1 - Contextual semantics for sentiment analysis of Twitter

AU - Saif, Hassan

AU - He, Yulan

AU - Fernández, Miriam

AU - Alani, Harith

N1 - © 2015, Elsevier. Licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International http://creativecommons.org/licenses/by-nc-nd/4.0/ Funding: EU-FP7 project SENSE4US (Grant No. 611242); Shenzhen International Cooperation Research Funding (Grant No. GJHZ20120613110641217).

PY - 2016/1

Y1 - 2016/1

N2 - Sentiment analysis on Twitter has attracted much attention recently due to its wide applications in both, commercial and public sectors. In this paper we present SentiCircles, a lexicon-based approach for sentiment analysis on Twitter. Different from typical lexicon-based approaches, which offer a fixed and static prior sentiment polarities of words regardless of their context, SentiCircles takes into account the co-occurrence patterns of words in different contexts in tweets to capture their semantics and update their pre-assigned strength and polarity in sentiment lexicons accordingly. Our approach allows for the detection of sentiment at both entity-level and tweet-level. We evaluate our proposed approach on three Twitter datasets using three different sentiment lexicons to derive word prior sentiments. Results show that our approach significantly outperforms the baselines in accuracy and F-measure for entity-level subjectivity (neutral vs. polar) and polarity (positive vs. negative) detections. For tweet-level sentiment detection, our approach performs better than the state-of-the-art SentiStrength by 4-5% in accuracy in two datasets, but falls marginally behind by 1% in F-measure in the third dataset.

AB - Sentiment analysis on Twitter has attracted much attention recently due to its wide applications in both, commercial and public sectors. In this paper we present SentiCircles, a lexicon-based approach for sentiment analysis on Twitter. Different from typical lexicon-based approaches, which offer a fixed and static prior sentiment polarities of words regardless of their context, SentiCircles takes into account the co-occurrence patterns of words in different contexts in tweets to capture their semantics and update their pre-assigned strength and polarity in sentiment lexicons accordingly. Our approach allows for the detection of sentiment at both entity-level and tweet-level. We evaluate our proposed approach on three Twitter datasets using three different sentiment lexicons to derive word prior sentiments. Results show that our approach significantly outperforms the baselines in accuracy and F-measure for entity-level subjectivity (neutral vs. polar) and polarity (positive vs. negative) detections. For tweet-level sentiment detection, our approach performs better than the state-of-the-art SentiStrength by 4-5% in accuracy in two datasets, but falls marginally behind by 1% in F-measure in the third dataset.

KW - contextual semantics

KW - sentiment analysis

KW - Twitter

UR - http://www.scopus.com/inward/record.url?scp=84924110504&partnerID=8YFLogxK

U2 - 10.1016/j.ipm.2015.01.005

DO - 10.1016/j.ipm.2015.01.005

M3 - Article

AN - SCOPUS:84924110504

SN - 0306-4573

VL - 52

SP - 5

EP - 19

JO - Information Processing and Management

JF - Information Processing and Management

IS - 1

ER -

Contextual semantics for sentiment analysis of Twitter

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this