TY - JOUR
T1 - Detecting sarcasm in customer tweets
T2 - An NLP based approach
AU - Mukherjee, Shubhadeep
AU - Bala, Pradip Kumar
PY - 2017/7/10
Y1 - 2017/7/10
N2 - Purpose - The purpose of this paper is to study sarcasm in online text - specifically on twitter - to better understand customer opinions about social issues, products, services, etc. This can be immensely helpful in reducing incorrect classification of consumer sentiment toward issues, products and services. Design/methodology/approach - In this study, 5,000 tweets were downloaded and analyzed. Relevant features were extracted and supervised learning algorithms were applied to identify the best differentiating features between a sarcastic and non-sarcastic sentence. Findings - The results using two different classification algorithms, namely, Naïve Bayes and maximum entropy show that function words and content words together are most effective in identifying sarcasm in tweets. The most differentiating features between a sarcastic and a non-sarcastic tweet were identified. Practical implications - Understanding the use of sarcasm in tweets let companies do better sentiment analysis and product recommendations for users. This could help businesses attract new customers and retain the old ones resulting in better customer management. Originality/value - This paper uses novel features to identify sarcasm in online text which is one of the most challenging problems in natural language processing. To the authors' knowledge, this is the first study on sarcasm detection from a customer management perspective.
AB - Purpose - The purpose of this paper is to study sarcasm in online text - specifically on twitter - to better understand customer opinions about social issues, products, services, etc. This can be immensely helpful in reducing incorrect classification of consumer sentiment toward issues, products and services. Design/methodology/approach - In this study, 5,000 tweets were downloaded and analyzed. Relevant features were extracted and supervised learning algorithms were applied to identify the best differentiating features between a sarcastic and non-sarcastic sentence. Findings - The results using two different classification algorithms, namely, Naïve Bayes and maximum entropy show that function words and content words together are most effective in identifying sarcasm in tweets. The most differentiating features between a sarcastic and a non-sarcastic tweet were identified. Practical implications - Understanding the use of sarcasm in tweets let companies do better sentiment analysis and product recommendations for users. This could help businesses attract new customers and retain the old ones resulting in better customer management. Originality/value - This paper uses novel features to identify sarcasm in online text which is one of the most challenging problems in natural language processing. To the authors' knowledge, this is the first study on sarcasm detection from a customer management perspective.
KW - Artificial intelligence
KW - Data mining Business intelligence
KW - Natural language processing
KW - Sarcasm detection
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=85021913757&partnerID=8YFLogxK
UR - https://www.emerald.com/insight/content/doi/10.1108/IMDS-06-2016-0207/full/html
U2 - 10.1108/IMDS-06-2016-0207
DO - 10.1108/IMDS-06-2016-0207
M3 - Article
AN - SCOPUS:85021913757
SN - 0263-5577
VL - 117
SP - 1109
EP - 1126
JO - Industrial Management and Data Systems
JF - Industrial Management and Data Systems
IS - 6
ER -