Detecting sarcasm in customer tweets: An NLP based approach

Shubhadeep Mukherjee*, Pradip Kumar Bala

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review


Purpose - The purpose of this paper is to study sarcasm in online text - specifically on twitter - to better understand customer opinions about social issues, products, services, etc. This can be immensely helpful in reducing incorrect classification of consumer sentiment toward issues, products and services. Design/methodology/approach - In this study, 5,000 tweets were downloaded and analyzed. Relevant features were extracted and supervised learning algorithms were applied to identify the best differentiating features between a sarcastic and non-sarcastic sentence. Findings - The results using two different classification algorithms, namely, Naïve Bayes and maximum entropy show that function words and content words together are most effective in identifying sarcasm in tweets. The most differentiating features between a sarcastic and a non-sarcastic tweet were identified. Practical implications - Understanding the use of sarcasm in tweets let companies do better sentiment analysis and product recommendations for users. This could help businesses attract new customers and retain the old ones resulting in better customer management. Originality/value - This paper uses novel features to identify sarcasm in online text which is one of the most challenging problems in natural language processing. To the authors' knowledge, this is the first study on sarcasm detection from a customer management perspective.

Original languageEnglish
Pages (from-to)1109-1126
Number of pages18
JournalIndustrial Management and Data Systems
Issue number6
Publication statusPublished - 10 Jul 2017


  • Artificial intelligence
  • Data mining Business intelligence
  • Natural language processing
  • Sarcasm detection
  • Text mining


Dive into the research topics of 'Detecting sarcasm in customer tweets: An NLP based approach'. Together they form a unique fingerprint.

Cite this