Learning sentiment classification model from labeled features

Yulan He

Research output: Chapter in Book/Published conference output > Other chapter contribution

Abstract

We propose a novel framework where an initial classifier is learned by incorporating prior information extracted from an existing sentiment lexicon. Preferences on expectations of sentiment labels of those lexicon words are expressed using generalized expectation criteria. Documents classified with high confidence are then used as pseudo-labeled examples for automatic domain-specific feature acquisition. The word-class distributions of such self-learned features are estimated from the pseudo-labeled examples and are used to train another classifier by constraining the model's predictions on unlabeled instances. Experiments on both the movie review data and the multi-domain sentiment dataset show that our approach attains comparable or better performance than existing weakly-supervised sentiment classification methods despite using no labeled documents.
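The feature-acquisition step described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: a simple smoothed lexicon vote stands in for the classifier trained with generalized expectation criteria, and the final step (training a second classifier by constraining predictions on unlabeled instances) is omitted. The lexicon, the threshold, the `min_count` cutoff, and all function names are hypothetical choices made for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical toy lexicon; the paper draws on an existing sentiment lexicon.
LEXICON = {"good": "pos", "great": "pos", "bad": "neg", "awful": "neg"}


def lexicon_posterior(tokens):
    """Stand-in initial classifier: Laplace-smoothed share of positive lexicon hits."""
    pos = sum(1 for t in tokens if LEXICON.get(t) == "pos")
    neg = sum(1 for t in tokens if LEXICON.get(t) == "neg")
    return (pos + 1) / (pos + neg + 2)  # rough estimate of P(pos | doc)


def pseudo_label(docs, threshold=0.7):
    """Keep only documents whose predicted class probability reaches the threshold."""
    labeled = []
    for tokens in docs:
        p_pos = lexicon_posterior(tokens)
        if p_pos >= threshold:
            labeled.append((tokens, "pos"))
        elif 1 - p_pos >= threshold:
            labeled.append((tokens, "neg"))
    return labeled


def word_class_distributions(labeled, min_count=2):
    """Estimate P(class | word) from document frequencies in pseudo-labeled docs."""
    counts = defaultdict(Counter)
    for tokens, label in labeled:
        for word in set(tokens):  # count each word once per document
            counts[word][label] += 1
    dists = {}
    for word, c in counts.items():
        total = c["pos"] + c["neg"]
        if total >= min_count:  # drop words seen in too few documents
            dists[word] = {"pos": c["pos"] / total, "neg": c["neg"] / total}
    return dists


docs = [
    "great plot and good acting a gripping film".split(),
    "good pacing great cast truly gripping".split(),
    "bad script awful dialogue a tedious film".split(),
    "awful ending bad effects utterly tedious".split(),
    "the film was released in october".split(),
]
labeled = pseudo_label(docs)
features = word_class_distributions(labeled)
```

In this toy run, words outside the lexicon such as "gripping" and "tedious" acquire skewed class distributions from the pseudo-labeled documents, while "film" stays evenly split; this mirrors, in a very reduced form, how domain-specific features could be picked up without any labeled documents.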
Original language: English
Title of host publication: Proceeding : CIKM '10 proceedings of the 19th ACM international conference on information and knowledge management
Place of Publication: New York (US)
Publisher: ACM
Pages: 1685-1688
Number of pages: 4
ISBN (Print): 978-1-4503-0099-5
Publication status: Published - 2010
Event: 19th ACM international conference on information and knowledge management, CIKM '10 - Toronto, Canada
Duration: 26 Oct 2010 to 30 Oct 2010

Conference

Conference: 19th ACM international conference on information and knowledge management, CIKM '10
Country/Territory: Canada
City: Toronto
Period: 26/10/10 to 30/10/10

Keywords

  • sentiment analysis
  • opinion mining
  • generalized expectation
  • self-learned features
  • weakly-supervised classification
