Sentiment classification Survey

Source: Internet
Author: User
Sentiment classification Survey
This is a preliminary survey of the important sentiment classification papers. It involves only a few papers and summarizes the basic and general methods, mainly the summary of Pang Bo-based research work, the following is a summary of the English version. Baseline algorithm • produce a list of sentiment words byintrospection and rely on them alone to classify the texts
Baseline algorithm • Algorithm Used in advance's paperto predict the polarity of user interaction • combine three popolar tries to get an English lexicon • check whether there are more important words or more negative sentiment words in the expressionmachinelearning Methods • Na has ve Bayes • Maximum Entropy Model • Support Vector Machines • Need labeled data- Http://www.cs.cornell.edu/People/pabo/movie-review-data/Na keep ve Bayes • to assign to a given document d the class C * • Bayes 'rule' • Nb classifier • Simple but have high predictive powermaximumentropy model • Estimate of P (c | D)  Takes the following exponential form • Fi, cIs Feature/class functionForfeature
FiAnd class C, Defined as follows • Toolkit: ZHANG Le's (2004) package maximum entropymodeling toolkit for python and C ++ supportvector machines • given a set of training data, the svmclassifier finds the hyper plane such that each training point is correctlyclassified and the hyper plane is as far as possible from the points closest
Toit • Toolkit: svmlight, libsvm, pyml considerationabout features • unigrams. bigrams. both • feature frequency. presence • POS tags • Position of words • negation words • TF-IDF • adjectives and verbs Pang's result
Majorityvoting • combining Bayes, maxent, and svmclassifiers over the same data provided a three to four percents boost over thebest of the individual classifiers alone. integratinga sentence classifier with language model • run on each sentence of the review toobtain a demo-of "positive" or "negative ". • then the sentences are used to "Vote" Thereview as negative or positive on the basis of their probability scores. • After that the decisions of voting fromthe sentence classifier and the review classifier are combined together usingoptimum weights. starting with information from sentence level the sentence classifier • also based on Na naive ve Bayes algorithm • run on the 5331 negative and 5331 positive sentences vote toclassify reviews • the majority vote is used to classify Thereview. • Using the actual scores of positificandnegatificcomputed for the sentence gets a considerable 1.9% improvement inperformance. weightingsentences by positions • weighting the sentence by their positionin the review. • Providing more weights to sentences thatare towards the beginning and end of a review. ineffecate "Focus" of a sentence • "Focus" here means whether a givensentence is talking about a movie or not. • Two approaches:-Check if asentence contains words such as this movie, the plot, the actors, etc. -Check if a sentence consists of a movie name. discussion • using classification of sentences inaddition to the NA provided ve Bayes model increased accuracy of the system by 5.5%. • Use other algorithms like maxent, or svmfor the sentence classifier. • divide the review into paragraphs andprovide different weights to different paragraphs according to positions.
References • 1. Bo Pang, Lillian Lee, and shivakumarvaithyanw.2002. Thumbs up? Sentiment classification using machine learningtechniques. (Proceedings of emnlp ). • 2. soo-min Kim etal. 2004. determining the sentiment ofopinions. (Proceedings of coling ). • 3. sunil Khanal. 2010. sentiment classification using language models andsentence position information. (Stanford CS 224n final project) • 4. micel galley etal.2004.identifying agreement and disagreement in conversational speech: Use ofbayesian networks to model pragmatic dependencies. (Proceedings of ACL)

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.