Semantic Analysis of Movie Reviews for Rating Prediction
CS 224N
Laureen Lam
Project Overview
- Problem Description
- Applications
- Related Work: "Thumbs up? Sentiment classification using machine learning techniques", by Pang & Lee
- Solution Steps:
  - Data
  - Classifier
  - Training / Testing with Features
- Results
- Future Work
Solution Steps
- Data: http://www.cs.cornell.edu/people/pabo/movie-review-data/
  - Polarity Dataset v2.0 (+ / - sentiment ratings)
  - Scaled Dataset v1.0 (0-, 1-, 2-star ratings)
- Classifier: Modified MaxEnt
- Training / Testing: 80 / 20 split of datasets
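The steps above can be sketched roughly as follows. This is a minimal illustration, not the project's implementation: the slides do not say how the MaxEnt classifier was modified or which features were used, so the code shows a plain MaxEnt (multinomial logistic regression) model trained by stochastic gradient ascent. The unigram-presence features, the function names (`split_80_20`, `features`, `train_maxent`, `classify`), and the `epochs` and `lr` values are all assumptions made for the example.

```python
import math
import random
from collections import defaultdict

def split_80_20(examples, seed=0):
    """Shuffle (text, label) pairs and split them 80% train / 20% test."""
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    cut = int(0.8 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]

def features(text):
    """Hypothetical feature set: which lowercase unigrams are present."""
    return set(text.lower().split())

def predict_probs(weights, labels, feats):
    """Softmax over per-label sums of (feature, label) weights."""
    scores = [sum(weights[(f, y)] for f in feats) for y in labels]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def train_maxent(train, labels, epochs=50, lr=0.5):
    """Plain (unmodified) MaxEnt trained by stochastic gradient ascent."""
    weights = defaultdict(float)
    for _ in range(epochs):
        for text, gold in train:
            feats = features(text)
            probs = predict_probs(weights, labels, feats)
            for y, p in zip(labels, probs):
                # Log-likelihood gradient: observed count minus expected count.
                grad = (1.0 if y == gold else 0.0) - p
                for f in feats:
                    weights[(f, y)] += lr * grad
    return weights

def classify(weights, labels, text):
    """Return the most probable label for a review."""
    probs = predict_probs(weights, labels, features(text))
    return labels[probs.index(max(probs))]
```

The same functions would apply to either dataset by changing `labels`, for example `["+", "-"]` for the polarity data or `["0", "1", "2"]` for the scaled data.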
Polarity Dataset Results
Scaled Dataset Results
Comparison to Pang & Lee
Future Work
- Combine MaxEnt with other classifiers
  - Decision Trees
  - SVM
- Layer classifiers
  - Spread feature sets among MaxEnt classifiers (serial or parallel pipeline)
  - Use top-level classifier to combine results
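One way the parallel layering idea might look is sketched below. The slides propose a top-level classifier but do not specify it, so this example swaps in a simple majority vote as a stand-in for a trained combiner. The three base classifiers are hypothetical keyword stubs standing in for MaxEnt classifiers that would each be trained on a different feature set.

```python
from collections import Counter

def combine_by_vote(base_classifiers, text):
    """Parallel pipeline: each base classifier labels the text, majority wins.

    Majority voting is an assumed stand-in for the top-level classifier.
    """
    votes = [clf(text) for clf in base_classifiers]
    return Counter(votes).most_common(1)[0][0]

# Hypothetical stubs; real base classifiers would be trained models.
def unigram_clf(text):
    return "+" if "good" in text else "-"

def bigram_clf(text):
    return "+" if "not bad" in text else "-"

def length_clf(text):
    return "+" if len(text.split()) > 3 else "-"

label = combine_by_vote([unigram_clf, bigram_clf, length_clf],
                        "a good film overall")
```

A serial pipeline would instead pass each classifier's output to the next one as an extra feature. A trained combiner could also replace the vote by learning how much to trust each base classifier.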