Movie Reviews Naive Bayes Subjectivity Classifier for Python NLTK
Overview
This data download is a pre-trained model for a Bayesian classifier. If you do not have experience with Python NLTK, you may not be interested in this data set.
A 93.57% accurate Naive Bayes subjectivity classifier trained on IMDb plots and RottenTomatoes quotes. This classifier can be used for hierarchical sentiment analysis to determine whether text is objective or subjective before using a sentiment classifier to determine polarity. Given a feature dict of words and bigrams, it will classify the text as “quote” or “plot”, where “quote” means the text is subjective, and “plot” means the text is objective. The classifier requires Python & NLTK 2.0 and is licensed for commercial usage.
Application Gallery
Do you have an application, visualization or otherwise great use of this data?
Submit it now, and be featured here!
Infochimps Platform
Use this data on the Infochimps Big Data Platform to unlock:
- Advanced analytical capabilities
- Hosting for customer databases
- Access to tools such as Hadoop, Pig, and R
- …and more to come!
Tags
Stats
| Added by: | japerk | |
|---|---|---|
| Link: | ||
| Created: | 9 months ago | |
| Updated: | 20 days ago | |
Share
