movie_review {text2vec} | R Documentation
IMDB movie reviews
Description
The labeled dataset consists of 5000 IMDB movie reviews selected for sentiment analysis. Sentiment is binary: a review with an IMDB rating < 5 has a sentiment score of 0, and a review with a rating >= 7 has a sentiment score of 1. No individual movie has more than 30 reviews. Important note: we removed non-ASCII symbols from the original dataset to satisfy CRAN policy.
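The labeling rule above can be sketched in a few lines of R. The helper name rating_to_sentiment is hypothetical and not part of text2vec. The sketch also assumes that ratings of 5 or 6, which the rule does not cover, receive no label (NA).

```r
# Hypothetical helper (not part of text2vec): maps an IMDB rating to the
# binary sentiment label described in this help page.
rating_to_sentiment <- function(rating) {
  ifelse(rating < 5, 0L,                      # rating < 5  -> negative (0)
         ifelse(rating >= 7, 1L, NA_integer_))  # rating >= 7 -> positive (1)
}

# Ratings 5 and 6 match neither condition, so this sketch returns NA for them.
rating_to_sentiment(c(1, 4, 5, 6, 7, 10))
# 0 0 NA NA 1 1
```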
Usage
data("movie_review")
Format
A data frame with 5000 rows and 3 variables:
- id
Unique ID of each review
- sentiment
Sentiment of the review; 1 for positive reviews and 0 for negative reviews
- review
Text of the review (UTF-8)
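A minimal sketch of loading the data and checking it against the format described above. It assumes the text2vec package is installed, and the block is skipped otherwise. The choice of summaries is illustrative only.

```r
# Assumes text2vec is installed; nothing runs if it is not available.
if (requireNamespace("text2vec", quietly = TRUE)) {
  data("movie_review", package = "text2vec")

  # Documented structure: 5000 rows, columns id, sentiment, review
  str(movie_review, vec.len = 1)

  # Number of negative (0) and positive (1) reviews
  print(table(movie_review$sentiment))

  # First 80 characters of the first review
  print(substr(movie_review$review[1], 1, 80))
}
```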
Source
http://ai.stanford.edu/~amaas/data/sentiment/
[Package text2vec version 0.6.4 Index]