The Large Movie Review Database, originally published in the paper, Learning Word Vectors for Sentiment Analysis, by Andrew L. Maas et al, can be downloaded from http://ai.stanford.edu/~amaas/data/sentiment/.
The downloaded archive contains two folders labeled train and test. For train, there are 12,500 positive reviews and 12,500 negative reviews that we will train a classifier on. The test dataset contains the same amount of positive and negative reviews for a grand total of 50,000 positive and negative reviews amongst the two files.
Let's look at an example of one review to see what the data looks like:
"Bromwell High is nothing short of brilliant. Expertly scripted and perfectly delivered, this searing parody of students and teachers at a South London Public School leaves you literally rolling with laughter. It's vulgar, provocative, witty and sharp...