Project information can be found at Kaggle.
This mini-project is a practice exercise in using a Bag-of-Words representation with a Random Forest classifier to learn from IMDB review text and predict review sentiment labels.
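As a rough illustration of the Bag-of-Words step only, here is a minimal sketch in plain Python. It is not the notebook's code: the helper names (`tokenize`, `build_vocabulary`, `vectorize`), the `max_features` parameter, and the two toy reviews are all assumptions made for this example, and the notebook may well use a library vectorizer instead.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_vocabulary(documents, max_features=None):
    """Map the most frequent words across all documents to column indices."""
    counts = Counter()
    for doc in documents:
        counts.update(tokenize(doc))
    # Sort by descending frequency, then alphabetically, so the order is deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    if max_features is not None:
        ranked = ranked[:max_features]
    return {word: index for index, (word, _) in enumerate(ranked)}


def vectorize(document, vocabulary):
    """Turn one document into a list of word counts, one entry per vocabulary word."""
    vector = [0] * len(vocabulary)
    for word in tokenize(document):
        index = vocabulary.get(word)
        if index is not None:
            vector[index] += 1
    return vector


# Toy reviews invented for illustration; they are not taken from the IMDB dataset.
reviews = ["A great movie, great cast", "A dull movie"]
vocabulary = build_vocabulary(reviews)
features = [vectorize(review, vocabulary) for review in reviews]
```

Each review becomes a fixed-length vector of word counts, with word order discarded. Vectors of this kind, paired with the sentiment labels, are what a Random Forest classifier would be trained on.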
This project requires Python 2.7 and the following Python libraries installed:
You will also need software installed to run a Jupyter Notebook.
Udacity recommends that students install Anaconda, a pre-packaged Python distribution that contains all of the necessary libraries and software for this project.
Complete code is provided in the notebook.ipynb notebook file. You will also need the dataset file in the data directory to complete your work.
In a terminal or command window, navigate to the top-level project directory (that contains this README) and run the following command:
jupyter notebook notebook.ipynb
This will open the Jupyter Notebook software and project file in your browser.
Training and test data can be downloaded here.