In this project which is a part of Udacity nanodegree program, four different recommendation systems to on real data from the IBM Watson Studio platform is studied:
- Rank Based Recommendation.
- User-User Based Collaborative Filtering.
- Content Based Recommendations.
- Matrix Factorization.
The project was implemented using Anaconda distribution of Python 3.0. Moreover I have used the following python libraries:
- Pickle
- Matplotlib
- NLTK
- NumPy
- Pandas
There is a jupyter note book file and a html file that implements the recommender engine.
Data cleaning, missing value imputation have been done in this part of this project.
The most popular articles are found based on users interactions.
Similarity between users are studied.
Different methods to find similarites between documents NLP techniques are used.
Matrix decomposition is used to predict new articles an individual might interact with.