Repository containing a portfolio of data science projects that I completed for academic, self-learning, and hobby purposes. The projects are presented as IPython notebooks.
-
- Estimating the Price of Used Cars: A model that estimates the price of a used car from features such as kilometers driven, engine, fuel type, model name, and city. The data was scraped from PakWheels and covers 35,000 cars.
- Breast Cancer Prediction: A model that predicts whether a patient has breast cancer. The data comes from the sklearn datasets, and an XGBoost classifier is used as the model.
- Digit Recognition using Random Forest: A model that predicts which digit a sample represents. The data comes from the sklearn datasets.
- Email Fraud Detection using Naive Bayes: A model that predicts whether an email is ham or spam.
- HR Analytics using Logistic Regression: A model that predicts whether an employee is going to leave the company.
-
- Exploratory Analysis of Pakistan's Largest E-commerce Dataset on Kaggle: EDA is performed on the dataset to answer a set of questions about it.
- Exploratory Analysis of a Sales Dataset: EDA is performed on the dataset to answer a set of questions about it.
-
- Pneumonia or Normal: A dataset of about 6,000 X-rays taken from Kaggle. A CNN model is used for prediction.
- Demonstration video on YouTube: https://youtu.be/4A5c3Dyw21o
- Telco Customer Churn: An ANN model is used for this project. The dataset is taken from Kaggle.
-
- Movie Recommender System: IMDB data taken from Kaggle, covering 5,000 movies. The recommender system recommends the top five movies based on director, cast, genres, and keywords.
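
The idea behind a content-based recommender like the one described above can be sketched in a few lines. This is a minimal illustration only, not the code from the notebook: it assumes cosine similarity over a bag of tags built from director, cast, genres, and keywords, and the catalogue `MOVIES`, the movie titles, and the helper names `make_tags`, `cosine`, and `recommend` are all hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy catalogue (made-up titles and people), standing in for
# the 5,000-movie Kaggle dataset used by the actual project.
MOVIES = {
    "Movie A": {"director": "Director X", "cast": ["Actor 1", "Actor 2"],
                "genres": ["Action", "Sci-Fi"], "keywords": ["space", "robot"]},
    "Movie B": {"director": "Director X", "cast": ["Actor 1", "Actor 3"],
                "genres": ["Action", "Sci-Fi"], "keywords": ["space", "alien"]},
    "Movie C": {"director": "Director Y", "cast": ["Actor 2", "Actor 4"],
                "genres": ["Action"], "keywords": ["robot", "heist"]},
    "Movie D": {"director": "Director Z", "cast": ["Actor 5", "Actor 6"],
                "genres": ["Romance"], "keywords": ["wedding", "paris"]},
    "Movie E": {"director": "Director Y", "cast": ["Actor 4", "Actor 5"],
                "genres": ["Comedy"], "keywords": ["heist"]},
    "Movie F": {"director": "Director X", "cast": ["Actor 6", "Actor 7"],
                "genres": ["Drama"], "keywords": ["family"]},
    "Movie G": {"director": "Director W", "cast": ["Actor 1", "Actor 7"],
                "genres": ["Sci-Fi", "Drama"], "keywords": ["alien"]},
}


def make_tags(movie):
    """Merge director, cast, genres, and keywords into one bag of tokens."""
    tokens = [movie["director"]] + movie["cast"] + movie["genres"] + movie["keywords"]
    # Strip spaces and lowercase so a multi-word name stays a single token.
    return Counter(t.replace(" ", "").lower() for t in tokens)


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def recommend(title, movies=MOVIES, top_n=5):
    """Return the top_n titles most similar to the given title."""
    query = make_tags(movies[title])
    scores = [(other, cosine(query, make_tags(m)))
              for other, m in movies.items() if other != title]
    # Highest similarity first; the stable sort keeps ties in catalogue order.
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in scores[:top_n]]


if __name__ == "__main__":
    print(recommend("Movie A"))
```

Calling `recommend("Movie A")` ranks the other titles by how many tags they share with it, relative to the size of each tag set. On a real dataset the same approach is usually written with a library vectorizer instead of hand-rolled counters, but the ranking logic is the same.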