Project idea from Kaggle: COVID-19 World Vaccine Adverse Reactions by Ayush Garg. Project for class CS584ML
The raw data and the merged dataset are not uploaded because of their large size. The original data can be found at: https://www.kaggle.com/ayushggarg/covid19-vaccine-adverse-reactions?select=2021VAERSVAX.csv
CC0: Public Domain
The script includes five steps, from data processing and graphing to training a model and using the model for prediction.
- needs data files inside rawdata directory
- merge 3 datasets into 1
- preprocess text data into numbers; the number-to-text mappings are saved under the category directory
- needs data files inside rawdata directory
- graph data into scatter plots
- needs merged dataset file
- split merged dataset into feature and target datasets
- train a model, with 4 choices:
  - Stochastic Gradient Descent Classification
  - Logistic Regression Classification
  - K Neighbors Classification
  - Neural Network Classification with MLP
- after training, the models are saved under the models directory
- needs the trained and saved model files inside the models directory, plus an input file
- prints the prediction category and the predicted outcome for each patient as a list of 0s and 1s
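
The steps above can be sketched end to end on a tiny in-memory table. This is only an illustration, not the project's actual script: the column choice (sex, manufacturer, age, outcome), the sample rows, the file names, the hyperparameters, and the helper names `encode_column`, `build_models`, and `run_pipeline` are all assumptions, and scikit-learn is assumed as the library behind the four classifiers. Graphing is left out.

```python
import json
import pickle
import tempfile
from pathlib import Path

from sklearn.linear_model import LogisticRegression, SGDClassifier
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.neural_network import MLPClassifier

# Hypothetical stand-in for the merged dataset: (sex, manufacturer, age, outcome).
ROWS = [
    ("F", "PFIZER", 34, 0), ("M", "MODERNA", 71, 1), ("F", "MODERNA", 65, 1),
    ("M", "PFIZER", 29, 0), ("F", "PFIZER", 80, 1), ("M", "MODERNA", 41, 0),
    ("F", "MODERNA", 22, 0), ("M", "PFIZER", 77, 1), ("F", "PFIZER", 58, 1),
    ("M", "MODERNA", 36, 0), ("F", "MODERNA", 69, 1), ("M", "PFIZER", 45, 0),
]


def encode_column(values):
    """Map each distinct text value to an integer code (sorted for stable codes)."""
    mapping = {text: code for code, text in enumerate(sorted(set(values)))}
    return [mapping[v] for v in values], mapping


def build_models():
    """The four classifier choices listed above, with assumed hyperparameters."""
    return {
        "sgd": SGDClassifier(random_state=0),
        "logistic": LogisticRegression(max_iter=1000),
        "knn": KNeighborsClassifier(n_neighbors=3),
        "mlp": MLPClassifier(hidden_layer_sizes=(8,), max_iter=2000, random_state=0),
    }


def run_pipeline(workdir):
    workdir = Path(workdir)
    category_dir = workdir / "category"
    models_dir = workdir / "models"
    category_dir.mkdir(parents=True, exist_ok=True)
    models_dir.mkdir(parents=True, exist_ok=True)

    # Preprocess: turn text columns into numbers and save the mappings.
    sex_codes, sex_map = encode_column([r[0] for r in ROWS])
    manu_codes, manu_map = encode_column([r[1] for r in ROWS])
    (category_dir / "sex.json").write_text(json.dumps(sex_map))
    (category_dir / "manufacturer.json").write_text(json.dumps(manu_map))

    # Split into feature and target datasets, then into train and test parts.
    features = [[s, m, r[2]] for s, m, r in zip(sex_codes, manu_codes, ROWS)]
    target = [r[3] for r in ROWS]
    x_train, x_test, y_train, y_test = train_test_split(
        features, target, test_size=0.25, random_state=0, stratify=target
    )

    # Train each model and save it under the models directory.
    for name, model in build_models().items():
        model.fit(x_train, y_train)
        with open(models_dir / f"{name}.pkl", "wb") as fh:
            pickle.dump(model, fh)

    # Predict: reload each saved model and collect its 0/1 output per patient.
    predictions = {}
    for path in sorted(models_dir.glob("*.pkl")):
        with open(path, "rb") as fh:
            model = pickle.load(fh)
        predictions[path.stem] = [int(p) for p in model.predict(x_test)]
    return predictions, sex_map, manu_map


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        preds, _, _ = run_pipeline(tmp)
        for name, outcome in preds.items():
            print(name, outcome)
```

Each printed line shows one model name followed by its list of 0s and 1s, one entry per held-out patient. With so few rows the predictions themselves carry no meaning; the sketch only shows how the directories and steps fit together.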