Hospitals contain large databases. We can use that data to discover new useful and potentially life saving knowledge. Here we use datamining especially to predict type 2 diabetes mellitus.Predicting the percentage of chance of occurrence of Diabetes mellitus type 2 with less time complexity and high accuracy.
- Random forest algorithm.
- K-Means.
- Logistic Regression.
- Naive Bayes.
- Support Vector Classifier.
- Recursive Feature Elimination (RFE).
- Recursive Feature Elimination with Cross Validation (RFECV).
- Priciple Component Analysis.
- Grid Search.
- Voting Classifier.
The idea behind using all these algorithms and different concepts is to draw out comparison between them. In the python file attached we neatly show the comparision graph for Logistic Regression, Random Forest and Naive Bayes Algorithm outputs with out RFE, With RFE and with RFECV.