For #2 : Logistic Regression on winequality.csv #37

SanchiMittal · 2020-03-10T05:05:50Z

I have used a Logistic Regression classifier to perform binary classification on winequality.csv and classify the test data into recommended or not recommended wine.
@dzeber and @mlopatka Please let me know what improvements I need to make.
Thanks

Signed-off-by: SanchiMittal <[email protected]>

SanchiMittal · 2020-03-11T21:08:06Z

I will add other ML models such as KNN, SVM, and Naive Bayes in the next step and present a comparative study. @dzeber @mlopatka Kindly review my work so far and let me know if I need to change anything in my workflow.
Regards.

dzeber

Thanks for this PR. The notebook is well-documented and easy to follow, as are your modules. This is sufficient to satisfy the startup task #2, but you're welcome to dig into the modeling further.

Please add a comment indicating how you decided on this train-test split ratio. We are asking for this input at this point because one of the goals of the project is to better understand how this choice influences the outcomes of the model.
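
As a rough illustration of why the ratio matters, the sketch below splits a hypothetical toy label list (20% "recommended", not the real winequality.csv data) at several test ratios and reports how many positive examples end up in the test set. The `split` helper and the ratios tried are assumptions made for illustration only; in the notebook itself, scikit-learn's `train_test_split` with its `test_size` and `stratify` parameters would be the more natural tool.

```python
import random


def split(rows, test_ratio, seed=0):
    """Shuffle a copy of rows and hold out the first test_ratio fraction as the test set."""
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical toy labels: 1 = recommended, 0 = not recommended (20% positives).
labels = [1] * 20 + [0] * 80

sizes = {}
for test_ratio in (0.1, 0.2, 0.3, 0.4):
    train, test = split(labels, test_ratio)
    sizes[test_ratio] = (len(train), len(test))
    # With a small test set, only a handful of positives are available to
    # estimate precision and recall for the "recommended" class.
    print(
        f"test_ratio={test_ratio:.1f} train={len(train)} test={len(test)} "
        f"positives_in_test={sum(test)}"
    )
```

A smaller test set leaves more data for training but makes the per-class metrics noisier, which is one reason it helps to state how the ratio was chosen.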

Also, do you have any other thoughts on the results from your classification report and confusion matrix? Notice that the model seems to assign most wines to the non-recommended category regardless, which might be inflating the overall accuracy. It would be interesting to see the results on an undersampled training set.
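
To make the point about inflated accuracy concrete, here is a minimal sketch using only the standard library. The toy data (80 non-recommended rows, 20 recommended) and the `undersample` helper are hypothetical and are not taken from the PR; the helper randomly drops majority-class rows until both classes are the same size.

```python
import random
from collections import Counter


def undersample(rows, label_of, seed=0):
    """Randomly keep as many rows of each class as the rarest class has."""
    rng = random.Random(seed)
    by_class = {}
    for row in rows:
        by_class.setdefault(label_of(row), []).append(row)
    n_min = min(len(group) for group in by_class.values())
    balanced = []
    for group in by_class.values():
        balanced.extend(rng.sample(group, n_min))
    rng.shuffle(balanced)
    return balanced


# Hypothetical toy training set of (feature, label) pairs:
# label 1 = recommended, label 0 = not recommended.
train = [(i, 0) for i in range(80)] + [(i, 1) for i in range(80, 100)]

# A "model" that always predicts the majority class (0) is right on every
# non-recommended row, so its accuracy equals the majority-class share.
baseline_accuracy = sum(label == 0 for _, label in train) / len(train)

balanced = undersample(train, label_of=lambda row: row[1])
balanced_counts = Counter(label for _, label in balanced)

print("majority-class baseline accuracy:", baseline_accuracy)
print("class counts before:", Counter(label for _, label in train))
print("class counts after undersampling:", balanced_counts)
```

Only the training set should be undersampled; the test set keeps its original class balance so the reported metrics still reflect the real data.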

dzeber · 2020-03-12T22:38:32Z

dev/SanchiMittal/dataloader.py

+    # Correlation Matrix
+    corr = d.corr()
+    print("Correlation Matrix:")
+    print(corr)


I would not bother printing this since you are also displaying the visualizations.

SanchiMittal · 2020-03-14T20:16:16Z

Thank you for the review. I will implement the above suggestions in my work. I would also like to dig further into the modelling and study the performance of different models.

mlopatka

Excellent work! Thank you for addressing earlier feedback.

SanchiMittal added 3 commits March 10, 2020 10:21

Add Logistic Regression Model for Winequality dataset

3f2972a

Signed-off-by: SanchiMittal <[email protected]>

Add python modules

d2b8470

Signed-off-by: SanchiMittal <[email protected]>

Minor Changes

337dd73

Add Black Formatting

e5df166

dzeber reviewed Mar 12, 2020

mlopatka approved these changes Mar 20, 2020

mlopatka merged commit 6543432 into mozilla:master Mar 20, 2020
