
Fixed #2, train and test a classification model on vehicles dataset #30

Merged: 9 commits into mozilla:master on Mar 20, 2020

Conversation

@Bolaji61 (Contributor) commented Mar 9, 2020

No description provided.

@dzeber (Contributor) left a comment

This is a great start on this task. I am happy to see the diversity of models you are looking into.

I have a couple of recommendations:

  • Please remove `.DS_Store` from your PR. It is supposed to be excluded automatically by the `.gitignore`, but I just noticed it isn't listed there - I will fix the `.gitignore` on the master branch.
  • The code you use to test out the models should be moved to a separate module and called from there (that way you can also reuse the same code as a single function; see the sketch after this list).
  • I saw you included comments in the notebook explaining what your code does. That is good - however, I'd also like to see more text/markdown cells explaining your thought process: why you decided to do certain analyses, and your interpretations of the results. This is a great help to someone reading your notebook for the first time. For example, the correlation plot shows some fascinating patterns. It would be great to include a few sentences discussing your conclusions from it: what we should pay attention to and how it influenced the rest of your work.
  • The accuracy of 24% for the SVM looks suspect - I'd double-check that it is working correctly. That's no better than guessing class labels at random.
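
A minimal sketch of what the second and fourth points could look like in practice follows. The module layout, the name `evaluate_model`, and the use of a scaling pipeline are illustrative assumptions, not the code actually in this PR.

```python
# modules.py (sketch): one reusable helper for trying out different models.
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def evaluate_model(estimator, X, y, test_size=0.2, random_state=42, scale=True):
    """Train the given estimator on a train/test split and return its test accuracy."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=random_state, stratify=y
    )
    # Feature scaling matters for margin- and distance-based models such as SVMs;
    # an unscaled feature matrix is a common cause of near-chance SVM accuracy.
    model = make_pipeline(StandardScaler(), estimator) if scale else estimator
    model.fit(X_train, y_train)
    return accuracy_score(y_test, model.predict(X_test))
```

Called as, for example, `evaluate_model(SVC(kernel="rbf"), X, y)`, the same helper can be reused for every model in the notebook, and the scaling step is one common thing to check when an SVM scores no better than chance.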

Review thread on dev/Omobolaji/vehicles.ipynb (outdated, resolved)
@Bolaji61 (Contributor, Author) commented

Train and Test a Classification Model on vehicles.csv dataset.
I've been able to create a separate modules file containing the models and functions used.
Next is to study the performance evaluation metrics and give detailed explanations of each line of code.
@dzeber, I hope my pace is not too slow; I'm quite new to open source. Thanks for your earlier review.
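
A minimal sketch of the kind of evaluation-metric study mentioned above, using scikit-learn's `confusion_matrix` and `classification_report`; the data and model here are placeholders standing in for the notebook's actual classifier and held-out split.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report, confusion_matrix
from sklearn.model_selection import train_test_split

# Placeholder data standing in for the vehicles features and class labels.
X, y = make_classification(n_samples=400, n_classes=4, n_informative=6, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
y_pred = model.predict(X_test)

print(confusion_matrix(y_test, y_pred))        # per-class error structure
print(classification_report(y_test, y_pred))   # precision, recall, and F1 per class
```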

@Bolaji61 Bolaji61 changed the title WIP: First attempt on "Train and Test a Classification Model on vehicles.csv dataset" Train and Test a Classification Model on vehicles.csv dataset Mar 17, 2020
@Bolaji61 (Contributor, Author) commented

I have properly documented my thought process on my choice of model and `test_size` fraction. Next is to tune the parameters of the chosen models to achieve higher accuracy.
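
As a rough illustration of the planned parameter tuning, a sketch using scikit-learn's `GridSearchCV` is below; the model, parameter grid, and placeholder data are assumptions for illustration, not the notebook's actual code.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, train_test_split

# Placeholder data standing in for the vehicles features and class labels.
X, y = make_classification(n_samples=500, n_features=18, n_classes=4,
                           n_informative=8, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# Exhaustively search a small grid with 5-fold cross-validation on the training split.
param_grid = {"n_estimators": [100, 300], "max_depth": [None, 5, 10]}
search = GridSearchCV(RandomForestClassifier(random_state=0), param_grid,
                      cv=5, scoring="accuracy")
search.fit(X_train, y_train)

print(search.best_params_, round(search.best_score_, 3))
print("held-out accuracy:", round(search.score(X_test, y_test), 3))
```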

@Bolaji61 Bolaji61 changed the title Train and Test a Classification Model on vehicles.csv dataset Fixed #2, train and test model on vehicles dataset Mar 20, 2020
@Bolaji61 Bolaji61 changed the title Fixed #2, train and test model on vehicles dataset Fixed #2, train and test a classification model on vehicles dataset Mar 20, 2020
@dzeber (Contributor) commented Mar 20, 2020

Thanks for these updates - this has developed into a nicely presented, well-documented notebook. I'm merging this now, as it satisfies the requirements for #2. If you wish to continue working on it, I recommend checking out the other issues in the repo.

A couple of general comments (don't need to fix at present):

  • The `_model()` functions in your modules.py have a lot of repeated code. An improvement would be to combine them into a single function and supply the parts that change as parameters (see the sketch after this list).
  • For the presentation of your very nice analysis of test split sizes, rather than hard-coding the results shown in the table, why not run the evaluations in the notebook, capture them dynamically, and present the results directly.
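
For concreteness, a rough sketch of both suggestions follows. The names `train_and_score` and `split_size_table`, and the use of pandas for the table, are illustrative assumptions rather than the actual contents of modules.py.

```python
# Sketch: one parameterized trainer replacing the per-model _model() functions,
# plus a loop that builds the test-split comparison table dynamically.
import pandas as pd
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split


def train_and_score(estimator, X, y, test_size):
    """Fit one estimator on a split of the given size and return its test accuracy."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, random_state=42, stratify=y
    )
    estimator.fit(X_train, y_train)
    return accuracy_score(y_test, estimator.predict(X_test))


def split_size_table(estimators, X, y, sizes=(0.1, 0.2, 0.3, 0.4)):
    """Evaluate each named estimator across several test sizes; return a DataFrame."""
    rows = {
        name: {size: train_and_score(model, X, y, size) for size in sizes}
        for name, model in estimators.items()
    }
    return pd.DataFrame(rows).T  # rows: models, columns: test-split sizes
```

Called in the notebook as, for example, `split_size_table({"svm": SVC(), "forest": RandomForestClassifier()}, X, y)`, the displayed table would always reflect the latest run instead of hard-coded numbers.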

@dzeber dzeber merged commit 343efaf into mozilla:master Mar 20, 2020
@Bolaji61 (Contributor, Author) commented

Yaay, this is great news. Thanks a lot, it's my first ever merged PR in an open source project. I'll work on other issues in the project and, as suggested, come back to the points you commented on later.
I really appreciate your reviews, and I hope to help others the same way once I'm more experienced. Have a great weekend.

@Bolaji61 (Contributor, Author) commented

> The `_model()` functions in your modules.py have a lot of repeated code. An improvement would be to combine them into a single function and supply the parts that change as parameters.

The first suggestion has been fixed accordingly.
