
First attempt on vehicle dataset with a random forest classifier #13

Merged
merged 9 commits into from
Mar 16, 2020

Conversation

Sidrah-Madiha
Contributor

This is the first attempt to classify the vehicle.csv dataset with a random forest classifier.
The next step is to try to improve this classifier.
Then move on to experimenting with other classifiers, to see which achieves the best accuracy.
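A minimal sketch of the approach described, using synthetic data in place of vehicle.csv (the dataset, feature count, and hyperparameters here are illustrative assumptions, not the PR's actual code):

```python
# Sketch only: synthetic multi-class data standing in for vehicle.csv.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Four classes, as in a vehicle-silhouette-style problem (assumed shape).
X, y = make_classification(n_samples=200, n_features=8, n_classes=4,
                           n_informative=5, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
                                                    random_state=21)

# Fit a random forest and score it on the held-out test set.
clf = RandomForestClassifier(n_estimators=100, random_state=0)
clf.fit(X_train, y_train)
accuracy = clf.score(X_test, y_test)
```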


Contributor Author

@Sidrah-Madiha Sidrah-Madiha left a comment


minor changes

@Sidrah-Madiha
Contributor Author

This was my first attempt; it fixes #2.

Contributor

@mlopatka mlopatka left a comment


This is a nice, modular PR addressing the issue very succinctly.
If you would like, you can investigate the misclassified points in more depth, or submit that as a new PR.

Feel free to merge in.

Contributor

@dzeber dzeber left a comment


Overall, very nice PR which does a good job splitting the code between the module and the notebook. Requesting changes to fix the test set size discrepancy.

# print('The target variable: ')
# print(y[:5])
# Split dataset into training set and test set
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=21)
Contributor


The 0.2 test proportion does not match the comment in the notebook. In general, how did you decide on this test set size? It would be good to include a comment about this in the notebook.
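For reference, `test_size` directly controls the split proportion; a quick sketch on stand-in data (not the PR's actual features) shows how the 0.2 value maps to row counts:

```python
# Sketch: how test_size=0.2 partitions the rows (stand-in data, not vehicle.csv).
from sklearn.model_selection import train_test_split

X = list(range(100))              # stand-in feature rows
y = [i % 2 for i in range(100)]   # stand-in labels

# test_size=0.2 reserves 20% of the rows for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=21
)
print(len(X_train), len(X_test))  # 80 20
```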

"metadata": {},
"source": [
"### Conclusion:\n",
"Overall I got 78 percent accuracy, which doesn't seem good. The next step for today will be to first try to improve this model, then I will experiment with other models to see their comparative performance on this dataset."
Contributor


Looking at the confusion matrix and evaluation metric table, can you expand on your interpretation of this overall accuracy? I notice that the per-class accuracy scores are quite different across the classes.
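One way to quantify the reviewer's observation is to read per-class accuracy (recall) off the confusion matrix diagonal; a sketch with made-up labels (not the PR's actual predictions):

```python
# Sketch: per-class accuracy from a confusion matrix (made-up labels).
import numpy as np
from sklearn.metrics import confusion_matrix

y_true = ["bus", "bus", "van", "van", "car", "car", "car"]
y_pred = ["bus", "van", "van", "van", "car", "bus", "car"]

cm = confusion_matrix(y_true, y_pred, labels=["bus", "car", "van"])
# Rows are true classes; the diagonal counts correct predictions per class.
per_class_acc = cm.diagonal() / cm.sum(axis=1)
print(per_class_acc)  # bus=0.5, car=0.667, van=1.0 despite ~71% overall
```

A high overall accuracy can hide a class that is mostly misclassified, which is why the per-class breakdown matters here.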

@mlopatka mlopatka merged commit bd53913 into mozilla:master Mar 16, 2020