WIP: Visualize misclassifications #2

SaumyaSinghal · 2020-03-25T20:04:26Z

No description provided.

Updating KaairaGupta/master

…ogistic Regression (mozilla#12) * Data Loaded from vehicles.csv * Data visulaization and training model with ifferent algorithms * Evaluation of model is done. * Changed model from Logistic Regression to Support Vector Machine At first attempt i used three differnet models but Logistic Regression , Support Vector Machine and Decision Tree, and the overall accuracy with LR was better than any other but with changing validation parameters in SVM classification , model accuacy increased from 82% to 88%. * Delete train and test model-checkpoint.ipynb * Changed file named. * all python modules were added * docstrings were added * labels added in confusion matrix * Histogram colors were changed into single color * solved histogram issue * Update modules.py * changes made in histogram * Update modules.py * sorted histogram * Update modules.py * labels were added for confusion matrix * Python Custom Modules were added * Update Vehicle_Classifier.ipynb * Update modules.py * Update modules.py * Update modules.py * added labels in confusion matrix * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Requested Changes were made * Update modules.py * change categorical data into numerical data * change areguments in LR model * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Update modules.py * Requested changes were made * Update modules.py * Update modules.py * Update modules.py * Delete Untitled.ipynb * code formatted using python Black * Requested changes were made * shifted classifier's code from modules.py to ModelEvaluation.py * removed learning curves from file * added function for model evaluation in Model Evaluation file * updated svm and lr * added comments * added doc strings * Update ModelEvaluation.py * Update ModelEvaluation.py * Update Vehicle_Classifier.ipynb * Update modules.py * added descriptions * added interpretations of visualization

…nition of classes (mozilla#86) * Contribution to issue #2 * Eliminació de l'arxiu de prova * Eliminació de l'arxiu de prova * Fixed axes when not numbers and removed superfluous function. * Initial commit to Issue mozilla#7 Visualization of missclassifications. * Adding explanatory graphs for the notebook regarding Issue mozilla#7 * Better graph to illustrate option 'hits-fails'.

* Add Logistic Regression Model for Winequality dataset Signed-off-by: SanchiMittal <[email protected]> * Add python modules Signed-off-by: SanchiMittal <[email protected]> * Minor Changes * Add Black Formatting

Fixed #2, train and test a classification model on vehicles dataset

…n the train-test split ratio (mozilla#72) * Contribution to issue #2 * Eliminació de l'arxiu de prova * Eliminació de l'arxiu de prova * Fixed axes when not numbers and removed superfluous function. * Minor changes to modules in file data_exploration.py. Co-authored-by: mlopatka <[email protected]>

these committed changes fixes issue #3 of traversal space of train-test splits using KNN model.in #2 i have used decision tree and further recommended outlier detection algorithm for classification. so in this PR i have used KNN and compared results with previous classfication.this PR uses already defined modules in #2.

#3 traversal of train_test_split

* Create Readme.md * Create files for exploring issue #2 * Format using black * Remove notebook from master * Increase modularization * create file for issue 6 * remove file added by mistake * Create notebook for issue 6 * Re-upload to the right folder * Delete file from the incorrect folder

* Update .gitignore * Preliminary Analysis * Helper modules (Bar and Hist graph) * Rough KNN algorithm implemented * Delete libraries.py * KNN classifier refactored and polished Returns only variable of intests for use the metrics calculations. * refactored for performance just the required functions imported * draft mlp classifier implemented to be reviewed * ... * Threshold conversion logic implemented Since knn.predict calculates a probability, we implement a logic for binary classification * Prelimary cleaning and knn model classification implemented! * Adjusted plor error with title placement * ... * Files reformated with 'Black' * Logistic Regression classifier * Refactores modules to improve modularity * Implemented Log Reg * Deleted mpl module to focus on knn and log reg * Refactors gotignore to my personal folder * refactored for readability * Implementation to add counts and relative percentages on bars graph * Refactored name #2, Completed Prelimary Analysis and Interpreted Results * Update Issue #2 - Train and test a classification model (PRESC).ipynb * Files reformated with 'Black' * Display Error corrected * Interpreted choice of hyper-parameters * Refactored and Added Modules used for Issue 3 * Prelimanry Analysis - Traversal of the space of train_test splits * Issue#3 complete * Removed Issues #2 and #3 ipynb * Issue mozilla#4 - completed Issue mozilla#4 - Traversal of the space of cross-validation folds * Delete defaults_data.csv Removing duplication of the existing data set which can be loaded from the repos root directory. Co-authored-by: mlopatka <[email protected]>

…ozilla#92) * Classification model wine.csv * Classification model wine.csv * Merging modifications

#2 Dropped quality, shifted the logic to python file, shifted imports to the top, added confusion_matrix and classification_report

…el (Stochastic Gradient Descent) on winequality.csv (mozilla#58) * adds incomplete files * adds .ipynb, .py and updates environment.yml * Delete winequality.ipynb removing duplicate files * Delete winequality_modules.py removing duplicate files * Delete winequality.ipynb removing incomplete files * Delete winequality_modules.py removing incomplete files * adds .ipynb, .py and updates environment.yml * adds description and deatiled reasoning for the methods, models and parameters used * drops quality column * updates .py file * adds files in a new folder * updates .yml

…la#111) * WIP: #2 on the dataset 'eeg.csv' WIP: #2 on the dataset 'eeg.csv' * Add files via upload * Delete WIP: #2 on the dataset 'eeg.csv' * Delete #2 Train and test a classification model, eeg.csv-checkpoint.ipynb * WIP: #2 on the dataset 'eeg.csv' * Delete #2 Train and test a classification model, eeg.csv-checkpoint.ipynb * WIP: #2 Train and test a classification model, eeg.csv dataset * Delete #2 Train and test a classification model, eeg.csv.ipynb * Create README * WIP: #2 Train and test a classification model, eeg.csv dataset * Delete README

For #2: on the dataset 'winequality.csv'

* WIP:Issue #2 KNN Classifier for eeg.csv -Added separate modules for preprocessing eeg.csv -Added a notebook with the results -This commit addresses the startup task - Issue #2 * Updated Notenook results Updated the results in the notebook for review Co-authored-by: swatik718 <>

Exploration of the Vehicles dataset based on Startup task #2

* Create Readme.md * Create files for exploring issue #2 * Format using black * Remove notebook from master * Increase modularization * create file for issue 6 * remove file added by mistake * WIP: Importance score for datapoints * Re-upload to the correct directory * Delete file from wrong directory

SaumyaSinghal and others added 10 commits March 10, 2020 02:53

Create Readme.md

e66b42c

Create files for exploring issue #2

feb4f30

Format using black

a4dfac2

Remove notebook from master

b228db9

Increase modularization

4903b99

create file for issue 6

b30404a

remove file added by mistake

153df2c

WIP: Visualize misclassifications

2924ea2

Re-upload to the correct folder

5184d52

Delete from wrong directory

66fa71a

SaumyaSinghal force-pushed the master branch from 30ccac8 to ff5ade8 Compare March 31, 2020 19:06

SaumyaSinghal pushed a commit that referenced this pull request Mar 31, 2020

Merge pull request #2 from mozilla/master

b13e89f

Updating KaairaGupta/master

SaumyaSinghal closed this in 3734dd3 Mar 31, 2020

SaumyaSinghal pushed a commit that referenced this pull request Mar 31, 2020

Merge pull request mozilla#30 from Bolaji61/master

343efaf

Fixed #2, train and test a classification model on vehicles dataset

SaumyaSinghal pushed a commit that referenced this pull request Mar 31, 2020

FIXES for demo of #2

98ce20b

SaumyaSinghal pushed a commit that referenced this pull request Mar 31, 2020

Merge pull request #2 from asthad16/asthad16-issue3-train_test_split

f9f64f4

#3 traversal of train_test_split

SaumyaSinghal pushed a commit that referenced this pull request Mar 31, 2020

issue #2 Training classification model for winequality.csv dataset (m…

5dc5a98

…ozilla#92) * Classification model wine.csv * Classification model wine.csv * Merging modifications

SaumyaSinghal pushed a commit that referenced this pull request Mar 31, 2020

Merge pull request mozilla#40 from simran0117/master

885e7b2

#2 Dropped quality, shifted the logic to python file, shifted imports to the top, added confusion_matrix and classification_report

SaumyaSinghal pushed a commit that referenced this pull request Mar 31, 2020

Merge pull request mozilla#18 from KaairaGupta/master

d08cd19

For #2: on the dataset 'winequality.csv'

SaumyaSinghal pushed a commit that referenced this pull request Mar 31, 2020

Merge pull request mozilla#77 from opeyemiferanmi1/My-contributions

a5b823e

Exploration of the Vehicles dataset based on Startup task #2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: Visualize misclassifications #2

WIP: Visualize misclassifications #2

SaumyaSinghal commented Mar 25, 2020

WIP: Visualize misclassifications #2

WIP: Visualize misclassifications #2

Conversation

SaumyaSinghal commented Mar 25, 2020