2018-08-23 workshop notes #14

Closed
21 of 30 tasks
agitter opened this issue Aug 24, 2018 · 1 comment

agitter commented Aug 24, 2018

Here are some of my notes and possible revisions from the pilot workshop. We can discuss these in person before implementing any changes.

Agenda and slides:

  • Look for places to add more interactivity in the initial slides. Could ask about ML examples in their area after showing the examples in the slides.
  • Discuss the relationship between the classifiers we present and their regression analogs.
  • Could expand the setup guide and ask participants to try installing the software before the workshop.
  • Use the same dataset in the initial slides, notebook, and software example to avoid having to explain multiple datasets early on.
  • Show the notebook at the end of the workshop and illustrate the ML pipeline in the software.
  • Define folds on a slide.
  • Add more examples of why the train/validate/test split is needed. Move the data splitting and cross-validation discussion even earlier. Perhaps introduce overfitting at this point (a splitting sketch appears after this list).
  • Note the other cross-validation strategies in the slides and link to the vocabulary guide.
  • Add discussion in GitHub Issues as another next step in the slides.
  • Annotate the y and y hat notation in logistic regression.
  • Consider showing an example of how a trained logistic regression model makes a prediction y hat (see the prediction sketch after this list).
  • Add useful discussion points to the notes section of the slides to help new instructors lead the workshop. (Instructor material for slides #20)
  • Update gender in decision tree example.
  • Provide hints about which datasets and settings to use to explore the questions.
  • After the example ML papers, go into more detail for one: features, class labels, classifier, what was learned and why it matters.
  • Work on a correspondence between a real biological problem and a 2d toy example.
  • Reference Google crash course for a possible ordering.
  • Another possible ordering: ML motivation with examples, test out 1-2 classifiers in the software, learn about them in more depth in the slides, revisit classifiers in software with knowledge of the hyperparameters, overfitting/underfitting and cross-validation, compare selecting on training set only (hold out 0%) versus cross-validation, then finish with data loading and other classifiers.
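
To make the splitting and cross-validation bullets concrete, here is a minimal sketch, assuming scikit-learn and a synthetic dataset (not workshop code):

```python
# Sketch of the train/validate/test idea; the dataset and settings are
# placeholders, not the workshop's data.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score, train_test_split

X, y = make_classification(n_samples=300, n_features=10, random_state=0)

# Hold out a test set that is never touched during model selection.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# 5-fold cross-validation on the training portion: each fold acts once as
# the validation set while the other four folds are used for fitting.
model = LogisticRegression()
print(cross_val_score(model, X_train, y_train, cv=5).mean())

# Only after model selection is done do we look at the test set.
model.fit(X_train, y_train)
print(model.score(X_test, y_test))
```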
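
For the y hat bullet, a hedged illustration of how a fitted logistic regression turns features into a prediction; the weighted-sum-plus-logistic arithmetic is standard, but the data and variable names here are invented:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, n_features=2, n_informative=2,
                           n_redundant=0, random_state=0)
model = LogisticRegression().fit(X, y)

x_new = X[:1]  # a single example to score
# The model computes a weighted sum of the features plus an intercept...
z = x_new @ model.coef_.T + model.intercept_
# ...and squashes it through the logistic function to get P(y = 1 | x).
p = 1 / (1 + np.exp(-z))
print(p.ravel(), model.predict_proba(x_new)[:, 1])  # these agree
print(model.predict(x_new))  # y hat: 1 if p >= 0.5, else 0
```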

Software:

  • Mac OS opens the wrapper script in an editor instead of executing it. Need alternative instructions for launching the software.
  • Need more guidance for running the software on Windows when Anaconda is not on the PATH. (Improving the wrapper script for Windows #21)
  • Determine why Windows does not launch the GUI the first time the batch script is run. (Improving the wrapper script for Windows #21)
  • Add a note about the warning Windows shows about running a batch script from an unknown publisher. (Improving the wrapper script for Windows #21)
  • Add a note about common NumPy or other warnings that can be safely ignored.
  • Clear the unlabeled data after loading a new labeled dataset.
  • Neural networks do not provide a class weight option because scikit-learn does not implement it yet. See pull request https://github.com/scikit-learn/scikit-learn/pull/11723 for progress; a resampling workaround is sketched after this list. (Neural networks class weight gitter-lab/ml4bio#8)
  • Create an issue with the error message a student received on Mac OS. (Error message #15)
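
For the class weight item, one possible stopgap (a sketch of manual oversampling; this approach is my assumption, not something ml4bio implements) is to rebalance the training data before fitting, since MLPClassifier accepts neither class_weight nor per-sample weights:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.neural_network import MLPClassifier
from sklearn.utils import resample

# Synthetic imbalanced data: roughly 90% class 0, 10% class 1.
X, y = make_classification(n_samples=400, weights=[0.9], random_state=0)

# Oversample the minority class (with replacement) to match the majority.
minority = y == 1
n_needed = int((~minority).sum() - minority.sum())
X_extra, y_extra = resample(X[minority], y[minority],
                            n_samples=n_needed, random_state=0)
X_bal = np.vstack([X, X_extra])
y_bal = np.concatenate([y, y_extra])

clf = MLPClassifier(max_iter=500, random_state=0).fit(X_bal, y_bal)
```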

Data and guides:

  • Document what pre-processing was done on the neurotoxicity dataset to reduce the features to 1000 genes; this is described in the paper.
  • Update the performance guide to explain the performance of a random classifier and how the area under the PR curve depends on class imbalance; a quick check is sketched after this list. (Random PR curve for performance guide #22)
  • Consider adding a toy dataset that is imbalanced and non-linearly separable to help explore different performance measures.
  • Add a data cleaning and pre-processing guide with examples (Data Carpentry resources?).
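
For the random-classifier item, a quick numerical check (synthetic labels and scores; an assumed setup, not the performance guide's wording) that the average precision of random scores tracks the positive class fraction:

```python
import numpy as np
from sklearn.metrics import average_precision_score

rng = np.random.default_rng(0)
for pos_frac in (0.5, 0.1):
    y_true = rng.random(100_000) < pos_frac  # imbalanced binary labels
    scores = rng.random(100_000)             # a classifier that guesses
    ap = average_precision_score(y_true, scores)
    print(f"positive fraction {pos_frac:.1f} -> average precision ~ {ap:.3f}")
```

The baseline of the PR curve moves with class imbalance, unlike the ROC curve, whose random baseline stays at 0.5; that contrast could go in the performance guide.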

agitter commented Dec 5, 2018

@fmu2 completed almost all of these suggestions from the 2018-08-23 workshop. I created new specific issues for the remaining comments we may want to address. The others can be safely ignored in my opinion.

agitter closed this as completed Dec 5, 2018