Comments on software #35

Open · 10 of 17 tasks
fmu2 opened this issue Jul 30, 2018 · 4 comments

fmu2 (Collaborator) commented Jul 30, 2018

  • Can we give a visual indication that there are three stages to advance through as the user selects data, trains models, and tests a model? Perhaps show these as tabs?
  • How does the user know that they should wait for training to finish? For a large dataset, will it look like the GUI hung?
  • I received this warning. It should be fixed in the next scikit-learn release, so we can ignore it for now: https://stackoverflow.com/questions/48687375/deprecation-error-in-sklearn-about-empty-array-without-any-empty-array-in-my-cod
  • We should start to look for functions and parts of the code that would support unit tests.
  • Can I save the best classifier, either the trained model or the description of its parameters? Can I save the table of models I tried and their performance? This may be out of scope for the initial software, but it is a potential v2 feature.
  • Can we have larger font sizes in the plots?
  • Is there a way for us to define jargon like classifier hyperparameters within the GUI? Or should we link to sklearn docs?
  • After we have the workshop guides ready, please document the code organization so that others could eventually help maintain it.
  • Should we display the class label (column name or valid values) in the Data Summary panel?
  • After the warning about too many samples for leave-one-out, the software still advanced to the next tab. Is that intended behavior? What is that sample threshold?
  • Is the Data Plot implemented or is it a placeholder?
  • In the iris dataset I had a classifier that reported a training AUROC of 1 in the table and figure legend, but the ROC plot was not a flat line. See the attached image.
  • Need better documentation or more robust file loading for the unlabeled data CSV file. Should we accept a file that has class labels and pop up a warning that the labels have been removed? (See the first sketch after this list.)
  • When saving the results, .csv should be appended to the end of the file name automatically if it isn't there already. (Also covered in the first sketch after this list.)
  • The default number of iterations for the neural network is too low. On toy dataset 1, it reaches the max iterations and fails training with default settings. (See the second sketch after this list.)
  • Implement plot figure saving.
  • Document expected warnings from the wrapper scripts.
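
A minimal sketch of the CSV handling suggested above, assuming pandas is available; the function names and the label column name are hypothetical, and the real application would pass in its own label column:

```python
import os
import warnings

import pandas as pd


def load_unlabeled_csv(path, label_column="class"):
    """Load an unlabeled CSV, dropping a label column if one is present."""
    data = pd.read_csv(path)
    if label_column in data.columns:
        # Warn rather than reject: accept the file, but tell the user
        # that the labels were removed.
        warnings.warn(
            "The file appears to contain class labels; "
            "the '{}' column has been removed.".format(label_column)
        )
        data = data.drop(columns=[label_column])
    return data


def ensure_csv_extension(filename):
    """Append .csv to a results file name if it is not there already."""
    if os.path.splitext(filename)[1].lower() != ".csv":
        filename += ".csv"
    return filename
```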
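On the neural network iterations item, a hedged sketch assuming the classifier is scikit-learn's MLPClassifier, whose default max_iter is 200; the raised value below is illustrative, not a tuned recommendation:

```python
from sklearn.neural_network import MLPClassifier

# The default max_iter=200 can stop the optimizer before convergence on
# some datasets, which raises a ConvergenceWarning and leaves the model
# undertrained. A larger cap gives training room to finish.
clf = MLPClassifier(max_iter=1000, random_state=0)
```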
agitter (Member) commented Jul 30, 2018

I converted the list to the GitHub checkbox format. We can check off the short-term issues and create new issues for specific items that take more than a few days or are longer-term efforts.

fmu2 (Collaborator, Author) commented Aug 6, 2018

Done:

  • added a title for each stage
  • increased font size from 6 to 7 in the plotting area
  • added links to sklearn docs on the 2nd page
  • sample threshold for leave-one-out: 50
  • Data Plot is only implemented for datasets with exactly two continuous features
  • metrics rounded to 3 decimal points instead of 2
  • training now runs in a new thread (terminating training halfway through is currently not supported); see the sketch below
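
A minimal sketch of the background-training pattern, assuming a PyQt5 GUI; the class, signal, and slot names are hypothetical, not the project's actual API:

```python
from PyQt5.QtCore import QThread, pyqtSignal


class TrainingThread(QThread):
    """Fit a model off the GUI thread so the window stays responsive."""

    finished_training = pyqtSignal(object)  # emits the fitted classifier

    def __init__(self, classifier, X, y, parent=None):
        super().__init__(parent)
        self.classifier = classifier
        self.X = X
        self.y = y

    def run(self):
        self.classifier.fit(self.X, self.y)
        self.finished_training.emit(self.classifier)


# In the main window, connect the signal to a slot that updates the GUI:
#   thread = TrainingThread(clf, X_train, y_train)
#   thread.finished_training.connect(self.on_training_done)
#   thread.start()
```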

To do:

  • allow user-supplied test set
  • save classifier parameters

agitter (Member) commented Aug 7, 2018

Excellent progress!

> allow user-supplied test set

This could wait for v2. It may be a less common use case.

For the documentation, I suggest documenting the code instead of writing a separate Markdown file. There are Python documentation conventions that can be used to generate documentation files from the code comments. sklearn is a great example of this because they have strong documentation. If you inspect their source code, for example the decision tree, you can see how the functions, arguments, examples, etc. are all documented in the code. They use Circle CI to automatically build and deploy the documentation, but we wouldn't need that complexity.

I believe that Sphinx is the underlying system that translates the comments to external documents. Let's explore that as an option for documentation.
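
For illustration, here is a short numpydoc-style docstring, the convention sklearn follows and one that Sphinx can render with the numpydoc or napoleon extension; the function itself is hypothetical:

```python
def evaluate_classifier(model, X, y, metric="accuracy"):
    """Evaluate a fitted classifier on held-out data.

    Parameters
    ----------
    model : object
        A fitted scikit-learn-style classifier exposing ``predict``.
    X : array-like of shape (n_samples, n_features)
        Feature matrix of the evaluation set.
    y : array-like of shape (n_samples,)
        True class labels.
    metric : str, default="accuracy"
        Name of the metric to compute.

    Returns
    -------
    score : float
        The computed metric value.
    """
    ...
```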

agitter (Member) commented Aug 26, 2018

I added some good suggestions from @csmagnano to the list above. Thank you for the testing and great feedback.

The wrapper script warnings he saw were:

```
ml4bio/lib/python3.5/importlib/_bootstrap.py:222: RuntimeWarning: numpy.dtype size changed, may indicate binary incompatibility. Expected 96, got 88
  return f(*args, **kwds)
ml4bio/lib/python3.5/site-packages/sklearn/ensemble/weight_boosting.py:29: DeprecationWarning: numpy.core.umath_tests is an internal NumPy module and should not be imported. It will be removed in a future NumPy release.
  from numpy.core.umath_tests import inner1d
```

Both seem relatively harmless but will confuse beginners.
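
One way to keep these from confusing beginners is to filter the two specific messages at startup, before the affected packages are imported; a minimal sketch, with message patterns matching the warnings quoted above:

```python
import warnings

# Silence the two known-harmless warnings quoted above; everything else
# still surfaces. This must run before importing the numpy/sklearn
# modules that trigger them.
warnings.filterwarnings(
    "ignore", message="numpy.dtype size changed", category=RuntimeWarning
)
warnings.filterwarnings(
    "ignore",
    message="numpy.core.umath_tests is an internal NumPy module",
    category=DeprecationWarning,
)
```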

agitter transferred this issue from carpentries-incubator/ml4bio-workshop on Mar 4, 2021