Phenopype: a phenotyping pipeline for Python #23

mluerig · 2020-03-26T16:57:45Z

Submitting Author: Name (@mluerig)
Package Name: Phenopype
One-Line Description of Package: a phenotyping pipeline for Python
Repository Link (if existing): https://github.com/mluerig/phenopype

Description

Phenopype is a high throughput phenotyping pipeline for Python that aims at supporting biologists in their efforts to extract high dimensional phenotypic data from digital images. Phenopype provides high level functions for image processing that can be stacked and executed sequentially to efficiently process single images or large data sets in a semi or fully automated fashion. Users can assemble their own function-stacks that can be customized and stored along with raw data for full reproducibility (check the high throughput workflow). Phenopype can be run from Python or from a Python Integrated Development Environment (IDE), like Spyder. Some Python knowledge is necessary, but most of the heavy lifting is done in the background. Phenopype can be installed from the Python Package Index (PYPI) using pip install phenopype.

Scope

Please indicate which category or categories this package falls under:
- Data retrieval
- Data extraction
- Data munging
- Data deposition
- Data visualization
- Reproducibility
- Geospatial
- Education
- Unsure/Other (explain below)
Explain how the and why the package falls under these categories (briefly, 1-2 sentences). Please note any areas you are unsure of:

Phenopype is designed to extract phenotypic data (https://en.wikipedia.org/wiki/Phenotype) of plants, animals, and other organisms from images and videos.

Who is the target audience and what are scientific applications of this package?

Phenopype is intended for ecologists and evolutionary biologists that work with phenotypic data. Phenotypic data are an essential component of ecological and evolutionary research (https://www.nature.com/articles/nrg2897)

Are there other Python packages that accomplish the same thing? If so, how does yours differ?

Only low level computer vision packages like OpenCV or scikit-image are out there that require a lot of configuring and a relatively deep understanding of computer vision and Python in general. Phenopype offers high level functions so that users can focus on the relevant analytic parts of image analysis.

Any other questions or issues we should be aware of?:

Documentation (https://mluerig.github.io/phenopype/) is semi-complete. I am working on finishing up all docstrings and making the package PEP8 conform.

P.S. Have feedback/comments about our review process? Leave a comment here

EDIT: fixed typos

lwasser · 2020-03-26T20:41:56Z

hi @mluerig ! welcome to pyopensci and thank you for your submission! I will get back to you with comments in the next week or so. Thank you for your patience!!

lwasser · 2020-04-07T14:53:19Z

hey @mluerig can you provide a bit more explanation about the exact functionality that phenotype provides? i was a bit unclear after looking at the docs! many thanks!

mluerig · 2020-04-07T15:50:51Z

I guess the documentation is still a bit confusing. In short phenopype aims at providing a comprehensive and easy-to-use high throughput image analysis workflow using classic computer vision (no machine learning - yet). It aims at ecologists and evolutionary biologists that want to quickly analyze images of organisms and extract phenotypic data.

The provided functions span image data set management (through projects), preprocessing (e.g. setting a common size and color reference and correct images), segmentation (e.g. thresholding and watershed), measurement (e.g. pixel-intensity or landmarks), as well as visualizing and exporting the produced results. The provided functions are designed to be intuitive and intend to minimize user interaction and manual work as much as possible. So, everything is streamlined towards getting solid results fast, even if you don't have a strong programming background.

The other idea is that all settings, intermediate image analysis steps, (e.g. the contour of detected objects) and raw data are available after the analysis. With human-readable configuration files scientists can generate "cookie-cutter" methods that can be reused, and shared. This will also allow reviewers to reproduce the obtained results with a single line of code, so nobody has to dig through complex scripts. This makes the collected data very reproducible, which is becoming more and more important.

Does that makes it a bit more clear? Please let me know where the documentation is unclear. Also, there is a written manuscript that provides more comprehensive information - let me know if you would like to see it.

lwasser · 2020-04-07T16:56:32Z

thank you @mluerig !! i will get back to you. The challenge I am facing right now is just ensuring that we at pyopensci have reviewers who can effectively review packages that are on the analytics side of things! if you were to submit this, can you think of 2 people who would have the skills required to review? i also can ask around on twitter but to be transparent, we are trying to decide what packages we can support review of now and so that is something i consider when a package comes in to us. thank you so much for the speedy response! and again my apologies for such a slow response time.

mluerig · 2020-04-07T17:09:25Z

yupp I have a few people in mind. should I contact them and then get back to you here, or how should we do this?

lwasser · 2020-04-07T19:01:56Z

thank you @mluerig give me a little bit of time.
What i'm trying to sess out is how analytics focused this package is.
The steps will be

you submit a full submission rather than a pre submission as this one is! i will ask you to do this if i can get a bit more info from our team.
I will then ask you about suggesting reviewers and we will ping them here.

we have a meeting coming up on thursday if you have a chance to attend. i plan to bring up this and another presubmission package to ensure it's "in scope". Just so yo uknow my only concern has been analytics focused packages can be difficult to quality check given so few people have the expertise needed. However on the other hand, if we can find people with sufficient expertise, and trust in our reviewers similar to journals I think we should consider a broader range of packages . if you can hang tight until thursday, that would be great. you are also WELCOME to join us at 11am mountain time which i know might be late for you so i understand if that doesn't work!

mluerig · 2020-04-07T22:31:32Z

okay no problem it can wait. I'm also happy to join the meeting if it helps sorting things out (my email is here)

mluerig · 2020-04-09T05:49:48Z

just let me know how I can join, then I'll try to make it (I'll be in the mountains all day)

lwasser · 2020-04-09T16:06:44Z

https://pyopensci.discourse.group/t/april-9-community-meeting/169 if you login to our discourse (you can use your github login!) you will have the meeting information. sorry this took me a bit of time to get to -- was struggling with how to avoid being "bombed" online in a meeting!!

lwasser · 2020-04-09T17:08:00Z

great @mluerig please go ahead and submit an actual submission and we will get this in our review queu

mluerig · 2020-04-18T15:29:58Z

@lwasser Phenopype is almost in ok shape to be reviewed. However, one module (video-analysis, which is an extension of the core image analysis kit) is still not working great and I would like to spend a few more weeks with it. It's an important, but code-wise, peripheral feature. Would you agree to a review of the program as it is now (linux builds passing, 70% coverage, all docstrings are there, tutorials and vignettes as well), and then I add the fully functional video-analysis module to a later re-submission?

lwasser · 2020-04-20T16:23:03Z

@mluerig we prefer that you get the package in full working order PRIOR to submitting it for a review. So let's leave this presubmission open for the time being. Please ping me when all of the code is in a state that you think is acceptable for review. Thank you for checking in / asking about this!

mluerig · 2020-04-20T16:30:46Z

ok I'll finish it up before then.

one more thing about CI: I recently started using travis CI with my program, and I discovered i) that travis doesn't support python builds for windows and macOS, and ii) testing some of the functions is tricky because they open up a GUI requiring user interaction.

I can't change i), and I tried to mimic as much user input for ii) as possible, but ultimately, I will have to run the tests locally to get fully coverage. is this acceptable?

lwasser · 2020-04-20T16:43:49Z

hi @mluerig these are both good questions
re travis. there are a few options. I believe in our cookiecutter example we have an example implementation for appveyor which runs windows. circleci also has windows options now (i haven't tried it yet!). But there are several windows options available. we've been using appveyor for our packages.

Re the gui implementation for tests... i need to dig a little bit more into that. let me see what folks say on the discourse forum / twitter and i will get back to you. Ideally tests can be all run via ci. it seems like you could potentially implement some sort of monkey patching to mimic user input but i really don't know enough about this to make any suggestions! more to come on this.

mluerig · 2020-04-20T16:51:41Z

@lwasser ah I didn't know about appveyor - I'll check it out

yeah I have been monkeypatching keyboard-input using the mock package, that works great. but clicking into an image is something else. I think I could get GUI functions working on CI by supplying some default coordinates and timers, but this would require a special testing interface for some of the functions, which I would implement as a last resort

lwasser · 2020-04-20T17:59:38Z

@mluerig yes appveyor works pretty well!

i can totally get how clicking on an image would be difficult to recreate. let me do a bit of digging and i'll get back to you! specifically what is the user input providing? aoi regions of the image to analyze or training data or something to that effect?

mluerig · 2020-04-20T18:03:52Z

yupp sometimes a mask needs to be selected to detect blobs within or a reference card measured to for automatic detection. it's not unimportant, but again, if we don't find anything I'll fix up an appropriate testing interface in the function (there is really only one complex class handling all the user input, so it's actually not so hard to do this)

mluerig · 2020-05-04T16:52:30Z

closing - actual submission here: #24

@lwasser let me know if you would like to receive my suggestions for reviewers

mluerig added the presubmission label Mar 26, 2020

lwasser added the Submission Requested label Apr 9, 2020

mluerig mentioned this issue May 4, 2020

Phenopype: a phenotyping pipeline for Python #24

Closed

22 tasks

mluerig closed this as completed May 4, 2020

lwasser added this to presubmission-inquiries Apr 6, 2024

lwasser moved this to Done in presubmission-inquiries Apr 6, 2024

lwasser moved this from Done to Closed in presubmission-inquiries Apr 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Phenopype: a phenotyping pipeline for Python #23

Phenopype: a phenotyping pipeline for Python #23

mluerig commented Mar 26, 2020 •

edited

Loading

lwasser commented Mar 26, 2020

lwasser commented Apr 7, 2020

mluerig commented Apr 7, 2020 •

edited

Loading

lwasser commented Apr 7, 2020

mluerig commented Apr 7, 2020

lwasser commented Apr 7, 2020

mluerig commented Apr 7, 2020

mluerig commented Apr 9, 2020

lwasser commented Apr 9, 2020

lwasser commented Apr 9, 2020

mluerig commented Apr 18, 2020

lwasser commented Apr 20, 2020

mluerig commented Apr 20, 2020

lwasser commented Apr 20, 2020

mluerig commented Apr 20, 2020

lwasser commented Apr 20, 2020

mluerig commented Apr 20, 2020 •

edited

Loading

mluerig commented May 4, 2020

Phenopype: a phenotyping pipeline for Python #23

Phenopype: a phenotyping pipeline for Python #23

Comments

mluerig commented Mar 26, 2020 • edited Loading

Description

Scope

lwasser commented Mar 26, 2020

lwasser commented Apr 7, 2020

mluerig commented Apr 7, 2020 • edited Loading

lwasser commented Apr 7, 2020

mluerig commented Apr 7, 2020

lwasser commented Apr 7, 2020

mluerig commented Apr 7, 2020

mluerig commented Apr 9, 2020

lwasser commented Apr 9, 2020

lwasser commented Apr 9, 2020

mluerig commented Apr 18, 2020

lwasser commented Apr 20, 2020

mluerig commented Apr 20, 2020

lwasser commented Apr 20, 2020

mluerig commented Apr 20, 2020

lwasser commented Apr 20, 2020

mluerig commented Apr 20, 2020 • edited Loading

mluerig commented May 4, 2020

mluerig commented Mar 26, 2020 •

edited

Loading

mluerig commented Apr 7, 2020 •

edited

Loading

mluerig commented Apr 20, 2020 •

edited

Loading