-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Submission: PrepPy (Python) #5
Comments
[Draft review - work in progress] Package Review
DocumentationThe package includes all the following forms of documentation:
Readme requirements
The README should include, from top to bottom:
Functionality
Estimated hours spent reviewing: 2.5 Review CommentsOverall, quite a useful package that resolves some of the pain points I had with |
Package ReviewPlease check off boxes as applicable, and elaborate in comments below. Your review is not limited to these topics, as described in the reviewer guide
DocumentationThe package includes all the following forms of documentation:
Readme requirements
The README should include, from top to bottom:
Functionality
Estimated hours spent reviewing:
|
Hi James, Thank you for your valuable feedbacks and we have addressed the following items:
The most recent release could be found through this link. Thanks, Jasmine |
Hi @moniquewong Thank you for your valuable feedback! We have addressed the following items:
The most recent release can be found here. Thanks, Matt |
Submitting Authors: George Thio (@gptzjs), Matthew Connell (@matthewconnell), Jasmine Qin (@jasmineqyj), Chimaobi Amadi ( @camadi)
Package Name: preppy524
One-Line Description of Package: A python package for data preprocessing for machine learning
Repository Link: https://github.com/UBC-MDS/PrepPy
Version submitted: v1.2.0
Editor: Varada Kolhatkar (@kvarada)
Reviewer 1: Monique Wong (@moniquewong)
Reviewer 2: Mengzhe Huang (@jamesh4)
Archive: TBD
Version accepted: TBD
Description
preppy524 is a package for Python to help preprocessing in machine learning tasks. There are certain repetitive tasks that come up often when doing a machine learning project and this package aims to alleviate those chores. Some of the issues that come up regularly are: finding the types of each column in a dataframe, splitting the data (whether into train/test sets or train/test/validation sets, one-hot encoding, and scaling features. This package will help with all of those tasks.
Scope
* Please fill out a pre-submission inquiry before submitting a data visualization package. For more info, see this section of our guidebook.
train_valid_test_split
: This function splits the data set into train, validation, and test sets.data_type
: This function identifies data types for each column/feature. It returns one dataframe for each type of data.one-hot
: This function performs one-hot encoding on the categorical features and returns a dataframe for the train, test, validation sets with sensible column names.scaler
: This function performs standard scaling on the numerical features.Machine Learning Engineers, Data Scientists, students and any other person who is interested in preprocessing data before running machine learning models.
No single package does the four different functions of
preppy524
but there are some functions that does some part of thepreppy524
package.@tag
the editor you contacted:None
Technical checks
For details about the pyOpenSci packaging requirements, see our packaging guide. Confirm each of the following by checking the box. This package:
Publication options
No
JOSS Checks
paper.md
matching JOSS's requirements with a high-level description in the package root or ininst/
.Note: Do not submit your package separately to JOSS
Are you OK with Reviewers Submitting Issues and/or pull requests to your Repo Directly?
This option will allow reviewers to open smaller issues that can then be linked to PR's rather than submitting a more dense text based review. It will also allow you to demonstrate addressing the issue via PR links.
Code of conduct
P.S. Have feedback/comments about our review process? Leave a comment here
Editor and Review Templates
Editor and review templates can be found here
The text was updated successfully, but these errors were encountered: