brandonlockhart/dataprep_cleaning_api

DataPrep Cleaning APIs

Roadmap

End of Summer 2020

  • Implement two APIs
  • Edit the API explanation files where needed to clarify the requirements or add implementation ideas
  • Add real-world messy datasets, or links to them, to the datasets folder

Fall 2020

Get DataPrep's Data Cleaning component off to a strong start by implementing a few APIs with high-quality, thoroughly tested, well-structured code.

Planned APIs to implement:

  1. clean_text()
  2. clean_email()
  3. clean_url()
  4. clean_ml()
  5. clean_date()
  6. clean_address()
  7. clean_country()
  8. clean_phone()
  9. clean_lat_long()
  10. clean_ip()
  11. clean_currency()
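
To give a rough idea of what one of these cleaning functions might do, here is a minimal sketch in the spirit of clean_email(). Everything in it is an assumption made for illustration: the name `clean_email_sketch`, the list-in/list-out signature, the simplified regular expression, and the choice to return None for invalid values are hypothetical and do not describe the planned DataPrep API.

```python
import re
from typing import List, Optional

# Deliberately simple pattern (assumption): real email validation is far
# more involved than this and would need a fuller specification.
_EMAIL_RE = re.compile(r"^[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}$")


def clean_email_sketch(values: List[Optional[str]]) -> List[Optional[str]]:
    """Hypothetical helper: trim and lowercase each value, and replace
    anything that does not look like an email address with None."""
    cleaned: List[Optional[str]] = []
    for value in values:
        if value is None:
            # Missing values are passed through unchanged.
            cleaned.append(None)
            continue
        candidate = value.strip().lower()
        cleaned.append(candidate if _EMAIL_RE.match(candidate) else None)
    return cleaned


result = clean_email_sketch(["  Alice@Example.COM ", "not-an-email", None])
print(result)
```

Lowercasing the whole address is a simplification; an actual implementation might preserve the local part and would likely operate on DataFrame columns rather than plain lists.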

Spring 2021

  • Implement more APIs
  • Consider domain specific cleaning scenarios (e.g., finance, biology)
  • Marketing: blog post and videos
