- implement two APIs
- edit the API explanation files if needed to clarify the requirements or add ideas for implementation
- add real world messy datasets or links to messy datasets to the datasets folder
Get DataPrep's Data Cleaning component off to a strong start by implementing a few APIs with high quality, thoroughly tested, and well structured code.
clean_text()
clean_email()
clean_url()
clean_ml()
clean_date()
clean_address()
clean_country()
clean_phone()
clean_lat_long()
clean_ip()
clean_currency()
- Implement more APIs
- Consider domain specific cleaning scenarios (e.g., finance, biology)
- Marketing: blog post and videos