0.4.0
v0.4.0
Bugfixes 🐛
- eda: fix string type (b7e3321)
- eda: fix value table display (57281bc)
- eda: remove imdt output from plot (5c227e1)
- eda: adjusted save report method to accept one parameter (4ceefcc)
- eda: clean config code and fix scatter sample param (8ab27f9)
- plot_diff: fix ci issue (44ce81c)
- clean: clean_duplication issue 646 (ca9f708)
- eda: fix category type error (9750694)
Features ✨
- eda: refactored code and added density parameter to plot_diff(df) (323ae6b)
- eda: save imdt as json file (7867386)
- connector: integrate connectorx into connector (106457e, a64e356, 9f89d3b)
- clean: add clean_ml function (909cd19)
- clean: add multiple clean functions for number types (3c05be5)
- eda.diff: add plot_diff([df1..dfn], continuous) (3bfb4f5)
- clean: support conversion into packed binary format in clean_ip (7e30f93, 37a83b0)
Code Quality + Testing 💯
Performance 🚀
- clean: update documentation of clean_duplication (50f90fa)
Documentation 📃
- clean: change the introduction (862b447)
- eda: change eda colab position (ce25b17, d00b0bd)
- clean: add documentation for multiple clean functions for number types (732480f)
- clean: add documentation for clean_ml function (0c139db)
- eda: scattter.sample_rate added to documentation (549b319)
- eda: fix plot show (0b40a40)
- readme: add benchmark link (e807f79)
- readme: small text change on clean and connector (e193a6a)
- readme: fix titanc link (29cc06c)
Contributors this release 🏆
The following users contributed code to DataPrep since the last release.
- Devin Lu <[email protected]> (First time contributor) ⭐️
- dylanzxc <[email protected]>
- Jinglin Peng <[email protected]>
- Noir Tree <[email protected]> (First time contributor) ⭐️
- pwwang <[email protected]> (First time contributor) ⭐️
- qidanrui <[email protected]>
- sahmad11 <[email protected]> (First time contributor) ⭐️
- waterpine <[email protected]>
- Weiyuan Wu <[email protected]>
- Xiaoying Wang <[email protected]> (First time contributor) ⭐️
🎉🎉 Thank you! 🎉🎉