-
-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
1708 adjust data cleaning script to prune data points outside of la neighborhood districts #1744
1708 adjust data cleaning script to prune data points outside of la neighborhood districts #1744
Conversation
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
Hi @mru-hub, will review before Sunday 6/9. Thank you! |
Hi @mru-hub, I reviewed the script and did some testing. I think we need a video chat session to discuss further on how to implment this before I can add documentation for my review here. I've send you a slack message, please let me know in Slack when would be a good time to chat. |
Hi @mru-hub I pushed the modified script that we discussed to this PR. |
@Skydodle Connected with Johnny and decided to test the code locally by directly passing a filtered Parquet file to the logic instead of using the Hugging Face file. |
Hi @Skydodle I pushed the modified script to this PR. Kindly review. |
Update: enabled testing on browser by pointing the file registration to the filtered csv file in public folder. @mru-hub will continue in-depth testing and implement finalized script to integrate with cron job. Thanks. |
Based on new requirements, @mru-hub will just provide 1 script (containing instructions on how to test locally) |
c22cfe3
to
fc1c7a0
Compare
…ning-script-to-prune-data-points-outside-of-la-neighborhood-districts
Cleaning Script: updateHfDataset_FilterByBoundaries.py :Added a new Python script, updateHfDataset_FilterByBoundaries.py, to streamline the data filtering based on geographic boundaries. Local Testing Instructions: This script contains instructions on how to test the functionality locally. Integration Testing Steps: Included steps for integration testing to facilitate future updates via cron jobs. Functionality Changes from current/base 'updateHfDataset.py' script:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍 thanks so much @mru-hub, this is approved. Merge when ready!
Fixes #1708
main
branchAny questions? See the getting started guide