-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Timeseries data check failing - coersion issue #169
Comments
Update: This bug is very complicated. The initial suspicion that this was due to NA values in the dataset turned out to be incorrect. I've test the dataset against itself, and there are no NA values in the dataset to start with. This bug still comes up when there are no NA values in the datasets and the datasets are identical. The next guess that was looked into was related to how the data is transformed during the Side note: I tried to put the data in as MST, seeing as that's the timezone it was collected in, however, this did not do what was intended. It changed the time to be 7 hours earlier than UTC time when I tried to import it like that. Additionally, even when the data was timezone aware, it still got changed to the UTC timezone during the check. Even with all data as timezone aware in the same timezone, both the warning and TypeError are still present. I'll update this further with what else I try to get around this error. |
Alright we found the issue. Turns out the bug and the warning are not as related as we previously suspected. However, they are fixed by the same change to the code. The warning was being caused by trying to convert a Pandas DataFrame column with a The error we were getting, So the fix is relatively straight forward. We will be adding in the following chunk of code:
This changes the dates to integers which can than be properly checked by I'll make a pr with these changes and a few tests to make sure they behave properly. @lwasser |
awesome @nkorinek thank you!!! |
gist explaining my current dilemna: https://gist.github.com/nkorinek/accc2b905145763dc2035300a895204c |
it looks like we have a fix with this in #185 !! yay! @ryla5068 will test this fix against homework 1. and if it works, he will update the tests there that don't work now... and then we can merge this PR!! Next step will be writing tests for the time series module!! |
notes about this issue """" matplotlib DOES provide functions for converting data from this weird format matplotlib's documentation claims that negative values for datetime data are As you might imagine, this presents a number of issues for comparing datetime Additionally, there is the issue of floating point truncation error. matplotlib So to solve these problems, we have done two things: First, we don't bother to try to convert from matplotlib's data ourselves. Second, we use numpy.testing.assert_array_max_ulp() for comparing datetime data For more info about the issues we've faced with this, take a look at PR #185 |
this will be closed via #219 which is almost done!! |
This can be closed @lwasser |
closing!! |
TEST
Output error: i suspect this has something to do with NAN values but i am not sure
The text was updated successfully, but these errors were encountered: