Describe the bug
If a column fails the initial type check, no index is provided for the failing row in failure_cases["index"], so I am unable to drop that row from the original dataframe.
I have checked that this issue has not already been reported.
I have confirmed this bug exists on the latest version of pandera.
(optional) I have confirmed this bug exists on the master branch of pandera.
Note: Please read this guide detailing how to provide the necessary information for us to reproduce your bug.
Expected behavior
I would expect the failure cases index to include the index of the 3rd row, which contains an integer for Col1 instead of a string. That way, the failing row could be dropped when I want to ignore failing rows and continue processing rather than halt the application. However, no index is present. Printing out exc.failure_cases["index"] reads:
Out[1]:
0 None
Name: index, dtype: object
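Until the failure cases carry a row index, the offending rows can be located with plain pandas. The following is a minimal sketch of that workaround, not pandera's API: the sample data is hypothetical and only mirrors the description above (an integer in the 3rd row of Col1), and the names is_str, bad_index, and cleaned are illustrative.

```python
import pandas as pd

# Hypothetical data mirroring the report: the 3rd row holds an int in Col1.
df = pd.DataFrame({"Col1": ["a", "b", 3, "d"]})

# Element-wise type test: True where the value is a str, False otherwise.
is_str = df["Col1"].map(lambda value: isinstance(value, str))

# Index labels of the rows that fail the test. This is the information
# the report expected to find in failure_cases["index"].
bad_index = df[~is_str].index

# Drop the failing rows and continue processing with the remainder.
cleaned = df.drop(index=bad_index)
```

The element-wise test is slower than a dtype check on large frames, but it identifies individual rows, which a column-level dtype check cannot do.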
Desktop (please complete the following information):
OS: Windows
Hi @shanetorres, which version of pandera are you using?
I'm seeing a different issue: validation passes, which is a bug in the way str types are validated. Can you try doing a development installation of the main branch and re-running your code?
Created this PR to fix the issue I was seeing on the main branch. @shanetorres, let me know whether it also fixes the issue you're seeing in your code.