Reduce Pandas deprecation warnings. #263

jimwhite · 2024-09-18T20:18:06Z

Hi Stefan,

This is the bulk of the changes needed to reduce or eliminate Pandas future/deprecation warnings when using Pandas 2.2. There were about 33000 warnings when running the tests, and this gets that down to around 1500. This change is a WIP, but I wanted to create the PR early and get your feedback on how to proceed. I'm also still dealing with errors in tests/data/test_daily_bars.py and tests/test_bar_data.py (I think the issue has to do with the timestamp format in the test data init).

I've only tested with Python 3.11 on macOS, so I expect to see issues with the other versions in the GitHub Actions testing.

I really appreciate the work you've done to keep Zipline alive. I've got a Polygon.io stock data bundle in progress (https://github.com/fovi-llc/zipline-polygon-bundle) which is finally working end-to-end (though much remains to be done).

Thanks!
Jim

stefan-jansen · 2024-09-26T14:27:12Z

Thanks @jimwhite, please let me know if you have any questions. Hope the new release suits your situation.

jimwhite · 2024-09-26T21:28:30Z

> Thanks @jimwhite, please let me know if you have any questions. Hope the new release suits your situation.

Well, the problem (the high warning count when running the tests) remains: the count is now 53371 (up from around 31K). The test summary:

=================================================================================== short test summary info ===================================================================================
FAILED tests/pipeline/test_factor.py::SummaryTestCase::test_summaries_after_fillna - zipline.testing.core.SubTestFailures: failures:
FAILED tests/pipeline/test_factor.py::SummaryTestCase::test_summary_methods - zipline.testing.core.SubTestFailures: failures:
FAILED tests/pipeline/test_statistical.py::TestStatisticalBuiltIns::test_correlation_factors[4-2] - KeyError: TestingDataSet<US>.float_col::float64
FAILED tests/pipeline/test_statistical.py::TestStatisticalBuiltIns::test_simple_beta_target - assert Equity(1 [A]) is Equity(1 [A])
FAILED tests/pipeline/test_statistical.py::StatisticalMethodsTestCase::test_factor_correlation_methods - zipline.testing.core.SubTestFailures: failures:
================================================= 5 failed, 3163 passed, 16 skipped, 5 xfailed, 1 xpassed, 53371 warnings in 96.53s (0:01:36) =================================================

My reason for trying to get the warning count down to something more manageable is that I'm thinking in terms of future maintenance and enhancements for zipline-reloaded.
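One way to make a count like this more manageable is to group the warnings by category and message, so the most frequent sources can be fixed first. The following is only a minimal sketch using the standard library; `tally_warnings` and `noisy` are hypothetical names made up for illustration and are not part of zipline-reloaded or its test suite.

```python
import warnings
from collections import Counter


def tally_warnings(func):
    """Run func and count the warnings it emits, keyed by (category name, message)."""
    with warnings.catch_warnings(record=True) as caught:
        # "always" disables the once-per-location de-duplication,
        # so every emitted warning is recorded.
        warnings.simplefilter("always")
        func()
    return Counter((w.category.__name__, str(w.message)) for w in caught)


def noisy():
    """Hypothetical stand-in for code that triggers repeated warnings."""
    for _ in range(3):
        warnings.warn("stack is deprecated", FutureWarning)
    warnings.warn("old API", DeprecationWarning)


counts = tally_warnings(noisy)

# Print the most frequent warnings first.
for (category, message), n in counts.most_common():
    print(f"{n:5d}  {category}: {message}")
```

Sorting by frequency tends to show that a handful of call sites account for most of the total, which is usually where the first fixes pay off.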

For example, I'm working on a Polygon.io bundle (https://github.com/fovi-llc/zipline-polygon-bundle), and the fact that bcolz has now also been reloaded has me thinking about replacing it with (or adding as an alternative) PyArrow Hive.

Shall I update my PR to this new HEAD?

Jim White added 8 commits September 18, 2024 01:46

Fix df.stack future_stack warning.

52a409a

Reduce warnings like for iloc and future_stack.

68b7901

Fix warning.

51a8a7c

Reduce warnings.

a076b97

Reduce warnings.

375ed8b

Changes to reduce Pandas deprecation warnings from 33000 to 1600.

a1a924f

Dict.iteritems deprecated in Python 3, use Dict.items instead.

057a7fc

Another couple .ilocs needed.

2eec36c
