extract_paths, use get_object_state #209

albertz · 2024-10-11T10:36:57Z

This fixes the same problems as #207 but now for extract_paths, by using the same shared code (get_object_state).

This fixes the same problems as #207 but now for extract_paths, by using the same shared code (get_object_state).

albertz · 2024-10-15T08:23:50Z

@NeoLegends @michelwi @curufinwe what is the status here?

michelwi · 2024-10-17T07:03:34Z

This change leads to a RecursionError in one of our gmm training setups
cf. https://bitbucket.org/omnifluent/apptek_asr/pull-requests/1268

(I did not start any debugging besides triggering the pipeline, sorry)

albertz · 2024-10-17T08:42:51Z

Can you tell me for what object this recursion error happens? I checked the AppTek PR pipeline error, but there is no information on the variables, so I cannot tell from that. Maybe you can also enable better_exchook (maybe better_exchook.setup_all()) for that, so then I can see it?

michelwi · 2024-10-18T10:05:00Z

sisyphus/hash.py

+    # so we keep consistent to the behavior of sis_hash_helper.
+    if obj is None:
+        return None
+    if isinstance(obj, (bool, int, float, complex, str)):


if obj is of type np.float then it is an instance of float so get_object_state simply returns it.

But sis_hash_helper(obj) checks for type(obj) in (int, float, bool, str, complex):, which is False and therefore we end up in the else case byte_list.append(sis_hash_helper(get_object_state(obj))) which leads to an infinite recursion.

I pushed some change for that which makes this check more consistent. Can you check again?

Note, np.float is a bad example: This will actually break depending on your Numpy version. In (very old) Numpy versions, yes, np.float is not float but derived from float. However, in newer Numpy versions, np.float is just an alias to float.

DeprecationWarning: np.float is a deprecated alias for the builtin float. To silence this warning, use float by itself. Doing this will not modify any behavior and is safe. If you specifically wanted the numpy scalar type, use np.float64 here.
Deprecated in NumPy 1.20; for more details and guidance: https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations

So it means, if you really used np.float in your setup, the hash will break when you update Numpy...

A better example is a namedtuple. And here I accidentally also broke some hash now, but this is fixed now. Specifically, due to the type(obj) in (tuple, list) check, which was False, it also falls back to the get_object_state logic for namedtuples, so it's important that get_object_state behaves the same as before for namedtuples. This is what I do now.

curufinwe · 2024-10-21T09:07:47Z

AppTek pipeline no longer crashes.

albertz · 2024-10-21T09:21:37Z

So it means ok to merge?

curufinwe

LGTM

extract_paths, use get_object_state

691e0c3

This fixes the same problems as #207 but now for extract_paths, by using the same shared code (get_object_state).

albertz requested review from critias, curufinwe and michelwi October 11, 2024 10:37

albertz marked this pull request as draft October 11, 2024 10:37

albertz requested a review from NeoLegends October 11, 2024 10:38

fix extract_paths

9c7d70e

albertz marked this pull request as ready for review October 11, 2024 11:35

albertz added 2 commits October 11, 2024 14:20

cleanup

6c725da

test_extract_paths_functools_partial

681a36e

michelwi reviewed Oct 18, 2024

View reviewed changes

albertz added 5 commits October 18, 2024 12:12

fix infinite recursion, more consistent type check

b504060

small fix

e4b3f85

fix hash

c5abce1

small fix

91db76f

fix

5f3681a

curufinwe approved these changes Oct 21, 2024

View reviewed changes

NeoLegends approved these changes Oct 21, 2024

View reviewed changes

albertz merged commit 311ebfd into master Oct 21, 2024
3 checks passed

albertz deleted the albert-fix-extract-paths branch October 21, 2024 11:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

extract_paths, use get_object_state #209

extract_paths, use get_object_state #209

albertz commented Oct 11, 2024

albertz commented Oct 15, 2024

michelwi commented Oct 17, 2024

albertz commented Oct 17, 2024

michelwi Oct 18, 2024

albertz Oct 18, 2024

This comment was marked as resolved.

This comment was marked as resolved.

albertz Oct 18, 2024

curufinwe commented Oct 21, 2024

albertz commented Oct 21, 2024

curufinwe left a comment

extract_paths, use get_object_state #209

extract_paths, use get_object_state #209

Conversation

albertz commented Oct 11, 2024

albertz commented Oct 15, 2024

michelwi commented Oct 17, 2024

albertz commented Oct 17, 2024

michelwi Oct 18, 2024

Choose a reason for hiding this comment

albertz Oct 18, 2024

Choose a reason for hiding this comment

This comment was marked as resolved.

This comment was marked as resolved.

albertz Oct 18, 2024

Choose a reason for hiding this comment

curufinwe commented Oct 21, 2024

albertz commented Oct 21, 2024

curufinwe left a comment

Choose a reason for hiding this comment