Disable numba caching via environment variable #869

Closed
timothymillar opened this issue Jul 3, 2022 · 4 comments · Fixed by #996

timothymillar commented Jul 3, 2022

Edit: related to #371

I've recently started experimenting with sgkit on a SLURM cluster, which is working well with the exception of methods that use guvectorize with cache=True. Calling these functions results in a segmentation fault on the worker. This only seems to be an issue with guvectorize (not the jit or vectorize decorators), and there is no segmentation fault if I set cache=False.

There are a couple of open issues that may be related, although neither quite matches what I'm seeing (I need to dig some more).

There is also an open issue for globally disabling numba caching, which would provide a workaround, although it might be stale.

In the meantime, for the sake of debugging and workarounds, it would be useful to be able to disable numba caching in sgkit via an environment variable.
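As a sketch of what such a switch could look like (the variable name SGKIT_DISABLE_NUMBA_CACHE matches the one discussed later in this thread; the helper itself is hypothetical, not sgkit's actual code):

```python
import os

def cache_enabled(environ=os.environ) -> bool:
    """Return False when SGKIT_DISABLE_NUMBA_CACHE is set to "1".

    The result would be forwarded as the ``cache=`` argument to numba's
    decorators, e.g. numba.guvectorize(sigs, layout, cache=cache_enabled()).
    """
    return environ.get("SGKIT_DISABLE_NUMBA_CACHE", "0") != "1"

print(cache_enabled({}))                                  # True (caching on by default)
print(cache_enabled({"SGKIT_DISABLE_NUMBA_CACHE": "1"}))  # False (caching disabled)
```

Taking the environment as a parameter keeps the helper trivially testable without mutating the real process environment.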


tomwhite commented Aug 2, 2022

Can this be closed now that #870 is in?

@timothymillar

Maybe we should leave it open for now to document the SGKIT_DISABLE_NUMBA_CACHE variable. I also wondered if you had a suggestion for testing in CI that setting the variable works as expected?
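One way to exercise this in CI (a self-contained sketch: since a flag like this is typically read at import time, the test sets the variable and then loads or reloads the module; fake_sgkit_flags is a throwaway stand-in written to disk here, not a real sgkit module):

```python
import importlib
import os
import sys
import tempfile
import textwrap

# Stand-in for a module that evaluates the flag at import time.
mod_src = textwrap.dedent("""
    import os
    CACHE = os.environ.get("SGKIT_DISABLE_NUMBA_CACHE", "0") != "1"
""")

tmpdir = tempfile.mkdtemp()
with open(os.path.join(tmpdir, "fake_sgkit_flags.py"), "w") as f:
    f.write(mod_src)
sys.path.insert(0, tmpdir)

# With the variable set, caching should be reported as disabled ...
os.environ["SGKIT_DISABLE_NUMBA_CACHE"] = "1"
import fake_sgkit_flags
assert fake_sgkit_flags.CACHE is False

# ... and reloading after resetting it flips the flag back on.
os.environ["SGKIT_DISABLE_NUMBA_CACHE"] = "0"
importlib.reload(fake_sgkit_flags)
assert fake_sgkit_flags.CACHE is True
```

In a real pytest suite one would more likely use monkeypatch.setenv plus importlib.reload, but the shape of the check is the same.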

@benjeffery

I've hit this via #1051. Interestingly, I get a different error (TypeError: can not serialize 'numpy.int64' object) if I disable task fusion in dask (dask.config.set({"optimization.fuse.active": False})).


benjeffery commented Mar 9, 2023

After much digging I have discovered some interesting things about these segfaults.
As above, turning off dask task fusion results in the serialization error. Digging into the code, this is because we are passing numpy.int64 to some dask methods instead of int. For example, if I change:

```python
@wraps(gufunc)
def func(x: ArrayLike, cohort: ArrayLike, n: int, axis: int = -1) -> ArrayLike:
    x = da.swapaxes(da.asarray(x), axis, -1)
```

(from cohort_numba_fns.py) to:

```python
@wraps(gufunc)
def func(x: ArrayLike, cohort: ArrayLike, n: int, axis: int = -1) -> ArrayLike:
    n = int(n)
    axis = int(axis)
    x = da.swapaxes(da.asarray(x), axis, -1)
```

Then the serialisation error is fixed!

BUT if I then turn dask task fusion back on, the segfault is gone! So I think that in the fused task a compiled func is expecting an int but getting a numpy.int64, and then segfaulting?
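The mismatch is easy to reproduce outside dask; for example, json's serializer rejects numpy scalar ints in the same way until they are coerced (a minimal stand-in for the dask serialization path, not the actual code involved):

```python
import json
import numpy as np

n = np.int64(5)
print(isinstance(n, int))  # False: numpy scalar ints are not Python ints

try:
    json.dumps({"n": n})  # rejected, like dask's serializer above
except TypeError as e:
    print("serialization fails:", e)

print(json.dumps({"n": int(n)}))  # coercing with int() fixes it
```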

There are other segfaults still happening - I assume they are due to similar issues.

(@jeromekelleher numpy ints strike again!)
