Check exact equality for worker state #3483

bnaul · 2020-02-15T18:53:50Z

Per #3321 (comment), I agree it makes more sense to check for exact equality instead of relying on the hash (just in case 🙂).

I could see an argument for wrapping this tuple in a property so it can be reused, do we think that's preferable? I'd originally wanted to use identity for hash and eq but that has a bit too much going on IMO.

bnaul · 2020-02-15T18:54:04Z

cc @StephanErb

mrocklin · 2020-02-15T19:00:00Z

Thanks for picking this up @bnaul

This looks good to me. I'll merge after either a +1 from @StephanErb or a couple days.

mrocklin · 2020-02-16T01:06:54Z

My guess is that all of the test failures are intermittent. When we changed to spawn by default it unearthed a bunch of things. Squashing down a couple now.

mrocklin · 2020-02-16T01:07:15Z

Well, this one is interesting

    @gen_cluster(timeout=1000, client=True)
    def test_recompute_released_key(c, s, a, b):
        x = c.submit(inc, 100)
        result1 = yield x
        xkey = x.key
        del x
        import gc
    
        gc.collect()
        yield gen.moment
>       assert c.refcount[xkey] == 0
E       assert 1 == 0
E         -1
E         +0

StephanErb

Thanks for doing the follow up. As you have requested my review I have looked into the code a bit deeper. Please see my questions below.

StephanErb · 2020-02-16T13:37:41Z

distributed/scheduler.py

+        return type(self) == type(other) and (self.name, self.host) == (
+            other.name,
+            other.host,
+        )


I am wondering if using name and host is correct in all cases:

name is optional and could be None

host is not necessary unique as multiple workers could be running on the same machine

I am lacking necessary Dask context, so I am wondering why you have not picked address instead? It is used for indexing the WorkerState and the documentation says it is [t]his worker's unique key..

So I would assume, this here should work as expected:

def __hash__(self): return hash(self.address) def __eq__(self, other): return type(self) == type(other) and self.address == other.address

I was just pulling items from the identity() dictionary, no personal reason to prefer one to the other. @TomAugspurger @mrocklin would you agree address makes more sense here?

mrocklin · 2020-02-16T21:50:54Z

Yes, address is probably the best thing to do in both cases.

…

On Sun, Feb 16, 2020 at 1:23 PM Brett Naul ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In distributed/scheduler.py <#3483 (comment)>: > + return type(self) == type(other) and (self.name, self.host) == ( + other.name, + other.host, + ) I was just pulling items from the identity() function, no personal reason to prefer one to the other. @TomAugspurger <https://github.com/TomAugspurger> @mrocklin <https://github.com/mrocklin> would you agree address makes more sense here? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#3483?email_source=notifications&email_token=AACKZTHHWYYG6RFFP5E66PLRDGVFTA5CNFSM4KV3HXIKYY3PNVWWK3TUL52HS4DFWFIHK3DMKJSXC5LFON2FEZLWNFSXPKTDN5WW2ZLOORPWSZGOCVWHPWY#discussion_r379934436>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AACKZTAXOVLPZTZ4VHWKZBLRDGVFTANCNFSM4KV3HXIA> .

bnaul · 2020-02-16T22:22:07Z

thanks @StephanErb @mrocklin , updated w/ your suggestions

StephanErb reviewed Feb 16, 2020

View reviewed changes

Check exact equality for worker state

b93ef94

Use address instead of name+host

3cf2bab

bnaul force-pushed the worker_state_eq branch from 4724d85 to 3cf2bab Compare February 16, 2020 22:00

mrocklin merged commit cc7ecdf into dask:master Feb 17, 2020

jrbourbeau mentioned this pull request Mar 21, 2020

Make Listeners awaitable #3611

Merged

gjoseph92 mentioned this pull request May 20, 2022

Worker addresses are treated as unique identifiers, but may not be #6392

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Check exact equality for worker state #3483

Check exact equality for worker state #3483

bnaul commented Feb 15, 2020

bnaul commented Feb 15, 2020

mrocklin commented Feb 15, 2020

mrocklin commented Feb 16, 2020

mrocklin commented Feb 16, 2020

StephanErb left a comment

StephanErb Feb 16, 2020 •

edited

Loading

bnaul Feb 16, 2020 •

edited

Loading

mrocklin commented Feb 16, 2020 via email

bnaul commented Feb 16, 2020

Check exact equality for worker state #3483

Check exact equality for worker state #3483

Conversation

bnaul commented Feb 15, 2020

bnaul commented Feb 15, 2020

mrocklin commented Feb 15, 2020

mrocklin commented Feb 16, 2020

mrocklin commented Feb 16, 2020

StephanErb left a comment

Choose a reason for hiding this comment

StephanErb Feb 16, 2020 • edited Loading

Choose a reason for hiding this comment

bnaul Feb 16, 2020 • edited Loading

Choose a reason for hiding this comment

mrocklin commented Feb 16, 2020 via email

bnaul commented Feb 16, 2020

StephanErb Feb 16, 2020 •

edited

Loading

bnaul Feb 16, 2020 •

edited

Loading