
Limit coroutines using a pool instead of chunks #1544

Merged — 4 commits merged into fsspec:master from sisp:perf/coroutine-pool on Mar 15, 2024

Conversation

@sisp (Contributor) commented Mar 13, 2024

I've changed the implementation of the function _run_coros_in_chunks() to use a coroutine pool instead of running coroutines in chunks.

The goal of this function is to limit the number of simultaneous coroutines, but chunking may lead to inferior throughput when coroutines have different execution times, as the next chunk will run only once all coroutines in the previous chunk have completed. In contrast, a coroutine pool will run the next coroutine in the queue as soon as any of the running coroutines completes.
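To illustrate the difference, here is a minimal sketch of the two strategies (hypothetical function names and a plain list of coroutines for simplicity; these are not the actual fsspec signatures):

```python
import asyncio

async def run_in_chunks(coros, batch_size):
    # Chunked limiting: the next chunk starts only after *every*
    # coroutine in the current chunk has finished, so one slow
    # coroutine stalls everything queued behind its chunk.
    results = []
    for i in range(0, len(coros), batch_size):
        results.extend(await asyncio.gather(*coros[i : i + batch_size]))
    return results

async def run_with_pool(coros, batch_size):
    # Pool-style limiting: a semaphore caps how many coroutines run
    # at once; the next one starts as soon as any slot frees up.
    semaphore = asyncio.Semaphore(batch_size)

    async def worker(coro):
        async with semaphore:
            return await coro

    return await asyncio.gather(*(worker(c) for c in coros))
```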

fsspec/asyn.py (outdated review thread, resolved)
@sisp (Contributor, Author) commented Mar 13, 2024

The check CI / gcsfs-pytest is failing with this error:

FAILED gcsfs/gcsfs/tests/test_core.py::test_sign - ImportError: cannot import name 'storage' from 'google.cloud' (unknown location)

But I doubt it's related to this PR.

fsspec/asyn.py (outdated):

```python
semaphore = asyncio.BoundedSemaphore(batch_size)

async def _worker(coro):
    async with semaphore:
```
Member:

With this approach, I agree that a limited number of IO tasks are concurrently running/waiting, but is there any potential downside of simply having a very large number of tasks created and waiting on this semaphore? The previous incantation would only have the given number of top-level tasks at a time.

@sisp (Contributor, Author):

Good question. I'm not aware of any downsides, but I can't prove there aren't any. I was also considering using asyncio.as_completed and passing an iterable of asyncio.Task objects, but it also materializes the iterable as a set internally.

Member:

Let's leave this question open for a little while, to see if any knowledgeable person comments.

@sisp (Contributor, Author):

FWIW, I've used this implementation to limit the simultaneous execution of 70,000 coroutines and it worked fine.

@sisp (Contributor, Author):

You might have a point: https://stackoverflow.com/a/62404509. I wonder whether we could use the queue-based approach shown there instead.
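For reference, the queue-based pattern from that answer looks roughly like this (a sketch, not the PR's code; the function name and signature are illustrative):

```python
import asyncio

async def run_with_workers(coros, batch_size):
    # Queue-based limiting: only `batch_size` worker tasks exist at all,
    # so there is never a large number of tasks parked on a semaphore.
    queue = asyncio.Queue()
    for item in enumerate(coros):
        queue.put_nowait(item)

    results = [None] * len(coros)

    async def worker():
        while not queue.empty():
            # No await between the empty() check and get_nowait(),
            # so another worker cannot steal the item in between.
            i, coro = queue.get_nowait()
            results[i] = await coro

    await asyncio.gather(*(worker() for _ in range(batch_size)))
    return results
```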

@sisp (Contributor, Author):

I've pushed an implementation based on asyncio.wait inspired by https://death.andgravity.com/limit-concurrency#asyncio-wait. I think this looks good and tests are passing.
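The pattern from the linked article, in simplified sketch form (it ignores result ordering and exception handling, which a real implementation may need to deal with):

```python
import asyncio

async def run_with_wait(coros, batch_size):
    # asyncio.wait-based limiting: keep at most `batch_size` tasks in
    # flight, and top the set back up whenever any of them completes.
    coros = iter(coros)
    pending = set()
    results = []
    while True:
        # Refill the in-flight set up to the concurrency limit.
        while len(pending) < batch_size:
            try:
                coro = next(coros)
            except StopIteration:
                break
            pending.add(asyncio.ensure_future(coro))
        if not pending:
            break
        done, pending = await asyncio.wait(
            pending, return_when=asyncio.FIRST_COMPLETED
        )
        results.extend(task.result() for task in done)
    return results
```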

Contributor:

I actually think the linked solution may be more efficient. Does this solution run into the issue that the next batch does not start until the current batch finishes? That would mean that, with a large batch size, future workers are stalled waiting on the slowest worker. Or am I misunderstanding the current batched async method?

@sisp (Contributor, Author):

> Does this solution run into the issue that the next batch does not start until the current batch finishes? That means with large batch size, future workers will be stalled waiting on the slowest worker?

Correct, the slowest coroutine in a batch determines the runtime of the whole batch, and the next batch won't start before all coroutines in the current batch have finished.
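A toy timing example of the stall (illustrative numbers only, using plain sleeps in place of IO):

```python
import asyncio
import time

async def main():
    durations = [3, 1, 1, 1]
    batch_size = 2

    # Chunked: batch [3, 1] takes 3 s, then batch [1, 1] takes 1 s -> ~4 s.
    start = time.monotonic()
    for i in range(0, len(durations), batch_size):
        await asyncio.gather(
            *(asyncio.sleep(d) for d in durations[i : i + batch_size])
        )
    print(f"chunked: {time.monotonic() - start:.1f}s")  # ~4.0s

    # Pooled: the 1 s coroutines flow through the free slot while the
    # 3 s coroutine is still running -> ~3 s.
    semaphore = asyncio.Semaphore(batch_size)

    async def limited(d):
        async with semaphore:
            await asyncio.sleep(d)

    start = time.monotonic()
    await asyncio.gather(*(limited(d) for d in durations))
    print(f"pooled: {time.monotonic() - start:.1f}s")  # ~3.0s

asyncio.run(main())
```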

Contributor:

I think some of the solutions I linked would have alleviated that issue by eagerly starting coroutines from the next batch.

@sisp (Contributor, Author):

The solution that @martindurant merged is essentially https://death.andgravity.com/limit-concurrency#asyncio-wait, which you linked.

@sisp force-pushed the perf/coroutine-pool branch from bbb8b3d to 5890050 on March 15, 2024 at 09:38, and requested a review from @martindurant at 09:47.
fsspec/asyn.py (review thread, resolved)
@martindurant merged commit f2f4c26 into fsspec:master on Mar 15, 2024 — 10 checks passed.
@sisp deleted the perf/coroutine-pool branch on March 15, 2024 at 19:46.