Improve readers by parallelizing I/O and compute operations #5401

ypatia · 2024-12-06T15:16:20Z

Today when a reader issues the I/O request to VFS, we block waiting for all I/O to finish before moving to unfiltering. We then block again waiting for unfiltering to be done for all tiles and then continue to processing the results.

This PR is removing the need to wait for all I/O to be done, and uses async tasks to signal when a tile is done reading so that it can proceed to unfiltering, and when a tile is done unfiltering so that it can proceed to result processing before copying to the user buffers.

TYPE: IMPROVEMENT
DESC: Improve readers by parallelizing I/O and compute operations

This removes the read from waiting on all I/O operations and instead moves the I/O task to be owned by the datablock itself. If the I/O threadpool task is valid, we block on data access. This lets I/O and compute be interleaved by only blocking on data when its ready to be processed and allows for better background data loading.

…e state

This allows for copying the task/future an enabled multiple threads to check the status of the task in a thread-safe manner.

…checking. While the ThreadPool::SharedTask is designed to be used by multiple threads, its designed for copying. The data structure itself is not thread safe. A recursive mutext is needed because some functions like load_chunk_data call back into filtered_data() and would deadlock. This could be handled by also release the locking in load_chunk_data(), but a recursive_mutex is used for better safety against deadlocks.

… forward declaration issues currently

This is needed because we need to access the data buffer from inside the unfiltering task to unfilter into. We can't block on unfiltering being done from inside the unfiltering task so we need different accessors which let us bypass the check on if the unfiltering task is completed.

This is needed because zip_coordinates is called from the unfilter task itself.

…t_block_io

This reverts commit 9647fff.

ypatia and others added 30 commits November 29, 2024 15:28

Threadpool changes

93f16c0

Add threadpool helper function

277272b

Clang format changes

e000852

Cleanup not needed changes

2d0a155

Add some more documentation to new classes

77b52a2

Default initialize and check for null threadpool in task classes

bdd4b48

Address review comments

c92a83c

Address review comments round 2

1066214

Fix unit test

85dee35

Switch to SharedTask in order to allow multi-threaded access to futur…

02d85d2

…e state

Fix unit test compilation

f1eac3a

WIP: parallelize filter pipeline and interleave comparisons

deabecf

Switch to ThreadPool::SharedTask instead of shared_ptr

aaed67a

This allows for copying the task/future an enabled multiple threads to check the status of the task in a thread-safe manner.

WIP: try to store reference to FilterData on result tile, need to fix…

d81dd29

… forward declaration issues currently

Adjust lambdas and avoid task components going out of scope.

e79d531

Add stats tracking to new tasks for reading and unfiltering tiles

c3891fd

Fix unit test compilation

b30b3f9

Add new zip_coordinates_unsafe

fe65979

This is needed because zip_coordinates is called from the unfilter task itself.

Wait until tasks are done before freeing tiles

04ecccc

Remove redundant shared future get

338aa12

Fix null counts, check tasks are valid and other fixes

ceedb1f

Fix RLE and dict decompression

c8c6b17

Fix budget tests, g.o.r. result tile is now 3496 bytes in size

0bb8cdd

Merge branch 'dev' into yt/sc-59606/threadpool_with_tasks

766a8bb

Adaptations to new threadpool

f5c0003

Fix compute task outliving dense reader

15e7a4d

Remove mutex that causes problems (TBD)

0ae71fb

ypatia and others added 4 commits December 6, 2024 16:08

Fix deadlock in merge_result_cell_slabs

bc79307

Fix linux compilation issue

dfdc581

Fix gcc future exception

accc681

Fix missing unit test threadpool linkage

2158495

ypatia marked this pull request as ready for review December 7, 2024 15:05

Merge branch 'yt/sc-59606/threadpool_with_tasks' into yt/sc-59605/don…

68a8235

…t_block_io

ypatia closed this Dec 7, 2024

ypatia reopened this Dec 7, 2024

ypatia marked this pull request as draft December 7, 2024 15:09

ypatia marked this pull request as ready for review December 9, 2024 08:32

Base automatically changed from yt/sc-59606/threadpool_with_tasks to dev December 9, 2024 08:36

Merge branch 'dev' into yt/sc-59605/dont_block_io

3782594

ypatia marked this pull request as draft December 9, 2024 10:06

ypatia added 7 commits December 12, 2024 14:24

Fix tile missing threadpool linkage

c51ba44

Remove duplicate library in cmake

af4857e

Disable temporarily flaky test

3de9216

Attempt to fix asan error

9647fff

Fix segfault in legacy reader

888652a

Revert "Attempt to fix asan error"

81f3166

This reverts commit 9647fff.

Fix some windows tests

88c0ecb

ypatia force-pushed the yt/sc-59605/dont_block_io branch from 73a2245 to 88c0ecb Compare December 18, 2024 09:53

Fix lifetime issues and some namings

d391375

ypatia force-pushed the yt/sc-59605/dont_block_io branch from b1d8be7 to d391375 Compare December 18, 2024 15:30

ypatia added 2 commits December 19, 2024 10:28

Fix ASAN: Destructors of base classes must be virtual

0a97612

Some more PR cleanup

446700a

ypatia force-pushed the yt/sc-59605/dont_block_io branch from ed9b334 to 446700a Compare December 19, 2024 13:11

ypatia changed the title ~~WIP: Improve readers by parallelizing I/O and compute operations~~ Improve readers by parallelizing I/O and compute operations Dec 20, 2024

ypatia marked this pull request as ready for review December 20, 2024 07:59

ypatia requested review from Shelnutt2 and rroelke December 20, 2024 08:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve readers by parallelizing I/O and compute operations #5401

Improve readers by parallelizing I/O and compute operations #5401

ypatia commented Dec 6, 2024 •

edited

Loading

Improve readers by parallelizing I/O and compute operations #5401

Are you sure you want to change the base?

Improve readers by parallelizing I/O and compute operations #5401

Conversation

ypatia commented Dec 6, 2024 • edited Loading

ypatia commented Dec 6, 2024 •

edited

Loading