Partial preemption for groups with multiple sequences #574

popovaan · 2024-07-03T09:39:23Z

Partial preemption for groups with multiple sequences.
Fixed a bug in can_append_stots()

Ticket: 140648

…n_imporvement

iefode · 2024-07-06T19:02:19Z

LGTM

ilya-lavrenov · 2024-07-30T08:54:37Z

src/cpp/continuous_batching/src/scheduler.hpp

-            total_num_released_blocks += released_blocks;
-            prev_blocks_count = m_block_manager.num_free_blocks();
-
+        if (num_running_sequences > 1) {


do we need to distinguish such cases? Looks like multiple sequences within a group is more generic case and should cover single sequence case as well.

Not necessary to distinguish. But for single sequence case we can release blocks more efficiently than in general case using resize and not release blocks layer by layer:

openvino.genai/src/cpp/src/block_manager.hpp

Line 413 in 9d35767

m_block_table[seq_id].resize(m_block_table[seq_id].size() - block_num);

So it was distinguished only in terms of efficiency.

Removed distinguishing between these cases in PR as discussed.

ilya-lavrenov · 2024-07-30T08:57:57Z

src/cpp/continuous_batching/src/block_manager.hpp

+    const bool free_group_partially_multiple_runnning_sequence(SequenceGroup::Ptr sequence_group, size_t num_required_blocks, size_t& phisical_blocks_released, size_t& logical_blocks_released) {
+        phisical_blocks_released = 0;
+        logical_blocks_released = 0;
+        while (num_required_blocks > phisical_blocks_released) {


if case we need to preempt very long sequence, such loop can be expensive.

If it's possible, it would be great to compute a number of preempted blocks based on required number of blocks.

Simplified this method using formula blocks_to_remove_per_sequence == ceil(num_required_blocks / sequence_num).

ilya-lavrenov · 2024-07-30T08:59:10Z

src/cpp/continuous_batching/src/block_manager.hpp

@@ -110,6 +110,78 @@ class BlockManager {
        return m_block_table[seq_id];
    }

+    const size_t free_rightest_blocks(SequenceGroup::Ptr sequence_group) {


if this (or some other) methods are not planned to be used as public API, let's move them to private section.

Removed this method.

Implemented partial preemtion for groups with multiple sequences.

54481f0

popovaan requested a review from iefode July 3, 2024 09:39

popovaan added 5 commits July 3, 2024 11:40

Merge remote-tracking branch 'upstream/master' into partial_preemptio…

8f0c1b7

…n_imporvement

Removed not used method.

d7fe87f

Uncommented multinomial test.

2cc5114

Merge remote-tracking branch 'upstream/master' into partial_preemptio…

59d9cde

…n_imporvement

Removed not used code.

a2fd89a

iefode approved these changes Jul 6, 2024

View reviewed changes

Wovchena enabled auto-merge July 7, 2024 08:32

Merge branch 'master' into partial_preemption_imporvement

0c60e64

pavel-esir approved these changes Jul 8, 2024

View reviewed changes

Wovchena added this pull request to the merge queue Jul 8, 2024

Merged via the queue into openvinotoolkit:master with commit 602f099 Jul 8, 2024
24 checks passed

ilya-lavrenov reviewed Jul 30, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Partial preemption for groups with multiple sequences #574

Partial preemption for groups with multiple sequences #574

popovaan commented Jul 3, 2024

iefode commented Jul 6, 2024

ilya-lavrenov Jul 30, 2024

popovaan Jul 30, 2024 •

edited

Loading

popovaan Aug 2, 2024

ilya-lavrenov Jul 30, 2024

popovaan Aug 2, 2024

ilya-lavrenov Jul 30, 2024

popovaan Aug 2, 2024

Partial preemption for groups with multiple sequences #574

Partial preemption for groups with multiple sequences #574

Conversation

popovaan commented Jul 3, 2024

iefode commented Jul 6, 2024

ilya-lavrenov Jul 30, 2024

Choose a reason for hiding this comment

popovaan Jul 30, 2024 • edited Loading

Choose a reason for hiding this comment

popovaan Aug 2, 2024

Choose a reason for hiding this comment

ilya-lavrenov Jul 30, 2024

Choose a reason for hiding this comment

popovaan Aug 2, 2024

Choose a reason for hiding this comment

ilya-lavrenov Jul 30, 2024

Choose a reason for hiding this comment

popovaan Aug 2, 2024

Choose a reason for hiding this comment

popovaan Jul 30, 2024 •

edited

Loading