Audio Transcript heuristic for dynamic thread allocation on client #2023

gfd2020 · 2023-12-11T15:20:12Z

This PR implements a heuristic that tries to keep the remote audio transcription server busy at all times without leaving the client idle. I believe this heuristic should only be enabled when the remote transcription server is slow; for fast servers, it is probably better to leave it disabled.

It is based on 3 principles:

Dynamically adjust the number of remote transcription threads based on server response speed.
Rearrange the audio items so that they are spread across the queue. This helps the client always have something to process, instead of standing idle or only doing the audio work at the end.
Let the client help the server with the transcription task, but only when the client has no other tasks to do.
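
As an illustration of the first principle, here is a minimal sketch of a thread limit that reacts to server response time. It is not the code from this PR: the class `DynamicThreadLimit`, its `targetMillis` threshold, and the one-step increase/decrease rule are all assumptions made for the example. The PR description does not say which adjustment rule is actually used.

```java
/**
 * Hypothetical sketch, not the PR's implementation.
 * Keeps a limit on how many client threads may wait on the remote server,
 * and moves that limit by one step after each server response.
 */
final class DynamicThreadLimit {
    private final int minThreads;
    private final int maxThreads;
    private final long targetMillis; // assumed threshold between "fast" and "slow"
    private int limit;

    DynamicThreadLimit(int minThreads, int maxThreads, long targetMillis) {
        if (minThreads < 1 || maxThreads < minThreads) {
            throw new IllegalArgumentException("need 1 <= minThreads <= maxThreads");
        }
        this.minThreads = minThreads;
        this.maxThreads = maxThreads;
        this.targetMillis = targetMillis;
        this.limit = minThreads;
    }

    /** Records one response time and returns the updated limit. */
    synchronized int onResponse(long elapsedMillis) {
        if (elapsedMillis <= targetMillis) {
            // Server answered quickly: allow one more remote thread, up to the maximum.
            if (limit < maxThreads) {
                limit++;
            }
        } else if (limit > minThreads) {
            // Server answered slowly: free one thread for other client work.
            limit--;
        }
        return limit;
    }

    synchronized int getLimit() {
        return limit;
    }
}
```

In this sketch a slow server causes fewer client threads to block on remote calls, which leaves them free for other processing; a real implementation might smooth the measurements over several responses rather than react to each one.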

This heuristic must be configured in the 'AudioTranscriptConfig.txt' configuration file:

#Performs a heuristic for dynamic thread allocation and spaced requeue. Helps improve performance of slow transcription servers.
clientDynamicThreadRequeueHeuristics = true

#If active, the client will also help the server with the transcription task, only if the client has no other tasks to do. The heuristic must be turned on
clientTranscriptHelp = true

#Defines the implementation class for client help, must be a local implementation ( not remote transcript task )
clientTranscriptHelpImplementationClass = iped.engine.task.transcript.Wav2Vec2TranscriptTask

#Advanced Parameter. Defines which part of the queue the items will be sent to. 4 = 1/4 size. Values greater than or equal to 1
clientSplitQueueRatio = 4

#Advanced Parameter. Sets the delta time in milliseconds when consecutive items are requested to be requeued, provides better spacing.
clientRequeueDeltaTime = 5000
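
To make the effect of `clientSplitQueueRatio` concrete, the sketch below shows one possible reading of "4 = 1/4 size": the requeued item is inserted at index `queueSize / ratio`, counted from the head of the queue. The class `SpacedRequeue` and both of its methods are hypothetical, and whether the PR counts from the head or the tail is an assumption here, not something the description states.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch, not the PR's implementation. */
final class SpacedRequeue {

    /** Assumed rule: position = queueSize / ratio, measured from the head. */
    static int insertIndex(int queueSize, int splitQueueRatio) {
        if (splitQueueRatio < 1) {
            throw new IllegalArgumentException("clientSplitQueueRatio must be >= 1");
        }
        return queueSize / splitQueueRatio;
    }

    /** Inserts the item at the computed position instead of at the end. */
    static <T> void requeue(List<T> queue, T item, int splitQueueRatio) {
        queue.add(insertIndex(queue.size(), splitQueueRatio), item);
    }

    public static void main(String[] args) {
        List<String> queue = new ArrayList<>(List.of("a", "b", "c", "d", "e", "f", "g", "h"));
        requeue(queue, "audio", 4); // 8 / 4 = 2, so "audio" lands at index 2
        System.out.println(queue);
    }
}
```

Under this reading, a ratio of 1 places the item at the end of the queue, and larger ratios move it closer to the head.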

To test the PR, uncomment the parameters above; they are disabled by default.
Audio transcription must also be enabled and configured as usual.

Test cases: any UFDR report that contains multiple items to process in addition to the audio to be transcribed.

lfcnassif · 2023-12-13T15:09:20Z

Thank you @gfd2020! I think I'll only have time to review this when I return from vacation next year, in the second half of January, unless another dev reviews it before me.

gfd2020 added 10 commits December 7, 2023 17:31

add reenqueue item property and fallbacktask

26a6d13

add heuristic config variables to be set on RemoteWav2VectTranscript

dd6570f

add config variables treatment

0adf8c8

ass requeue item property and fallbacktask

b74ab22

compatibility fix for new properties

ab38841

setup config parameters

5b2748f

setup fallback task and call it when necessary

57e49ad

add new reenqueue method with spaced positioning

ad1586f

add new method to add itens on middle of the queue

de1fcf0

add static variables and logic to control the requeue heuristic

e5687f6

gfd2020 mentioned this pull request Dec 11, 2023

Remote Audio Transcription and Idle Client #2022

Open

gfd2020 added 10 commits June 17, 2024 15:07

resolve conflict

6e54dda

fix conflict

36a48d9

fix fork conflicts

95be1ab

fix conflits

5bf47e4

fix conflicts

728c713

fix conflicts

e2b1f05

Merge branch 'sepinf-inc:master' into audio-transcript-heuristic

2abc1ca

fix conflicts

4c434b8

fix conflicts

0d870f3

Merge branch 'sepinf-inc:master' into audio-transcript-heuristic

063ebef
