You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for pointing this out. But I don't see an C# API yet to try this out.
From the comment in WhisperNativeParams.cs I think it is only a point in time ("moment in audio") but I would need start and end for each word
// [EXPERIMENTAL] Token-level timestamps with DTW
// do not use if you haven't computed token-level timestamps with dtw
// Roughly corresponds to the moment in audio in which the token was output
ulong t_dtw;
Thank you for porting Whisper to Unity!
One issue I face is that the timestamps are not very accurate.
I found that there are related projects that try to improve this
Is it possible somehow to make use of these to improve timestamp accuracy?
What would be necessary to achieve this?
Note that there are also related issues on whisper.cpp
The text was updated successfully, but these errors were encountered: