Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[MM-53432] Improvements #4

Merged
merged 13 commits into from
Nov 15, 2023
Merged

[MM-53432] Improvements #4

merged 13 commits into from
Nov 15, 2023

Conversation

streamer45
Copy link
Contributor

Summary

Implementing some fixes and improvements. (see commits)

@jupenur I am addressing here the requests from previous PRs. Please take a look.

Related PR

mattermost/mattermost-plugin-calls#565

@streamer45 streamer45 added 2: Dev Review Requires review by a core committer 3: Security Review labels Nov 14, 2023
@streamer45 streamer45 self-assigned this Nov 14, 2023
Segment: Segment{
Text: "test sentence",
},
Speaker: "うずまき ナルト",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

😂

Copy link
Member

@jupenur jupenur left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@streamer45 streamer45 added 3: Reviews Complete All reviewers have approved the pull request and removed 2: Dev Review Requires review by a core committer 3: Security Review labels Nov 15, 2023
@streamer45 streamer45 merged commit dd52d13 into MM-54242 Nov 15, 2023
2 checks passed
@streamer45 streamer45 deleted the MM-53432-improvements branch November 15, 2023 17:22
streamer45 added a commit that referenced this pull request Nov 16, 2023
* Implement speech detection step to improve transcription accuracy

* Update build files

* Add comments

* [MM-53432] Improvements (#4)

* Implement more human friendly filenames

* Include Silero VAD model v4

* Cache CGO dependencies and Whisper models in Docker build

* Update silero-vad-go

* Add SHA check for ONNXRuntime

* Support language autodetection

* Initial multi-threading support

* Fix marshaling case

* Tune speech detector silence duration threshold

* Sanitize Text and Speaker strings

* Build as position-independent executable

* Update rtcd client dependency

* Better escaping
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3: Reviews Complete All reviewers have approved the pull request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants