Audio Streaming Gateway Suzu

About Shiguredo's open source software

We will not respond to PRs or issues that have not been discussed on Discord. Also, Discord is only available in Japanese.

Please read https://github.com/shiguredo/oss/blob/master/README.en.md before use.

時雨堂のオープンソースソフトウェアについて

利用前に https://github.com/shiguredo/oss をお読みください。

Audio Streaming Gateway Suzu について

Suzu は WebRTC SFU Sora 専用の音声解析用ゲートウェイです。 Suzu は Sora から送られてくる音声ストリーミングを HTTP/2 経由で受け取り、音声解析サービスに転送し、その解析結果を Sora に送ります。 Sora は Suzu から送られてきた解析結果を、プッシュ API を経由してリアルタイムにクライアントへ通知します。

目的

リアルタイム通話で気軽に音声解析サービスを利用できる仕組みを提供することです。

特徴

Sora から音声データを HTTP/2 経由で受け取り、音声解析サービスへ送信します
音声解析サービスの解析結果を HTTP/2 レスポンスで Sora に戻します
Sora は受け取った解析結果をクライアントへプッシュで送信します
- DataChannel 経由のシグナリングの利用を推奨します
音声解析に必要とされる言語コードをクライアントごとに指定できます
無限リトライ対応
mTLS 対応

使ってみる

Suzu を使ってみたい人は USE.md をお読みください。

Suzu と GCP Speech to Text

sequenceDiagram
    participant client1 as クライアント1<br>sendrecv
    participant client2 as クライアント2<br>recvonly
    participant sora as WebRTC SFU Sora
    participant suzu as Audio Streaming Gateway Suzu
    participant app as アプリケーションサーバー
    participant gcp as GCP Speech to Text
    note over client1, sora: WebRTC 確立
    sora-)client1: "type": "switched"
    note over client1, sora: DataChannel 確立
    par
        client1-)sora: Opus over SRTP
        sora-)suzu: Opus over HTTP/2
        note over suzu: Opus を Ogg コンテナに詰める
        suzu-)gcp: Ogg over HTTP/2
        note over gcp: 音声データが十分ではないためまだ解析結果が返せない
    and
        client1-)sora: Opus over SRTP
        sora-)suzu: Opus over HTTP/2
        suzu-)gcp: Ogg over HTTP/2
        gcp-)suzu: 音声解析結果<br>JSON over HTTP/2
        suzu-)sora: 音声解析結果<br>JSON over HTTP/2
        sora-)client1: プッシュ通知<br>音声解析結果<br>JSON over DataChannel
    end
    par
        note over client2, sora: WebRTC 確立
        sora-)client2: "type": "switched"
        note over client2, sora: DataChannel 確立
    and
        client1-)sora: Opus over SRTP
        sora-)suzu: Opus over HTTP/2
        suzu-)gcp: Ogg over HTTP/2
        gcp-)suzu: 音声解析結果<br>JSON over HTTP/2
        suzu-)sora: 音声解析結果<br>JSON over HTTP/2
    end
    par
        sora-)client1: プッシュ通知<br>音声解析結果<br>JSON over DataChannel
    and
        sora-)client2: プッシュ通知<br>音声解析結果<br>JSON over DataChannel
    end

対応サービス

ライセンス

Copyright 2022-2024, Hiroshi Yoshida (Original Author)
Copyright 2022-2024, Shiguredo Inc.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

優先実装

優先実装とは Sora のライセンスを契約頂いているお客様限定で Suzu の実装予定機能を有償にて前倒しで実装することです。

優先実装が可能な機能一覧

詳細は Discord やメールなどでお気軽にお問い合わせください。

Google Cloud Speech-to-Text V2 API
Google Cloud Media Translation
Microsoft Azure Speech to Text 対応
Microsoft Azure Speech Translation 対応
Deepgram 対応
AmiVoice Cloud Platform 対応
ウェブフック機能対応
- クライアントごとに接続先サービスを変更できるようになる

Name		Name	Last commit message	Last commit date
Latest commit History 613 Commits
.github		.github
_third_party/pion		_third_party/pion
cert		cert
cmd/suzu		cmd/suzu
doc		doc
patch		patch
script		script
testdata		testdata
.air.toml		.air.toml
.gitignore		.gitignore
CHANGES.md		CHANGES.md
LICENSE		LICENSE
Makefile		Makefile
NOTICE.md		NOTICE.md
README.md		README.md
VERSION		VERSION
amazon_transcribe.go		amazon_transcribe.go
amazon_transcribe_handler.go		amazon_transcribe_handler.go
config.go		config.go
config_example.ini		config_example.ini
errors.go		errors.go
go.mod		go.mod
go.sum		go.sum
handler.go		handler.go
handler_test.go		handler_test.go
healthcheck_handler.go		healthcheck_handler.go
healthcheck_handler_test.go		healthcheck_handler_test.go
languages.go		languages.go
logging.go		logging.go
oggwriter.go		oggwriter.go
oggwriter_ext.go		oggwriter_ext.go
packet_dump_handler.go		packet_dump_handler.go
server.go		server.go
service_handler.go		service_handler.go
speech_to_text.go		speech_to_text.go
speech_to_text_handler.go		speech_to_text_handler.go
staticcheck.conf		staticcheck.conf
test_handler.go		test_handler.go
test_handler_test.go		test_handler_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio Streaming Gateway Suzu

About Shiguredo's open source software

時雨堂のオープンソースソフトウェアについて

Audio Streaming Gateway Suzu について

目的

特徴

使ってみる

Suzu と GCP Speech to Text

対応サービス

ライセンス

優先実装

優先実装が可能な機能一覧

About

Releases 14

Contributors 6

Languages

License

shiguredo/suzu

Folders and files

Latest commit

History

Repository files navigation

Audio Streaming Gateway Suzu

About Shiguredo's open source software

時雨堂のオープンソースソフトウェアについて

Audio Streaming Gateway Suzu について

目的

特徴

使ってみる

Suzu と GCP Speech to Text

対応サービス

ライセンス

優先実装

優先実装が可能な機能一覧

About

Resources

License

Stars

Watchers

Forks

Releases 14

Contributors 6

Languages