[BUG] Input audio resampling is not efficiently implemented #28

mikejgray · 2023-11-07T04:10:34Z

Description

Input audio resampling is not efficiently implemented here. Consider pulling the implementation from ovos-stt-http-server or using helpers from neon_utils.file_utils

Steps to Reproduce

Record voice in Gradio interface
Press Submit to send to Neon
Wait 15-16+ seconds for the audio to convert and everything to process

Relevant Code

No response

Other Notes

No response

NeonDaniel · 2023-11-09T00:48:16Z

It may be better to skip the resampling here and add it to neon-speech so we can handle other clients that may not have sufficient processing power to do any conversion.

…#180) # Description Adds explicit handling of API input audio to set sample_rate, sample_width, and channels to match listener configuration Refactor to prevent creating an audio stream object not used for non-streaming STT # Issues NeonGeckoCom/neon-iris#28 # Other Notes  Co-authored-by: Daniel McKnight <[email protected]>

…#180 Closes #28

* Remove audio resampling moved to neon-speech NeonGeckoCom/neon_speech#180 Closes #28 * Add timing context and logging to go with: NeonGeckoCom/neon_speech#181 NeonGeckoCom/neon_audio#154 * More timing metrics and logging * Refactor timing and add debug log Prevent response error from affecting next input --------- Co-authored-by: Daniel McKnight <[email protected]>

# Changelog ## [0.0.5a21](https://github.com/NeonGeckoCom/neon-iris/tree/0.0.5a21) (2023-12-13) [Full Changelog](0.0.5a20...0.0.5a21) **Merged pull requests:** - Update Documentation and Logging [\#48](#48) ([NeonDaniel](https://github.com/NeonDaniel)) ## [0.0.5a20](https://github.com/NeonGeckoCom/neon-iris/tree/0.0.5a20) (2023-12-13) [Full Changelog](0.0.5a19...0.0.5a20) **Merged pull requests:** - Improved config handling [\#46](#46) ([NeonDaniel](https://github.com/NeonDaniel)) ## [0.0.5a19](https://github.com/NeonGeckoCom/neon-iris/tree/0.0.5a19) (2023-12-11) [Full Changelog](0.0.5a18...0.0.5a19) **Merged pull requests:** - Update release automation [\#47](#47) ([NeonDaniel](https://github.com/NeonDaniel)) ## [0.0.5a18](https://github.com/NeonGeckoCom/neon-iris/tree/0.0.5a18) (2023-12-11) [Full Changelog](0.0.4...0.0.5a18) **Implemented enhancements:** - \[FEAT\] Gradio User Settings Intents [\#35](#35) - \[FEAT\] After submitting recorded input audio the UI element should reset [\#30](#30) - \[FEAT\] Speech input should print STT results to the chat UI in addition to response [\#29](#29) **Fixed bugs:** - \[BUG\] Input audio resampling is not efficiently implemented [\#28](#28) - \[BUG\] Implement per-user settings in web UI [\#27](#27) **Merged pull requests:** - Add GitHub pre-release automation [\#45](#45) ([NeonDaniel](https://github.com/NeonDaniel)) - Update neon-mq-connector dependency to stable spec [\#43](#43) ([NeonDaniel](https://github.com/NeonDaniel)) - Minor fixes to audio input handling [\#42](#42) ([NeonDaniel](https://github.com/NeonDaniel)) - Refactor audio responses to utilize Chatbot UI [\#40](#40) ([NeonDaniel](https://github.com/NeonDaniel)) - Get Language Support from Core [\#37](#37) ([NeonDaniel](https://github.com/NeonDaniel)) - Handle profile updates [\#36](#36) ([NeonDaniel](https://github.com/NeonDaniel)) - Handle STT Transcripts in Chat UI [\#34](#34) ([NeonDaniel](https://github.com/NeonDaniel)) - Remove audio resampling and add timing context support [\#33](#33) ([NeonDaniel](https://github.com/NeonDaniel)) - Fix web\_client language handling to respect configured input language [\#32](#32) ([NeonDaniel](https://github.com/NeonDaniel)) - Threaded input handling and multi-session support [\#31](#31) ([NeonDaniel](https://github.com/NeonDaniel)) - Add Gradio web UI with Docker Container [\#24](#24) ([NeonDaniel](https://github.com/NeonDaniel)) - Add NeonVoiceClient class for minimal remote audio client [\#23](#23) ([NeonDaniel](https://github.com/NeonDaniel)) - Resolve client compat. bug [\#22](#22) ([NeonDaniel](https://github.com/NeonDaniel)) - Add CLI endpoints to interact with API and LLM endpoints [\#21](#21) ([NeonDaniel](https://github.com/NeonDaniel)) - Specify `setup.py` path explicitly [\#20](#20) ([NeonDaniel](https://github.com/NeonDaniel)) - Update build automation to default branch [\#19](#19) ([NeonDaniel](https://github.com/NeonDaniel)) - Update to use shared version\_bump.py script [\#18](#18) ([NeonDaniel](https://github.com/NeonDaniel)) - Remove invalid release Docker action [\#17](#17) ([NeonDaniel](https://github.com/NeonDaniel)) - Update automation to shared repository [\#15](#15) ([NeonDaniel](https://github.com/NeonDaniel)) - Add methods with CLI entrypoints to get STT/TTS [\#12](#12) ([NeonDaniel](https://github.com/NeonDaniel)) \* *This Changelog was automatically generated by [github_changelog_generator](https://github.com/github-changelog-generator/github-changelog-generator)*

mikejgray added the bug Something isn't working label Nov 7, 2023

NeonDaniel self-assigned this Nov 9, 2023

NeonDaniel mentioned this issue Nov 9, 2023

Resample API input wav audio to ensure format matches listener config NeonGeckoCom/neon_speech#180

Merged

NeonDaniel added a commit that referenced this issue Nov 9, 2023

Remove audio resampling moved to neon-speech NeonGeckoCom/neon_speech…

dfb130a

…#180 Closes #28

NeonDaniel mentioned this issue Nov 9, 2023

Remove audio resampling and add timing context support #33

Merged

NeonDaniel closed this as completed in #33 Nov 21, 2023

github-actions bot mentioned this issue Dec 13, 2023

0.1.0 #49

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Input audio resampling is not efficiently implemented #28

[BUG] Input audio resampling is not efficiently implemented #28

mikejgray commented Nov 7, 2023

NeonDaniel commented Nov 9, 2023

[BUG] Input audio resampling is not efficiently implemented #28

[BUG] Input audio resampling is not efficiently implemented #28

Comments

mikejgray commented Nov 7, 2023

Description

Steps to Reproduce

Relevant Code

Other Notes

NeonDaniel commented Nov 9, 2023