How to detect noise, breaks, and multiple speakers in an audio file? #96
Comments
Just dropping a note here: you might be interested in this project.
Dear @youngercloud and @GUUser91, could you please kindly recommend:
With UVR I can separate music and audio, and even remove reverb / echo... Thanks in advance! 🙏
@AlonDan As for speaker separation, the only application I know of with this feature is SpectraLayers 11, but it costs money and it can't separate voices talking over each other. There's also a UVR fork that has a Gradio demo.
Thanks for the detailed reply 🙏 Sorry I wasn't clear enough: I'm looking for a local installation, not online / cloud. Can you recommend any local Gradio / GUI solutions for multi-speaker separation?
@AlonDan
I didn't know that it's possible to separate overlapping speakers via ffmpeg... that's new to me! 😮 If anyone knows how to get: These two are not for separating multiple speakers, but they give the best enhancement/cleanup I've tested so far compared to Kim2 and other MDX models. I wonder if there is a local Gradio app just like THIS ONE; it's so simple yet very useful, but I'm looking for a local installation. I'm still looking for multi-speaker separation like THIS wonderful project, but running locally. If anyone finds something like that, please share.
@AlonDan This fork has BS-RoFormer and Mel-RoFormer. I believe Resemble Enhance has a local Gradio demo and can separate vocals from background music.
Thank you @GUUser91 for your kind help. 🙏 I'll give UVR5-UI a chance; it seems to support the models I mentioned, and I hope the installation is as simple as it looks (a batch file). I hope that one day this project will have a simple GUI to try locally. It's very impressive!
Thanks, that's a good project!
Hello, I am not from the audio field, so I would like to ask: I have a reference audio clip from which I have removed the BGM and, to some extent, the reverberation, but the result is still poor when I feed it into voice cloning. Is there a better way to detect whether the reference audio contains noise, distortion, or multiple people speaking?