Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Regarding the problem of SeamlessM4T translating and cloning timbre/spoken language, here are cases #289

Open
curui opened this issue Mar 14, 2024 · 5 comments

Comments

@curui
Copy link

curui commented Mar 14, 2024

Regarding using only one audio sample, you can speak multiple languages using the tone of the audio sample. In fact, what you use is: seamless
You can refer to this: https://replicate.com/adirik/seamless-expressive
He seems to have also quoted seamless: https://github.com/replicate/cog-seamlessexpressive
demo:https://www.youtube.com/watch?v=lgL_rCF02Ng
You can refer to the recently popular ones: https://github.com/RVC-Boss/GPT-SoVITS

@curui curui closed this as completed Mar 14, 2024
@curui curui reopened this Mar 14, 2024
@rsxdalv
Copy link
Owner

rsxdalv commented Mar 14, 2024

Is GPT-SoVITS a new replacement for RVC?

@curui
Copy link
Author

curui commented Mar 14, 2024

https://replicate.com/adirik/seamless-expressive

You can upload an audio test, as if translated directly and clone the sound at the same time https://replicate.com/adirik/seamless-expressive

@curui
Copy link
Author

curui commented Mar 14, 2024

He uses seamless :https://huggingface.co/facebook/seamless-expressive
屏幕截图 2024-03-15 060657 I don't know why you do this

@curui
Copy link
Author

curui commented Mar 14, 2024

Is GPT-SoVITS a new replacement for RVC?
GPT-SoVITS can also Too complicated to use 。
You only need to upload a piece of audio, and you can use SeamlessExpressive. However, the SeamlessExpressive model needs to be reviewed before it can be obtained. I don’t know what the difference is between it and SeamlessM4T.

@rsxdalv
Copy link
Owner

rsxdalv commented Mar 15, 2024

He uses seamless :https://huggingface.co/facebook/seamless-expressive 屏幕截图 2024-03-15 060657 I don't know why you do this

Ah now I remember this seamless-expressive, I got it confused with seamlessM4T. To be honest, I doubt that many people will be willing to fill out the form.
GPT-SoVITS and SeamlessM4T can be done though.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants