Automatic-Speech-to-Speech-translation-with-lip-synchronization

Kaggle link - https://www.kaggle.com/code/winiiash/ai-dubbing-system-with-lip-synchronization

This project proposes an automated pipeline that translates a video of a person speaking in language A into a target language B, with voice transfer/cloning and realistic lip synchronization. The pipeline combines several pre-trained, open-source deep learning models to generate the lip-synchronized video.

The pipeline consists of the following phases:

Video preprocessing and audio extraction from the original video - Python scripts
Speech transcription of the extracted audio (speech to text) - Speechmatics API / Wav2Vec2
Text translation (text to text, translating the transcript into the target language) - Speechmatics
Text to speech with voice transfer (speech synthesis in the target language with voice cloning) - Tortoise TTS
Lip synchronization - Wav2Lip
Video quality enhancement - ESRGAN
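
The phase order above, and the first phase in particular, can be sketched roughly as follows. This is a minimal illustration and not the repository's actual scripts: it assumes ffmpeg is installed and on the PATH, and the names `PIPELINE_STAGES`, `build_extract_audio_cmd`, and `extract_audio` are hypothetical. The 16 kHz mono output is also an assumption, chosen because the commonly used Wav2Vec2 checkpoints expect 16 kHz input.

```python
import subprocess
from pathlib import Path

# Phase order as listed above. The tool names come from the list;
# the stage identifiers are illustrative only.
PIPELINE_STAGES = [
    ("extract_audio", "Python scripts (ffmpeg assumed in this sketch)"),
    ("transcribe", "Speechmatics API / Wav2Vec2"),
    ("translate", "Speechmatics"),
    ("synthesize_speech", "Tortoise TTS"),
    ("lip_sync", "Wav2Lip"),
    ("enhance_video", "ESRGAN"),
]


def build_extract_audio_cmd(video_path, wav_path, sample_rate=16000):
    """Build an ffmpeg command that saves the audio track as mono 16-bit PCM WAV."""
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-i", str(video_path),   # input video
        "-vn",                   # ignore the video stream
        "-acodec", "pcm_s16le",  # uncompressed 16-bit PCM
        "-ar", str(sample_rate), # output sample rate in Hz (assumed 16 kHz)
        "-ac", "1",              # downmix to a single channel
        str(wav_path),
    ]


def extract_audio(video_path, wav_path, sample_rate=16000):
    """Run ffmpeg; raises subprocess.CalledProcessError if ffmpeg exits with an error."""
    cmd = build_extract_audio_cmd(video_path, wav_path, sample_rate)
    subprocess.run(cmd, check=True)
    return Path(wav_path)
```

Calling `extract_audio("input.mp4", "input.wav")` would then produce the WAV file that the transcription phase consumes. Keeping the command builder separate from the call that runs it makes the arguments easy to inspect without ffmpeg present.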

The convergence of artificial intelligence and multimedia technologies has paved the way for new solutions in speech-to-speech translation. Traditional approaches often fail to capture the nuances of natural speech or to maintain lip synchronization, which leads to a poor user experience. This project aims to overcome these limitations by using deep learning architectures and techniques to build a robust and efficient translation pipeline. By integrating the audio and visual modalities, the automated pipeline aims to produce accurately lip-synchronized video.

Ashwinip343/Automatic-Speech-to-Speech-translation-with-lip-synchronization
