Live Translation Pipeline

Currently a work in progress

A low-latency live translation pipeline designed to provide real-time captions, translation, and dubbing for video conferencing and streaming applications. The system prioritizes speed and lightweight deployment by leveraging Faster Whisper Tiny for speech-to-text, OPUS-MT models for machine translation, and Piper for text-to-speech synthesis.

How to use

Install dependencies

pip install -r requirements.txt

Install Machine Translation and Voice models

Piper TTS voices (pick these from the repo):
- https://huggingface.co/rhasspy/piper-voices
  - vi: vi/vi_VN/vais1000/medium
  - en: en/en_US/amy/low
  - fr: fr/fr_FR/gilles/low
Machine Translation (OPUS-MT / Marian) (pick these model pages):
- https://huggingface.co/Helsinki-NLP/
  - opus-mt-vi-en, opus-mt-en-vi, opus-mt-fr-en, opus-mt-en-fr, opus-mt-fr-vi, opus-mt-vi-fr

Then update your model url in the .env file

Run the service: You can run test indepedently for now by running each layers in /pipeline and run:

python app/pipeline/stt.py
python app/pipeline/mt.py
python app/pipeline/tts.py

Note: running the test required you to open your mic, please have that enabled before running the tests.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
bot		bot
bot_orchestrator		bot_orchestrator
demo		demo
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
compose.yaml		compose.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Live Translation Pipeline

Currently a work in progress

How to use

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Live Translation Pipeline

Currently a work in progress

How to use

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages