feat: add newer model tts chatterbox to ai-runner text-to-speech #647

JJassonn69 · 2025-06-23T07:27:08Z

This PR adds the feature for text-to-speech using newer ResembleAI/Chatterbox which as voice cloning feature.

So as not to cause change in the go-livepeer it uses the same application/json format to relay the information between orchestrator and runner.

You can build the text-to-speech docker image from the Dockerfile.text-to-speech similar to any other images.

Docker build -t livepeer/ai-runner:text-to-speech -f Dockerfile.text-to.speech .

for testing purpose you can access the uvicorn server after starting the container with the pipeline and model

docker run --name text-to-speech -e PIPELINE=text-to-speech -e MODEL_ID=chatterbox --gpus all -p 8000:8000 -v ./models:/models <docker-image-build-above>

In the uvicorn server, you will see params for model_id, text and prompt_audio_base64

model_id: ResembleAI/Chatterbox
text:
prompt_audio_base64:

The text converted to base64 looks like UklGRjiTIQBXQVZ.....AADUAADQAADAAACkAAA==

Here is a attached file with the base64 converted audio from my voice.
audio_prompt_base64.txt

…ling

…xpects the weights of the models to be in .pt format and would throw error if they were in .safetensors It does download the weights if none are found automatically

JJassonn69 added 8 commits June 20, 2025 14:33

feat: integrate Chatterbox TTS model with support for voice cloning

2c02ec5

feat: integrate Chatterbox TTS model with Docker and API endpoints

3758d20

feat: update text-to-speech API to use base64 encoded audio prompts

46e9636

feat: add file size limit check to audio_to_text endpoint

cbf6ece

chore: fix a small formatting issue

39cea18

chore: regenerate api bindinds

5a27642

refactor: update text-to-speech API params and add async request hand…

ea26142

…ling

minor change as the current version of hugginface in the chatterbox e…

d84d607

…xpects the weights of the models to be in .pt format and would throw error if they were in .safetensors It does download the weights if none are found automatically

JJassonn69 requested a review from ad-astra-video June 23, 2025 07:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: add newer model tts chatterbox to ai-runner text-to-speech #647

feat: add newer model tts chatterbox to ai-runner text-to-speech #647

Uh oh!

JJassonn69 commented Jun 23, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: add newer model tts chatterbox to ai-runner text-to-speech #647

Are you sure you want to change the base?

feat: add newer model tts chatterbox to ai-runner text-to-speech #647

Uh oh!

Conversation

JJassonn69 commented Jun 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

JJassonn69 commented Jun 23, 2025 •

edited

Loading