A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
-
Updated
Dec 22, 2025 - Python
A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech
Aligning latent space of speaking style with human perception using a re-embedding strategy
Scripts for analyzing how the extent of coarticulation varies across different communicative contexts using speech samples from the LUCID corpus
Add a description, image, and links to the speaking-style topic page so that developers can more easily learn about it.
To associate your repository with the speaking-style topic, visit your repo's landing page and select "manage topics."