Clone a voice in 5 seconds to generate arbitrary speech in real-time
Updated Jul 23, 2024 - Python
🤯 Lobe Chat - an open-source, modern-design LLM/AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity), multi-modal features (Vision/TTS), and a plugin system. One-click FREE deployment of your private ChatGPT chat application.
🚀 AI voice imitation: clone your voice within 5 seconds and generate arbitrary speech content in real-time
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning)
A generative speech model for daily dialogue.
Instant voice cloning by MyShell.
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed inference
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
A sound cloning tool with a web interface, using your voice or any sound to record audio
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
🤖 wukong-robot is a simple, flexible, and elegant Chinese voice-dialogue robot / smart speaker project. It supports ChatGPT multi-turn conversation and is possibly the first open-source smart speaker project to support brain-computer interaction.
🎤 Microsoft speech synthesis tool, built with Electron + Vue + ElementPlus + Vite.