A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
-
Updated
Feb 18, 2026 - Python
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
🔊 Kokoro Web: Free AI text-to-speech, online or self-hosted, OpenAI compatible!
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
AI Plugin is a powerful extension for the Payload CMS, integrating advanced AI capabilities to enhance content creation and management.
ComfyUI node for highly expressive speech and realistic zero-shot voice cloning
AI-powered multi-voice audiobook generator — LLM script annotation, voice cloning, voice design, LoRA training, per-line style control, and export to MP3, chaptered M4B, or Audacity multi-track. Built on Qwen3-TTS.
EDUMCP is a protocol that integrates the Model Context Protocol (MCP) with applications in the education field, dedicated to achieving seamless interconnection and interoperability among different AI models, educational applications, smart hardware, and teaching AGENTs.
Beautiful voice app: record or upload to train a voice, generate speech from text or files, save & download voices.
Official AllVoiceLab Model Context Protocol (MCP) server, supporting interaction with powerful text-to-speech and video translation APIs.
✨ NovelAI api python sdk, easy to use, modern and user-friendly.
🗣️🎤 elevenlabs-api is an open source Java wrapper around the ElevenLabs Voice Synthesis and Cloning Web API.
ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text
OpenAI TTS Compatible Ukrainian TTS StyleTTS2 Pipeline
TTSLab is THE place to easily test ANY text to text to speech model on your own pc with 0 cost
AI generates conversational podcast for ANY research paper, vividly!
A Docker-based OpenAI-compatible Text-to-Speech API server powered by Kyutai's TTS models with GPU acceleration support.
Voice Alignment and Conversion with Neural Networks and the WORLD codec.
Local, portable GUI for Qwen3-TTS. Optimized for NVIDIA RTX 50 Series (CUDA 12.8). One-click install.
Fast, local, OpenAI-compatible TTS server with voice cloning support powered by Kyutai's Pocket TTS
Add a description, image, and links to the voice-generation topic page so that developers can more easily learn about it.
To associate your repository with the voice-generation topic, visit your repo's landing page and select "manage topics."