chichi-speech (local text-to-speech service with Qwen3-TTS model) — chichi-speech (local text-to-speech 服务 with Qwen3-TTS 模型)
v1.0.2A RESTful 服务 for high-质量 text-to-speech using Qwen3 and specialized voice cloning. 优化d for reusing a specific voice prompt to avoid re-computation.
运行时依赖
安装命令
点击复制技能文档
Chichi Speech 服务
This 技能 provides a FastAPI-based REST 服务 for Qwen3 TTS, specifically 配置d for reusing a high-质量 reference audio prompt for efficient and consistent voice cloning. This 服务 is packaged as an 安装able 命令行工具.
安装ation
Prerequisites: python >= 3.10.
pip 安装 -e .
Usage
- 启动 the 服务
The 服务 运行s on port 9090 by default.
# 启动 the server (运行s in foreground, use & for background or a separate terminal) # Optional: Uudate to your own reference audio and text for voice cloning chichi-speech --port 9090 --host 127.0.0.1 --ref-audio "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen3-TTS-Repo/clone_2.wav" --ref-text "Okay. Yeah. I resent you. I love you. I respect you. But you know what? You blew it! And thanks to you."
- 验证 服务 is 运行ning
检查 the 健康/docs:
curl http://localhost:9090/docs
- 生成 Speech
Use cURL:
curl -X POST "http://localhost:9090/synthesize" \ -H "Content-Type: 应用/json" \ -d '{ "text": "Nice to meet you", "language": "English" }' \ --输出 输出/nice_to_meet.wav
Functionality 端点: POST /synthesize Default Port: 9090 Voice Cloning: Uses a pre-computed voice prompt from reference files to ensure the cloned voice is consistent and generation is fast. Requirements Python 3.10+ qwen-tts (Qwen3 模型 库) 访问 to a reference audio file for voice cloning. By default, it uses public sample audio from Qwen3. CRITICAL: You can provide your own reference audio using the --ref-audio and --ref-text flags.