Merge pull request #21 from 0xrushi/feat/docupdate

ylxmf2005 · web-flow · commit 3656a3e59cf5 · 2025-10-12T13:25:07.000+08:00
feat: add eleven labs doc
diff --git a/docs/user-guide/backend/tts.md b/docs/user-guide/backend/tts.md
@@ -296,3 +296,69 @@ MiniMax提供的在线的TTS服务，`speech-02-turbo`等模型具有强大的TT
       pronunciation_dict: ''
 ```
 其中`voice_id`是可以配置的声音音色，具体的支持声音列表可以查看[官方文档中查询可用声音ID的部分](https://platform.minimaxi.com/document/get_voice)。`pronunciation_dict`是可以支持的自定义发声规则，比如您可以把`牛肉`发音为`neuro`，可以用类似示例的方法来定义这个发声规则。
+
+## ElevenLabs TTS (在线，需要API密钥)
+> 自版本 `v1.2.1` 起可用
+
+ElevenLabs 提供高质量、自然流畅的文本转语音服务，支持多种语言和声音克隆功能。
+
+### 功能特点
+- **高质量音频**：行业领先的语音合成质量
+- **多语言支持**：支持英语、中文、日语、韩语等多种语言
+- **声音克隆**：上传音频样本进行声音克隆
+- **丰富的语音库**：提供多种预设语音和社区语音
+- **实时生成**：低延迟语音合成
+
+### 配置步骤
+1. **注册并获取API密钥**
+   - 访问 [ElevenLabs](https://elevenlabs.io/) 注册账户
+   - 从 ElevenLabs 控制台获取您的 API 密钥
+
+2. **选择语音**
+   - 在 ElevenLabs 控制台中浏览可用语音
+   - 复制您喜欢的语音的 Voice ID
+   - 您也可以上传音频样本进行声音克隆
+
+3. **配置 `conf.yaml`**
+   在配置文件的 `elevenlabs_tts` 段落中，按以下格式填写参数：
+
+```yaml
+elevenlabs_tts:
+  api_key: 'your_elevenlabs_api_key'  # 必需：您的 ElevenLabs API 密钥
+  voice_id: 'JBFqnCBsd6RMkjVDRZzb'   # 必需：ElevenLabs 语音 ID
+  model_id: 'eleven_multilingual_v2'  # 模型 ID（默认：eleven_multilingual_v2）
+  output_format: 'mp3_44100_128'      # 输出音频格式（默认：mp3_44100_128）
+  stability: 0.5                      # 语音稳定性（0.0 到 1.0，默认：0.5）
+  similarity_boost: 0.5               # 语音相似度增强（0.0 到 1.0，默认：0.5）
+  style: 0.0                         # 语音风格夸张度（0.0 到 1.0，默认：0.0）
+  use_speaker_boost: true            # 启用说话人增强以获得更好质量（默认：true）
+```
+
+### 参数说明
+- **api_key**（必需）：您的 ElevenLabs API 密钥
+- **voice_id**（必需）：语音的唯一标识符，在 ElevenLabs 控制台中找到
+- **model_id**：要使用的 TTS 模型。可用选项：
+  - `eleven_multilingual_v2`（默认）- 支持多种语言
+  - `eleven_monolingual_v1` - 仅英语
+  - `eleven_turbo_v2` - 更快的生成速度
+- **output_format**：音频输出格式。常用选项：
+  - `mp3_44100_128`（默认）- MP3，44.1kHz，128kbps
+  - `mp3_44100_192` - MP3，44.1kHz，192kbps
+  - `pcm_16000` - PCM，16kHz
+  - `pcm_22050` - PCM，22.05kHz
+  - `pcm_24000` - PCM，24kHz
+  - `pcm_44100` - PCM，44.1kHz
+- **stability**：控制语音一致性（0.0 = 更多变化，1.0 = 更一致）
+- **similarity_boost**：增强与原始语音的相似度（0.0 到 1.0）
+- **style**：控制风格夸张度（0.0 = 中性，1.0 = 更具表现力）
+- **use_speaker_boost**：启用说话人增强以提高音频质量
+
+### 使用技巧
+- **语音选择**：先尝试预设语音，然后考虑使用声音克隆获得自定义语音
+- **参数调优**：调整 `stability` 和 `similarity_boost` 以获得最佳效果
+- **成本管理**：ElevenLabs 按使用量收费，大量使用前请先测试
+- **网络要求**：需要稳定的网络连接以确保服务可用
+
+:::tip
+ElevenLabs 提供免费试用额度，您可以在购买付费计划前先测试质量。
+:::
diff --git a/i18n/en/docusaurus-plugin-content-docs/current/user-guide/backend/tts.md b/i18n/en/docusaurus-plugin-content-docs/current/user-guide/backend/tts.md
@@ -304,3 +304,69 @@ minimax_tts:
       pronunciation_dict: ''
 ```
 The `voice_id` parameter can be configured to different voice tones. You can check the [voice ID query section in the official documentation](https://platform.minimaxi.com/document/get_voice) for a complete list of supported voices. The `pronunciation_dict` supports custom pronunciation rules - for example, you can define rules to pronounce "牛肉" as "neuro" using the format shown in the example.
+
+## ElevenLabs TTS (Online, API Key Required)
+> Available since version `v1.2.1`
+
+ElevenLabs provides high-quality, natural-sounding text-to-speech with support for multiple languages and voice cloning capabilities.
+
+### Features
+- **High-Quality Audio**: Industry-leading speech synthesis quality
+- **Multi-language Support**: Supports English, Chinese, Japanese, Korean, and many other languages
+- **Voice Cloning**: Upload audio samples to clone voices
+- **Rich Voice Library**: Multiple preset voices and community voices available
+- **Real-time Generation**: Low-latency speech synthesis
+
+### Configuration Steps
+1. **Register and Get API Key**
+   - Visit [ElevenLabs](https://elevenlabs.io/) to register an account
+   - Get your API key from the ElevenLabs dashboard
+
+2. **Choose a Voice**
+   - Browse available voices in the ElevenLabs dashboard
+   - Copy the Voice ID of your preferred voice
+   - You can also upload audio samples for voice cloning
+
+3. **Configure `conf.yaml`**
+   In the `elevenlabs_tts` section of your configuration file, enter parameters as follows:
+
+```yaml
+elevenlabs_tts:
+  api_key: 'your_elevenlabs_api_key'  # Required: Your ElevenLabs API key
+  voice_id: 'JBFqnCBsd6RMkjVDRZzb'    # Required: ElevenLabs Voice ID
+  model_id: 'eleven_multilingual_v2'  # Model ID (default: eleven_multilingual_v2)
+  output_format: 'mp3_44100_128'      # Output audio format (default: mp3_44100_128)
+  stability: 0.5                      # Voice stability (0.0 to 1.0, default: 0.5)
+  similarity_boost: 0.5               # Voice similarity boost (0.0 to 1.0, default: 0.5)
+  style: 0.0                          # Voice style exaggeration (0.0 to 1.0, default: 0.0)
+  use_speaker_boost: true             # Enable speaker boost for better quality (default: true)
+```
+
+### Parameter Descriptions
+- **api_key** (required): Your ElevenLabs API key
+- **voice_id** (required): Unique identifier for the voice, found in your ElevenLabs dashboard
+- **model_id**: TTS model to use. Available options:
+  - `eleven_multilingual_v2` (default) - Supports multiple languages
+  - `eleven_monolingual_v1` - English only
+  - `eleven_turbo_v2` - Faster generation
+- **output_format**: Audio output format. Common options:
+  - `mp3_44100_128` (default) - MP3, 44.1kHz, 128kbps
+  - `mp3_44100_192` - MP3, 44.1kHz, 192kbps
+  - `pcm_16000` - PCM, 16kHz
+  - `pcm_22050` - PCM, 22.05kHz
+  - `pcm_24000` - PCM, 24kHz
+  - `pcm_44100` - PCM, 44.1kHz
+- **stability**: Controls voice consistency (0.0 = more variable, 1.0 = more consistent)
+- **similarity_boost**: Controls how closely the output matches the original voice (0.0 to 1.0; higher values stay closer to it)
+- **style**: Controls style exaggeration (0.0 = neutral, 1.0 = more expressive)
+- **use_speaker_boost**: Enables speaker boost for improved audio quality
+
+### Usage Tips
+- **Voice Selection**: Try preset voices first, then consider voice cloning for custom voices
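+- **Parameter Tuning**: Adjust `stability` and `similarity_boost` together. Lower `stability` gives more variable delivery, higher `stability` gives more consistent output, and higher `similarity_boost` keeps the result closer to the original voice
+- **Cost Management**: ElevenLabs charges based on usage, so test with a small amount of text before heavy use
+- **Network Requirements**: A stable internet connection is required, because speech is synthesized online by the ElevenLabs service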
+
+:::tip
+ElevenLabs offers free trial credits, so you can test the quality before purchasing a paid plan.
+:::
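
The parameter rules documented in this change (two required keys, three values limited to 0.0 to 1.0, and `output_format` strings such as `mp3_44100_128`) can be checked before a config is used. The following is a minimal sketch, not code from this repository or from the ElevenLabs SDK: the names `check_elevenlabs_config`, `parse_output_format`, and `OutputFormat` are hypothetical, the defaults are copied from the documentation above, and the sketch only checks the shape of each value. It does not confirm that a model ID or output format is actually accepted by the ElevenLabs API.

```python
from dataclasses import dataclass
from typing import Optional

# Keys the documentation marks as required.
REQUIRED_KEYS = ("api_key", "voice_id")
# Keys the documentation limits to the range 0.0 to 1.0.
UNIT_RANGE_KEYS = ("stability", "similarity_boost", "style")
# Defaults as stated in the documented conf.yaml example.
DEFAULTS = {
    "model_id": "eleven_multilingual_v2",
    "output_format": "mp3_44100_128",
    "stability": 0.5,
    "similarity_boost": 0.5,
    "style": 0.0,
    "use_speaker_boost": True,
}


@dataclass(frozen=True)
class OutputFormat:
    """Hypothetical helper type for a parsed output_format string."""
    codec: str
    sample_rate_hz: int
    bitrate_kbps: Optional[int]  # None when the string has no bitrate part (PCM)


def parse_output_format(value: str) -> OutputFormat:
    """Split e.g. 'mp3_44100_128' or 'pcm_16000' into its parts."""
    parts = value.split("_")
    well_formed = (
        len(parts) in (2, 3)
        and parts[0] != ""
        and all(p.isdigit() for p in parts[1:])
    )
    if not well_formed:
        raise ValueError(f"unrecognised output_format: {value!r}")
    bitrate = int(parts[2]) if len(parts) == 3 else None
    return OutputFormat(parts[0], int(parts[1]), bitrate)


def check_elevenlabs_config(section: dict) -> dict:
    """Return the section merged with the documented defaults, or raise ValueError."""
    missing = [key for key in REQUIRED_KEYS if not section.get(key)]
    if missing:
        raise ValueError(f"missing required keys: {', '.join(missing)}")

    merged = {**DEFAULTS, **section}
    for key in UNIT_RANGE_KEYS:
        value = merged[key]
        is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
        if not is_number or not 0.0 <= value <= 1.0:
            raise ValueError(f"{key} must be a number between 0.0 and 1.0, got {value!r}")
    if not isinstance(merged["use_speaker_boost"], bool):
        raise ValueError("use_speaker_boost must be true or false")

    parse_output_format(merged["output_format"])  # raises on a malformed string
    return merged
```

In use, the `elevenlabs_tts` section would be loaded from `conf.yaml` into a dictionary by whatever YAML loader the application already relies on and then passed to `check_elevenlabs_config`; a `ValueError` at that point is easier to diagnose than a failed API request later.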