
podcast tts mcp
基于 Microsoft Edge TTS 技术,生成高质量、多语言的播客对话内容。
Repository Info
About This Server
基于 Microsoft Edge TTS 技术,生成高质量、多语言的播客对话内容。
Model Context Protocol (MCP) - This server can be integrated with AI applications to provide additional context and capabilities, enabling enhanced AI interactions and functionality.
Documentation
Multilingual Podcast Conversation Server
A specialized Model Context Protocol (MCP) server for generating high-quality podcast conversations with alternating male and female voices using Microsoft Edge's text-to-speech technology. Supports English, Simplified Chinese, and Traditional Chinese.
What is this?
This server makes it easy to create natural-sounding podcast conversations by alternating between professional male and female voices. It's designed specifically for podcast creators and content producers who want to quickly prototype or generate podcast-style content in multiple languages.
Key Features
- Multilingual Support: Generate podcasts in English, Simplified Chinese, and Traditional Chinese
- Simple Conversation Format: Just specify speakers and text in a straightforward JSON structure
- Professional Voices: Uses high-quality neural voices from Microsoft Edge TTS
- Easy Integration: Works seamlessly with Claude and other MCP-compatible AI assistants
- Optimized Performance: Efficient generation and processing of multi-speaker audio
- Single-Purpose Design: Focused solely on podcast conversations for simplicity
Quick Start
Prerequisites
- macOS operating system
- Claude Desktop application (or another MCP client)
- Python 3.8 or higher
- Edge TTS library (
pip install edge-tts) - MCP library (
pip install mcp[cli])
Installation
- Clone this repository:
git clone https://github.com/mcai/podcast-tts-mcp.git
cd podcast-tts-mcp
- Install the required dependencies:
pip install -r requirements.txt
Integrating with Claude Desktop
- Add the server to Claude's configuration file:
{
"mcpServers": {
"podcast-tts": {
"command": "bash",
"args": ["/full/path/to/podcast-tts-mcp/start.sh"]
}
}
}
The configuration file is located at:
- macOS:
~/Library/Application Support/Claude/claude_desktop_config.json
- Restart Claude Desktop
Usage
Conversation Format
The server accepts a simple JSON format where each segment specifies a speaker and their text:
[
{
"speaker": "male",
"text": "Welcome to our podcast! I'm Alex, your host for today."
},
{
"speaker": "female",
"text": "And I'm Jordan. We have an exciting topic to discuss today."
},
{
"speaker": "male",
"text": "That's right! Today we're diving into the fascinating world of..."
}
]
Using from Claude
Can you create a podcast script about [topic] with two hosts, and then use the podcast-tts MCP server to generate audio for it in [language]?
Voice Options
The server uses dedicated podcast-quality voices for each supported language:
English (language code: "en")
- Male Voice: en-US-GuyNeural
- Female Voice: en-US-AriaNeural
Simplified Chinese (language code: "zh-CN")
- Male Voice: zh-CN-YunyangNeural (professional and reliable, great for news reading)
- Female Voice: zh-CN-XiaoxiaoNeural (warm tone, suitable for news and novels)
Traditional Chinese (language code: "zh-TW")
- Male Voice: zh-TW-YunJheNeural (friendly and positive, ideal for general content)
- Female Voice: zh-TW-HsiaoChenNeural (friendly and positive, suitable for general use)
These voices were chosen for their clarity, professional sound, and natural conversational quality.
Technical Details
Required Parameters
- conversation: Array of segments, each with "speaker" and "text" fields
Optional Parameters
- language: Language code - "en" (default), "zh-CN", or "zh-TW"
- rate: Speaking rate adjustment (e.g., "+0%", "-10%", "+20%")
- volume: Volume adjustment (e.g., "+0%", "+10%", "-5%")
Example with Language Selection
{
"conversation": [
{"speaker": "male", "text": "欢迎收听我们的播客!我是今天的主持人Alex。"},
{"speaker": "female", "text": "我是Jordan。今天我们有一个令人兴奋的话题要讨论。"}
],
"language": "zh-CN",
"rate": "+0%",
"volume": "+5%"
}
Output
The server returns a JSON response with:
- Status (success/error)
- Language used
- Number of segments processed
- Total word count
- Processing time
- Audio file path
- Detailed information about each segment
Limitations
- Maximum conversation length: ~64KB total text
- Maximum audio length: ~10 minutes
- Limited to supported languages (English, Simplified Chinese, Traditional Chinese)
- Limited to two voice options per language (male/female)
Troubleshooting
If you encounter issues:
- Check the server logs:
cat /tmp/podcast_tts_debug.log - Verify the server is running
- Ensure your conversation format is correct (see examples above)
- Check that both "speaker" and "text" fields are present for each segment
- Verify you're using a supported language code ("en", "zh-CN", or "zh-TW")
Further Resources
- Model Context Protocol Documentation
- Claude Desktop MCP Guide
- Microsoft Edge TTS Documentation
License
This project is open source and free to use.
Quick Start
Clone the repository
git clone https://github.com/mcai/podcast-tts-mcpInstall dependencies
cd podcast-tts-mcp
npm installFollow the documentation
Check the repository's README.md file for specific installation and usage instructions.
Repository Details
Recommended MCP Servers
Discord MCP
Enable AI assistants to seamlessly interact with Discord servers, channels, and messages.
Knit MCP
Connect AI agents to 200+ SaaS applications and automate workflows.
Apify MCP Server
Deploy and interact with Apify actors for web scraping and data extraction.
BrowserStack MCP
BrowserStack MCP Server for automated testing across multiple browsers.
Zapier MCP
A Zapier server that provides automation capabilities for various apps.