https://github.com/PsychArch/minimax-mcp-tools
Async MCP server with Minimax API integration for image generation and text-to-speech
https://github.com/PsychArch/minimax-mcp-tools
Last synced: 6 days ago
JSON representation
Async MCP server with Minimax API integration for image generation and text-to-speech
- Host: GitHub
- URL: https://github.com/PsychArch/minimax-mcp-tools
- Owner: PsychArch
- License: mit
- Created: 2025-03-16T10:02:17.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2025-08-14T14:26:12.000Z (3 months ago)
- Last Synced: 2025-09-19T15:46:46.019Z (about 2 months ago)
- Language: TypeScript
- Homepage:
- Size: 435 KB
- Stars: 51
- Watchers: 1
- Forks: 12
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-mcp-servers - **minimax-mcp-tools** - Async MCP server with Minimax API integration for image generation and text-to-speech `typescript` `mcp` `server` `api` `npm install PsychArch/minimax-mcp-tools` (đ Web Development)
README
# Minimax MCP Tools

A Model Context Protocol (MCP) server for Minimax AI integration, providing async image generation and text-to-speech with advanced rate limiting and error handling.
English | [įŽäŊ䏿](README.zh-CN.md)
### MCP Configuration
Add to your MCP settings:
```json
{
"mcpServers": {
"minimax-mcp-tools": {
"command": "npx",
"args": ["minimax-mcp-tools"],
"env": {
"MINIMAX_API_KEY": "your_api_key_here"
}
}
}
}
```
## Async Design - Perfect for Content Production at Scale
This MCP server uses an **asynchronous submit-and-barrier pattern** designed for **batch content creation**:
đŦ **Narrated Slideshow Production** - Generate dozens of slide images and corresponding narration in parallel
đ **AI-Driven Audiobook Creation** - Produce chapters with multiple voice characters simultaneously
đŧī¸ **Website Asset Generation** - Create consistent visual content and audio elements for web projects
đ¯ **Multimedia Content Pipelines** - Perfect for LLM-driven content workflows requiring both visuals and audio
### Architecture Benefits:
1. **Submit Phase**: Tools return immediately with task IDs, tasks execute in background
2. **Smart Rate Limiting**: Adaptive rate limiting (10 RPM images, 20 RPM speech) with burst capacity
3. **Barrier Synchronization**: `task_barrier` waits for all tasks and returns comprehensive results
4. **Batch Optimization**: Submit multiple tasks to saturate rate limits, then barrier once for maximum throughput
## Tools
### `submit_image_generation`
**Submit Image Generation Task** - Generate images asynchronously.
**Required:** `prompt`, `outputFile`
**Optional:** `aspectRatio`, `customSize`, `seed`, `subjectReference`, `style`
### `submit_speech_generation`
**Submit Speech Generation Task** - Convert text to speech asynchronously.
**Required:** `text`, `outputFile`
**Optional:** `highQuality`, `voiceId`, `speed`, `volume`, `pitch`, `emotion`, `format`, `sampleRate`, `bitrate`, `languageBoost`, `intensity`, `timbre`, `sound_effects`
### `task_barrier`
**Wait for Task Completion** - Wait for ALL submitted tasks to complete and retrieve results. Essential for batch processing.
## Architecture
```mermaid
sequenceDiagram
participant User
participant MCP as MCP Server
participant TM as Task Manager
participant API as Minimax API
Note over User, API: Async Submit-and-Barrier Pattern
User->>MCP: submit_image_generation(prompt1)
MCP->>TM: submitImageTask()
TM-->>MCP: taskId: img-001
MCP-->>User: "Task img-001 submitted"
par Background Execution (Rate Limited)
TM->>API: POST /image/generate
API-->>TM: image data + save file
end
User->>MCP: submit_speech_generation(text1)
MCP->>TM: submitTTSTask()
TM-->>MCP: taskId: tts-002
MCP-->>User: "Task tts-002 submitted"
par Background Execution (Rate Limited)
TM->>API: POST /speech/generate
API-->>TM: audio data + save file
end
User->>MCP: submit_image_generation(prompt2)
MCP->>TM: submitImageTask()
TM-->>MCP: taskId: img-003
MCP-->>User: "Task img-003 submitted"
par Background Execution (Rate Limited)
TM->>API: POST /image/generate (queued)
API-->>TM: image data + save file
end
User->>MCP: task_barrier()
MCP->>TM: barrier()
TM->>TM: wait for all tasks
TM-->>MCP: results summary
MCP-->>User: â
All tasks completed
Files available at specified paths
Note over User, API: Immediate Task Submission + Background Rate-Limited Execution
```
## License
MIT