An open API service indexing awesome lists of open source software.

https://github.com/PsychArch/minimax-mcp-tools

Async MCP server with Minimax API integration for image generation and text-to-speech
https://github.com/PsychArch/minimax-mcp-tools

Last synced: 6 days ago
JSON representation

Async MCP server with Minimax API integration for image generation and text-to-speech

Awesome Lists containing this project

README

          

# Minimax MCP Tools

![Banner](assets/banner.png)

A Model Context Protocol (MCP) server for Minimax AI integration, providing async image generation and text-to-speech with advanced rate limiting and error handling.

English | [įŽ€äŊ“中文](README.zh-CN.md)

### MCP Configuration
Add to your MCP settings:
```json
{
"mcpServers": {
"minimax-mcp-tools": {
"command": "npx",
"args": ["minimax-mcp-tools"],
"env": {
"MINIMAX_API_KEY": "your_api_key_here"
}
}
}
}
```

## Async Design - Perfect for Content Production at Scale

This MCP server uses an **asynchronous submit-and-barrier pattern** designed for **batch content creation**:

đŸŽŦ **Narrated Slideshow Production** - Generate dozens of slide images and corresponding narration in parallel
📚 **AI-Driven Audiobook Creation** - Produce chapters with multiple voice characters simultaneously
đŸ–ŧī¸ **Website Asset Generation** - Create consistent visual content and audio elements for web projects
đŸŽ¯ **Multimedia Content Pipelines** - Perfect for LLM-driven content workflows requiring both visuals and audio

### Architecture Benefits:
1. **Submit Phase**: Tools return immediately with task IDs, tasks execute in background
2. **Smart Rate Limiting**: Adaptive rate limiting (10 RPM images, 20 RPM speech) with burst capacity
3. **Barrier Synchronization**: `task_barrier` waits for all tasks and returns comprehensive results
4. **Batch Optimization**: Submit multiple tasks to saturate rate limits, then barrier once for maximum throughput

## Tools

### `submit_image_generation`
**Submit Image Generation Task** - Generate images asynchronously.

**Required:** `prompt`, `outputFile`
**Optional:** `aspectRatio`, `customSize`, `seed`, `subjectReference`, `style`

### `submit_speech_generation`
**Submit Speech Generation Task** - Convert text to speech asynchronously.

**Required:** `text`, `outputFile`
**Optional:** `highQuality`, `voiceId`, `speed`, `volume`, `pitch`, `emotion`, `format`, `sampleRate`, `bitrate`, `languageBoost`, `intensity`, `timbre`, `sound_effects`

### `task_barrier`
**Wait for Task Completion** - Wait for ALL submitted tasks to complete and retrieve results. Essential for batch processing.

## Architecture
```mermaid
sequenceDiagram
participant User
participant MCP as MCP Server
participant TM as Task Manager
participant API as Minimax API

Note over User, API: Async Submit-and-Barrier Pattern

User->>MCP: submit_image_generation(prompt1)
MCP->>TM: submitImageTask()
TM-->>MCP: taskId: img-001
MCP-->>User: "Task img-001 submitted"

par Background Execution (Rate Limited)
TM->>API: POST /image/generate
API-->>TM: image data + save file
end

User->>MCP: submit_speech_generation(text1)
MCP->>TM: submitTTSTask()
TM-->>MCP: taskId: tts-002
MCP-->>User: "Task tts-002 submitted"

par Background Execution (Rate Limited)
TM->>API: POST /speech/generate
API-->>TM: audio data + save file
end

User->>MCP: submit_image_generation(prompt2)
MCP->>TM: submitImageTask()
TM-->>MCP: taskId: img-003
MCP-->>User: "Task img-003 submitted"

par Background Execution (Rate Limited)
TM->>API: POST /image/generate (queued)
API-->>TM: image data + save file
end

User->>MCP: task_barrier()
MCP->>TM: barrier()
TM->>TM: wait for all tasks
TM-->>MCP: results summary
MCP-->>User: ✅ All tasks completed
Files available at specified paths

Note over User, API: Immediate Task Submission + Background Rate-Limited Execution
```

## License
MIT