An open API service indexing awesome lists of open source software.

https://github.com/logancyang/minimax-tts-skill

Agent Skill for minimax tts and voice cloning
https://github.com/logancyang/minimax-tts-skill

Last synced: about 2 months ago
JSON representation

Agent Skill for minimax tts and voice cloning

Awesome Lists containing this project

README

          

# MiniMax TTS Skill

An agent skill for text-to-speech generation and voice cloning using the [MiniMax](https://www.minimaxi.com/) cloud API.

## Features

- **Text-to-Speech** — Generate speech from text with configurable voice, speed, pitch, and format
- **Voice Cloning** — Clone a voice from a reference audio file and use it for TTS
- No pip dependencies — uses only Python stdlib

## Setup

Set the following environment variables:

```bash
MINIMAX_API_KEY="your-api-key"
MINIMAX_GROUP_ID="your-group-id" # required for voice cloning
```

## Quick Start

```bash
# Generate speech
python scripts/tts.py tts --text "Hello world" --voice male-qn-qingse --output hello.mp3 --format mp3

# Clone a voice
python scripts/tts.py clone --audio reference.wav --voice-id my_voice --text "Test sentence"
```

See [SKILL.md](SKILL.md) for full usage details and available options.