{"id":30196821,"url":"https://github.com/ankushrathour/audiomaker","last_synced_at":"2025-08-13T05:44:43.492Z","repository":{"id":309503469,"uuid":"1036519847","full_name":"AnkushRathour/AudioMaker","owner":"AnkushRathour","description":"AudioMaker is a Python package for generating seamless, long-form audio from massive text inputs. Unlike traditional TTS tools, AudioMaker can handle book-length content (even 4+ hours) by splitting text into chunks, synthesizing each chunk, and merging them into a single audio file.","archived":false,"fork":false,"pushed_at":"2025-08-12T08:17:24.000Z","size":14,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-08-12T09:27:12.945Z","etag":null,"topics":["edge-tts","pyaudio","python","text-to-audio","tqdm"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/AnkushRathour.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-08-12T07:38:17.000Z","updated_at":"2025-08-12T08:17:27.000Z","dependencies_parsed_at":"2025-08-12T09:39:06.295Z","dependency_job_id":null,"html_url":"https://github.com/AnkushRathour/AudioMaker","commit_stats":null,"previous_names":["ankushrathour/audiomaker"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/AnkushRathour/AudioMaker","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnkushRathour%2FAudioMaker","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnkushRathour%2FAudioMaker/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnkushRathour%2FAudioMaker/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnkushRathour%2FAudioMaker/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/AnkushRathour","download_url":"https://codeload.github.com/AnkushRathour/AudioMaker/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnkushRathour%2FAudioMaker/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":270190130,"owners_count":24542247,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-13T02:00:09.904Z","response_time":66,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["edge-tts","pyaudio","python","text-to-audio","tqdm"],"created_at":"2025-08-13T05:44:40.507Z","updated_at":"2025-08-13T05:44:43.470Z","avatar_url":"https://github.com/AnkushRathour.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# 🎙️ AudioMaker\n\n[![PyPI version](https://badge.fury.io/py/audiomaker.svg)](https://pypi.org/project/audiomaker/)\n[![License: MIT](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE)\n[![Python Version](https://img.shields.io/pypi/pyversions/audiomaker)](https://pypi.org/project/audiomaker/)\n\n**AudioMaker** is a Python package for generating **seamless, long-form audio** from massive text inputs.  \nUnlike traditional TTS tools, AudioMaker can handle **book-length content** (even 4+ hours) by **splitting text into chunks**, synthesizing each chunk, and merging them into a single audio file.\n\n---\n\n## ✨ Features\n\n- 📚 **Handles huge text** – turn entire books into one MP3\n- 🧩 **Chunking system** – bypasses TTS length limits automatically\n- 🔗 **Seamless merging** – no awkward pauses or breaks\n- 🎙️ **Custom voices** – choose from Microsoft Edge-TTS voices\n- 🛠 **Flexible usage** – CLI or Python API\n- ⏱ **Progress bars** – real-time status with `tqdm`\n\n---\n\n## 📦 Installation\n\n```bash\npip install audiomaker\n\n# or\n\ngit clone https://github.com/ankushrathour/audiomaker.git\ncd audiomaker\npip install -e .\n```\n\n## 🚀 Usage\n\n1️⃣ Command-Line Interface (CLI)\n```bash\naudiomaker --input file.txt --output file.mp3 --chunk_size 3000 --voice en-US-AriaNeural\n```\n\n####  Arguments:\n######  Flag Description Default\n- input Path to input text file Required\n\n- output Path to save final audio output.mp3\n\n- chunk_size Number of words per TTS chunk 3000\n\n- voice Edge-TTS voice name en-US-AriaNeural\n\n- temp_dir Directory for temporary audio chunks audio_parts\n\n2️⃣ Python API\n\n```bash\nfrom audiomaker import text_to_audio\n\n# Load text from file\nwith open(\"file.txt\", \"r\", encoding=\"utf-8\") as f:\n    text = f.read()\n\n# Convert to audio\ntext_to_audio(\n    text=text, output_path=\"output.mp3\",\n    chunk_size=3000, voice=\"en-US-AriaNeural\", temp_dir=\"audio_parts\"\n)\n```\n\n# 🎨 Example Voices\nSome popular Microsoft Edge-TTS voices you can use:\n\n- en-US-AriaNeural\n- en-GB-RyanNeural\n- en-IN-NeerjaNeural\n- en-AU-NatashaNeural\n\nFor a complete list of available voices, please refer to the full list of [Voices](https://github.com/rany2/edge-tts?tab=readme-ov-file#changing-the-voice).\n\n# ⚠️ Notes\n\nEdge-TTS requires an internet connection to access Microsoft’s speech\nservices. Chunk size may need to be adjusted depending on the voice and\ntext formatting. Intermediate audio files are stored in temp_dir and can\nbe deleted after processing.\n\n# 💡 Tagline\n\n**AudioMaker** – Unlimited text, one seamless voice.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fankushrathour%2Faudiomaker","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fankushrathour%2Faudiomaker","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fankushrathour%2Faudiomaker/lists"}