{"id":40511792,"url":"https://github.com/droxer/slide-speaker-core","last_synced_at":"2026-01-20T20:12:18.297Z","repository":{"id":316719008,"uuid":"1048543353","full_name":"droxer/slide-speaker-core","owner":"droxer","description":"SlideSpeaker is an AI app that converts your slides into engaging videos and Podcasts with narration and avatars.","archived":false,"fork":false,"pushed_at":"2025-11-03T09:32:50.000Z","size":136403,"stargazers_count":3,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-11-03T11:14:01.143Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/droxer.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":"AGENTS.md","dco":null,"cla":null}},"created_at":"2025-09-01T15:57:19.000Z","updated_at":"2025-11-03T09:32:56.000Z","dependencies_parsed_at":"2025-10-17T03:40:50.942Z","dependency_job_id":"cd95cf46-5ea6-4631-91fa-4ffb87bb16fd","html_url":"https://github.com/droxer/slide-speaker-core","commit_stats":null,"previous_names":["droxer/slide-speaker","droxer/slide-speaker-core"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/droxer/slide-speaker-core","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/droxer%2Fslide-speaker-core","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/droxer%2Fslide-speaker-core/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/droxer%2Fslide-speaker-core/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/droxer%2Fslide-speaker-core/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/droxer","download_url":"https://codeload.github.com/droxer/slide-speaker-core/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/droxer%2Fslide-speaker-core/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28612165,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-20T18:56:40.769Z","status":"ssl_error","status_checked_at":"2026-01-20T18:54:26.653Z","response_time":117,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2026-01-20T20:12:17.544Z","updated_at":"2026-01-20T20:12:18.289Z","avatar_url":"https://github.com/droxer.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# SlideSpeaker API\n\nTurn slides/PDFs into narrated videos — transcripts, TTS, subtitles, and optional avatars.\n\nThis repository now contains the FastAPI backend that powers SlideSpeaker. It exposes the task orchestration pipeline, handles transcription/TTS jobs, and serves generated media back to clients. The React/Next.js frontend has been moved into its own repository (`slide-speaker-web/`), ready to be published as a separate git project.\n\n## ⚠️ Project Status\n\nSlideSpeaker is under active development. Expect rapid iteration, breaking changes, and incomplete tooling while we work toward production readiness.\n\n## ✨ Features\n\n- Automated script generation from slide decks or PDFs\n- Natural-sounding text-to-speech narration with configurable voices\n- Optional AI avatars synced to narration for presenter-style videos\n- Podcast-ready audio exports for sharing beyond video platforms\n- Subtitle outputs in VTT/SRT formats aligned to the narration\n- Task-based API that coordinates the full processing pipeline end-to-end\n- Responsive light, dark, and auto themes with per-user preferences\n- Global language switcher with localized UI labels and stored preferences\n- Hybrid authentication powered by NextAuth (Google OAuth + email/password) backed by FastAPI endpoints\n- WCAG 2.1 AA compliance with enhanced accessibility features\n- High contrast themes for both light and dark modes\n- Support for additional languages: Thai, Korean, and Japanese\n- Optimized task creation page and improved processing display\n- Enhanced web performance for better user experience\n- Modern state management with Zustand for improved frontend performance\n- Enhanced theme system with proper high contrast support\n\n## 🚀 Quick Start (API)\n```bash\ncd api\nuv sync                      # Install base dependencies\ncp .env.example .env         # Create config file\n# Edit .env to add your API keys\nmake dev                     # Start development server (port 8000)\n```\n\n### Background Workers\n```bash\ncd api\nmake master-worker          # Start master process that spawns workers\n```\n\n### User Management CLI\n```bash\ncd api\npython scripts/user_cli.py list\npython scripts/user_cli.py create --email you@example.com --password secret --name \"You\"\n```\nUse `--help` on any subcommand to see additional options (`show`, `set-password`, `delete`).\n\n## 🌐 Frontend (Separate Repo)\n\nThe Next.js/React UI now lives in `slide-speaker-web/` (generated beside this repository). Move it to its own git project and follow the instructions in `slide-speaker-web/README.md` to continue frontend development.\n\n## ♿ Accessibility\n\nSlideSpeaker is committed to providing an inclusive experience for all users:\n- WCAG 2.1 AA compliance for web accessibility standards\n- High contrast themes available for both light and dark modes\n- Enhanced focus indicators for keyboard navigation\n- Screen reader friendly interface\n- Support for multiple languages to serve a diverse user base\n\nVisit:\n- `http://localhost:8000/docs` - API documentation\n\n## 🛠️ Configuration\n\n### Essential API Keys\n- **LLM (OpenAI)** - Required for transcript generation\n  - `OPENAI_API_KEY` (required)\n  - Optional: `OPENAI_BASE_URL` (for custom endpoints)\n  - Optional: `OPENAI_TIMEOUT`, `OPENAI_RETRIES`, `OPENAI_BACKOFF`\n\n- **Text-to-Speech**\n  - `TTS_SERVICE=openai|elevenlabs` (defaults to openai)\n  - ElevenLabs requires `ELEVENLABS_API_KEY`\n\n- **Avatar Generation** (optional)\n  - HeyGen: `HEYGEN_API_KEY`\n  - OpenAI DALL-E: Uses your `OPENAI_API_KEY`\n\n- **Storage**\n  - Defaults to local filesystem\n  - For cloud storage, configure S3 or OSS in `.env`\n\n### Storage Options\nSlideSpeaker supports multiple storage backends:\n- **Local** - Default, stores files in `api/output/`\n- **AWS S3** - Configure `AWS_S3_BUCKET_NAME` and credentials\n- **Aliyun OSS** - Configure `OSS_BUCKET_NAME` and credentials\n\n### Authentication\n- **API (FastAPI)**\n  - Password hashing uses PBKDF2-HMAC-SHA256; no additional secrets required.\n- **Next.js (web/.env)**\n  - `NEXTAUTH_SECRET` – signing key for NextAuth JWT sessions\n  - `NEXTAUTH_URL` – base URL of the Next.js app (e.g. `http://localhost:3000`)\n  - `NEXT_PUBLIC_API_BASE_URL` – base URL of the FastAPI backend (defaults to `http://localhost:8000` for local dev)\n- **NextAuth providers**\n  - Optional Google OAuth: set `GOOGLE_CLIENT_ID` / `GOOGLE_CLIENT_SECRET`\n\n## 📚 Documentation\n\n- [Installation Guide](docs/installation.md) - Detailed setup instructions\n- [API Installation Guide](docs/api-installation.md) - Backend-specific installation and configuration\n- [Backend Technical Stack](docs/backend-tech-stack.md) - Python/FastAPI architecture\n- [API Documentation](http://localhost:8000/docs) - Auto-generated API docs (when running)\n- [API Reference](docs/api.md) - Complete API reference and endpoints\n- [Pipeline Overview](docs/pipeline-overview.md) - High-level processing pipeline architecture\n- [Step Definitions](docs/step-definitions.md) - Detailed breakdown of processing steps\n- [Data Flow](docs/dataflow.md) - Data flow and state management\n- [Configuration](api/.env.example) - Environment variables reference\n- [High Contrast Themes Improvements](high-contrast-themes-improvements.md) - Details about accessibility enhancements\n- [Claude Code Guide](.claude/CLAUDE.md) - Guidance for AI coding assistants working with this repository\n\n## 📄 License\n\nMIT License - see [LICENSE](LICENSE) file for details\n\n## 🤝 Contributing\n\n1. Fork the repository\n2. Create your feature branch (`git checkout -b feature/AmazingFeature`)\n3. Commit your changes (`git commit -m 'Add some AmazingFeature'`)\n4. Push to the branch (`git push origin feature/AmazingFeature`)\n5. Open a pull request\n\n## 🆘 Support\n\nFor issues and feature requests, please [open an issue](../../issues) on GitHub.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdroxer%2Fslide-speaker-core","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdroxer%2Fslide-speaker-core","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdroxer%2Fslide-speaker-core/lists"}