{"id":30027157,"url":"https://github.com/datascience-py/subtitel-generator","last_synced_at":"2025-08-06T12:35:52.887Z","repository":{"id":306087953,"uuid":"1024906797","full_name":"DataScience-py/subtitel-generator","owner":"DataScience-py","description":"This is local subtitel generator code.","archived":false,"fork":false,"pushed_at":"2025-07-31T15:07:35.000Z","size":307,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-07-31T18:56:01.310Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/DataScience-py.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-07-23T12:30:32.000Z","updated_at":"2025-07-31T15:07:39.000Z","dependencies_parsed_at":"2025-07-23T16:24:33.903Z","dependency_job_id":"e375c688-26e5-4f25-b8f0-2d9a1e9425ae","html_url":"https://github.com/DataScience-py/subtitel-generator","commit_stats":null,"previous_names":["datascience-py/subtitel-generator"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/DataScience-py/subtitel-generator","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataScience-py%2Fsubtitel-generator","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataScience-py%2Fsubtitel-generator/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataScience-py%2Fsubtitel-generator/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataScience-py%2Fsubtitel-generator/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/DataScience-py","download_url":"https://codeload.github.com/DataScience-py/subtitel-generator/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataScience-py%2Fsubtitel-generator/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":269082860,"owners_count":24357135,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-06T02:00:09.910Z","response_time":99,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-08-06T12:35:51.821Z","updated_at":"2025-08-06T12:35:52.862Z","avatar_url":"https://github.com/DataScience-py.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Subtitle Generator\n\nA Python project for generating subtitles from audio files using Voice Activity Detection (VAD) and Speech-to-Text (STT) models.\n\n\u003e The subtitle language matches the language of the selected Vosk model. Subtitles can be generated for any language supported by both Vosk and Silero-VAD models.\n\u003e Currently, there is no translation module, so speech is transcribed to text in the original language only.\n\n\n---\n\n## Features Checklist\n\n- [x] ~~Voice Activity Detection (VAD) using Silero VAD~~\n- [x] ~~Speech-to-Text (STT) using Vosk (English )~~\n- [x] ~~Support any vosk model.~~\n- [x] ~~Subtitle file generation (SRT format)~~\n- [ ] Translation support for other languages\n- [ ] Text-to-Speech (TTS) integration\n- [ ] CLI interface\n- [ ] GUI interface\n- [ ] Docker support\n\n---\n\n## Current Limitations\n\n- **The subtitle language is determined by the Vosk model you use.**  \n  Subtitles can be generated for any language supported by both Vosk and Silero-VAD.\n- **No translation module is implemented yet.**  \n  Speech is transcribed to text in the original language only.\n\n---\n\n## Installation\n\n1. Clone the repository:\n   ```bash\n   git clone https://github.com/yourusername/subtitel-generator.git\n   cd subtitel-generator\n   ```\n2. Install dependencies using [Poetry](https://python-poetry.org/):\n   ```bash\n   poetry install\n   ```\n3. Download the required Vosk model and place its folder inside the `models/vosk/` directory. For example, for English, use `models/vosk/vosk-en` (where `vosk-en` is any Vosk model folder you want to use).\n   No additional setup is required: the program will automatically use the model from the specified folder.\n\n---\n\n## Usage Example\n\nSee `src/main.py` for a runnable example. Basic usage:\n\n```python\nfrom subtitel_generator import SubtitelGenerator\nfrom subtitel_generator.file_generator import SrtSubtitleFileGenerator\nfrom subtitel_generator.speech_to_text import VoskSTT\nfrom subtitel_generator.voive_activation_detector import VADSilero\n\ns = SubtitelGenerator(\n    vad=VADSilero(),\n    stt=VoskSTT(),\n    file_generater=SrtSubtitleFileGenerator(),\n)\n\ns.generate(audio_file_path=\"example/Example_audio_endlish_small.wav\") # Path to the audio file\n```\n\n---\n\n## Project Structure\n\n- `src/subtitel_generator/` — Main package\n  - `subtitel_generator.py` — Main orchestration class\n  - `file_generator/` — Subtitle file generators (e.g., SRT)\n  - `speech_to_text/` — Speech-to-text models (Vosk)\n  - `voive_activation_detector/` — Voice activity detection (Silero)\n  - `logger/` — Logging utilities\n- `models/` — Pretrained models (e.g., Vosk)\n- `example/` — Example audio files\n- `tests/` — Unit tests\n\n---\n\n## Testing\n\nRun tests with:\n```bash\npoetry run pytest\n```\n\n---\n\n## License\n\nThis project is licensed under the terms of the [APACHE2.0 License](LICENSE).\n\n---\n\n## Contributing\n\nContributions are welcome! Please open issues or submit pull requests.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdatascience-py%2Fsubtitel-generator","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdatascience-py%2Fsubtitel-generator","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdatascience-py%2Fsubtitel-generator/lists"}