https://github.com/uqqu/sync_book
audiobook generator with smart personalized translation
https://github.com/uqqu/sync_book
audiobook audiobooks foreign-language language-learning nlp python
Last synced: 2 months ago
JSON representation
audiobook generator with smart personalized translation
- Host: GitHub
- URL: https://github.com/uqqu/sync_book
- Owner: uqqu
- Created: 2024-10-07T12:12:35.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-01-20T06:12:09.000Z (over 1 year ago)
- Last Synced: 2025-01-20T07:23:28.319Z (over 1 year ago)
- Topics: audiobook, audiobooks, foreign-language, language-learning, nlp, python
- Language: Python
- Homepage:
- Size: 241 KB
- Stars: 0
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Smart Sync Audio Book
The project is designed for automatic generation of audiobooks in a foreign languages, with personalized translation and interval fixation of words. When generating an audiobook, each sentence is accompanied by translations of words and expressions that are new or have not met for a long time, according to the interval method, for a certain user, as well as a full translation of the sentence, if the sentence contains a lot of new words. These words and expressions are stored in a personal dictionary, and used in subsequent generations. The dictionary takes into account both specific forms of words and their basic lemmas, with different fixation intervals.
The input is the text in the original language. You can also add your own translation file to be used for generation, or leave this task to an external translator (currently the choice is between local Argos Translate and the Google Cloud Translation API). After matching words and expressions in the original and translation, speech synthesis is performed using the provider specified by the user with the specified settings of speaker, model, speed, …. Parts of the audio track with individual words for translation are generated separately, or can be additionally recognized from the generated full sentence using MFA (Montreal Forced Aligner) and reused in the dictionary, at the user's choice. The speed at which sentences and the dictionary are voiced is set separately by the user.
Examples of the output can be found on [YouTube](https://www.youtube.com/@SmartBook-en-ru)