https://github.com/samestrin/pyprosody
Read stories with prosody using AI models
- Host: GitHub
- URL: https://github.com/samestrin/pyprosody
- Owner: samestrin
- License: MIT
- Created: 2025-03-13T20:21:22.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-13T20:22:55.000Z (over 1 year ago)
- Last Synced: 2025-03-13T21:30:04.268Z (over 1 year ago)
- Language: Python
- Size: 42 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# pyprosody
PyProsody is a command-line Python application that takes a text file (a story) and, optionally, a model argument. Its main functionality is twofold:
- Emotion Analysis: Break the story into sentences and phrases, then use a hybrid analysis (lexical, sentiment, contextual, sarcasm/irony, pragmatic/discourse, and feature engineering) to detect emotional cues.
- Expressive Audio Generation: Convert the analyzed text into a speech audio file by adjusting prosodic parameters—such as speed, tone, and volume—to reflect the detected emotions.
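The two stages above can be illustrated with a minimal sketch. This is not PyProsody's actual code: the tiny keyword lexicon stands in for the hybrid analysis, and every name here (`EMOTION_WORDS`, `Prosody`, `PROSODY_BY_EMOTION`, `plan_prosody`) and every numeric value is a hypothetical chosen for illustration. It shows only the general shape of the pipeline: split the story into sentences, assign each an emotion label, and look up prosodic parameters for a speech synthesizer to apply.

```python
import re
from dataclasses import dataclass

# Hypothetical keyword lexicon; a real analysis would combine several
# signals (sentiment, context, sarcasm, and so on) rather than word lists.
EMOTION_WORDS = {
    "joy": {"happy", "laughed", "delighted", "smiled"},
    "sadness": {"sad", "cried", "wept", "alone"},
    "anger": {"angry", "furious", "shouted", "slammed"},
}


@dataclass(frozen=True)
class Prosody:
    rate: float    # speaking-rate multiplier, 1.0 = neutral
    pitch: float   # pitch shift in semitones, 0.0 = neutral
    volume: float  # gain multiplier, 1.0 = neutral


# Illustrative values only; they are assumptions, not tuned settings.
PROSODY_BY_EMOTION = {
    "neutral": Prosody(1.0, 0.0, 1.0),
    "joy": Prosody(1.1, 2.0, 1.1),
    "sadness": Prosody(0.85, -2.0, 0.9),
    "anger": Prosody(1.15, 1.0, 1.3),
}


def split_sentences(text):
    """Split on whitespace that follows sentence-ending punctuation."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def detect_emotion(sentence):
    """Return the emotion with the most lexicon hits, or 'neutral'."""
    words = set(re.findall(r"[a-z']+", sentence.lower()))
    scores = {name: len(words & vocab) for name, vocab in EMOTION_WORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "neutral"


def plan_prosody(text):
    """Pair each sentence with its emotion label and prosody settings."""
    plan = []
    for sentence in split_sentences(text):
        emotion = detect_emotion(sentence)
        plan.append((sentence, emotion, PROSODY_BY_EMOTION[emotion]))
    return plan


if __name__ == "__main__":
    story = "She smiled at the door. Then he shouted and slammed it! The room was quiet."
    for sentence, emotion, prosody in plan_prosody(story):
        print(f"{emotion:8} {prosody}  {sentence}")
```

Keeping the analysis and the emotion-to-prosody table separate means the keyword matcher could be swapped for a local model without changing how the audio stage consumes the plan.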
Where possible, PyProsody will run on local AI models so that the application stays self-contained and does not depend on external APIs.
This project is under development.