An open API service indexing awesome lists of open source software.

https://github.com/madzadev/voice-cue

πŸ“£ Find sentiments, tags, entities, and actions in your voice recordings instantly
https://github.com/madzadev/voice-cue

audio audio-analysis audio-processing speech speech-recognition speech-to-text transcript voice voice-recognition

Last synced: 11 months ago
JSON representation

πŸ“£ Find sentiments, tags, entities, and actions in your voice recordings instantly

Awesome Lists containing this project

README

          

# Introducing VoiceCue πŸ’ΏπŸ“

Forget about listening to long and boring voice recordings and process them manually.

## About πŸ‘€

[VoiceCue](https://cue.madza.dev) lets you find sentiments, tags, entities, and actions in your voice recordings like a DJ.

![Preview](https://github.com/madzadev/voice-cue/blob/main/public/media-card.png)

The app workflow is as simple as uploading your voice recording, selecting which type of analysis to perform and clicking on the generated cues in the list to instantly navigate to its exact position in the voice recording.

## Features ✨

1. Voice recognition - AI based solution by [Deepgram](https://deepgram.com)
2. General stats - an overview about voice recording
3. Sentiment analysis - positive and negative word detection
4. Word cloud generation - most used word classification
5. Entity name recognition - categories such as person, place, etc
6. Activity tracking - find actions in past, present or future
7. Interactive transcript - see progress or click to control it
8. Speaker detection - total number of speakers in recording
9. Cue word usage - transcript samples for better context
10. Custom search - extended ability to query for cues
11. Waveform preview - see the dynamics of voice, identify silences
12. Audio controls - play, pause, fast forward and backward
13. Drag and drop support - drop audio in the file select area
14. Upload MP3 files - most commonly used audio format
15. Progress loaders - Improved UX for loading transcripts
16. Fully responsive - works fine on mobile and tablets
17. Colorful UI - for easier interaction and highlighting

## Tech stack πŸ› οΈ

[NextJS](https://nextjs.org) - React application framework

[Deepgram](https://deepgram.com) - for AI based speech recognition

[compromise](https://www.npmjs.com/package/compromise), [sentiment](https://www.npmjs.com/package/sentiment) - for text processing

[react-tagcloud](https://www.npmjs.com/package/react-tagcloud) - to generate word cloud

[react-tabs](https://www.npmjs.com/package/react-tabs) - for navigation panels

[react-drag-drop-files](https://www.npmjs.com/package/react-drag-drop-files) - for drag and drop support

[wavesurfer.js](https://www.npmjs.com/package/wavesurfer.js) - to generate the audio waveform

[GitHub](https://github.com) - to host the code

[Vercel](https://vercel.com) - to deploy the project

[ESLint](https://eslint.org/), [prettier](https://prettier.io/) - for linting and code formatting

[Namecheap](https://namecheap.com) - for custom subdomain

## Licence πŸ“š

VoiceCue is an open source project. The feature requests are welcome.

The project is under the terms of [MIT license](https://choosealicense.com/licenses/mit/).