Projects in Awesome Lists tagged with speech-detection
A curated list of projects in awesome lists tagged with speech-detection .
https://github.com/smacke/ffsubsync
Automagically synchronize subtitles with video.
alignment audio caption captions fast-fourier-transform ffmpeg fft speech-detection srt srt-subtitles string-alignment subtitle subtitles sync synchronization vad video vlc vlc-media-player voice-activity-detection
Last synced: 14 May 2025
https://github.com/ina-foss/inaspeechsegmenter
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
audio-analysis female gender gender-classification gender-equality male mirex music music-detection noise praat segmentation speaker-gender speech speech-activity-detection speech-detection speech-music speech-segmentation transgender voice-activity-detection
Last synced: 14 May 2025
https://github.com/filippogiruzzi/voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
artificial-intelligence deep-learning deep-neural-networks deeplearning librispeech librispeech-dataset machine-learning mfcc-features python resnet speech speech-detection speech-recognition tensorflow time-series time-series-classification vad voice-activity-detection
Last synced: 07 May 2025
https://github.com/gkonovalov/android-vad
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
android audio-processing deep-neural-networks dnn gmm neural-networks offline on-device-ai onnx-models real-time silero silero-vad speech-detection speech-recoginition vad voice-activity-detection voice-activity-detector voice-detection webrtc yamnet
Last synced: 16 May 2025
https://github.com/gtreshchev/RuntimeSpeechRecognizer
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
audio-processing openai speech-detection speech-processing speech-recognition speech-to-text ue4 ue4-plugin ue5 ue5-plugin unreal-engine unreal-engine-4 unreal-engine-5 voice-recognition whis whisper whisper-ai whisper-cpp
Last synced: 08 Apr 2025
https://github.com/gtreshchev/runtimespeechrecognizer
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.
audio-processing openai speech-detection speech-processing speech-recognition speech-to-text ue4 ue4-plugin ue5 ue5-plugin unreal-engine unreal-engine-4 unreal-engine-5 voice-recognition whis whisper whisper-ai whisper-cpp
Last synced: 19 Feb 2025
https://github.com/tympanix/subsync
Synchronize your subtitles using machine learning
delay fix machine-learning mfcc neural-network shift shift-subtitle speech-detection subsync subtitle subtitles
Last synced: 09 Apr 2025
https://github.com/baochuquan/ios-vad
iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
audio-processing deep-neural-network dnn gmm ios neural-networks offline on-device-ai onnex-models real-time silero silero-vad speech-detection speech-recognition vad voice-activity-detection voice-activity-detector voice-detection webrtc yamnet
Last synced: 13 May 2025