Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with vad

A curated list of projects in awesome lists tagged with vad.

https://github.com/modelscope/funasr

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

audio-visual-speech-recognition conformer dfsmn paraformer pretrained-model punctuation pytorch rnnt speaker-diarization speech-recognition speechgpt speechllm vad voice-activity-detection whisper

Last synced: 26 Sep 2024

https://github.com/k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn, without an Internet connection. Supports iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A, etc.

asr c cpp csharp go kotlin python speech-recognition vad voice-activity-detection

Last synced: 30 Sep 2024

https://github.com/jtkim-kaist/VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

acam attention bdnn data dnn lstm speech speech-activity-detection speech-recognition vad voice-activity-detection voice-detection

Last synced: 03 Aug 2024

https://github.com/amsehili/auditok

An audio/acoustic activity detection and audio segmentation tool

audio-activities audio-data audio-segmentation vad voice-activity-detection voice-detection

Last synced: 31 Jul 2024

https://github.com/shashikg/whispers2t

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

asr deep-learning speech-recognition speech-to-text tensorrt tensorrt-llm vad voice-activity-detection whisper

Last synced: 26 Sep 2024

https://github.com/shashikg/WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

asr deep-learning speech-recognition speech-to-text tensorrt tensorrt-llm vad voice-activity-detection whisper

Last synced: 03 Aug 2024

https://github.com/DmitryRyumin/ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

asr denoising domain-adaptation face-recognition generative-models icassp icassp2023 icassp2024 image-generation keyword-spotting language-modeling multimodal-learning music-generation self-supervised-learning semantic-segmentation signal-processing signal-restoration speech-recognition spoken-language-understanding vad

Last synced: 05 Aug 2024

https://github.com/eesungkim/Voice_Activity_Detector

A statistical model-based voice activity detector

vad voice-activity-detection voice-detection

Last synced: 03 Aug 2024

https://github.com/Picovoice/cobra

On-device voice activity detection (VAD) powered by deep learning

on-device speech-recognition vad voice-activity voice-activity-detection voice-activity-detector

Last synced: 03 Aug 2024

https://github.com/0vercl0k/sic

Enumerate user mode shared memory mappings on Windows.

driver ntoskrnl prototype-pte shared-memory shm vad windows-10 windows-kernel

Last synced: 04 Aug 2024

https://github.com/mounalab/LSTM-RNN-VAD

Voice Activity Detection LSTM-RNN learning model

lstm lstm-neural-network nlp-machine-learning rnn rnn-tensorflow tensorflow vad

Last synced: 03 Aug 2024

https://github.com/EtienneAb3d/karaok-AI

Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)

djing karaoke karaoke-maker lyrics mp3-player music party-apps sound-processing speech-to-text srt-subtitles subtitles vad whisper

Last synced: 01 Aug 2024

https://github.com/lgrammel/whisperwriter

Local & private voice-controlled notepad using whisper.cpp

nextjs stt transcription vad whisper-cpp

Last synced: 06 Aug 2024

https://github.com/daanzu/py-silero-vad-lite

Lightweight wrapper for Silero VAD using an internal ONNX Runtime, with no Python package dependencies

python speech speech-processing vad voice voice-activity-detection

Last synced: 01 Oct 2024

https://github.com/OpenVoiceOS/ovos-vad-plugin-webrtcvad

OVOS plugin for voice activity detection using webrtcvad

openvoiceos ovos vad voice-activity-detection

Last synced: 04 Aug 2024

https://github.com/samfisherirl/unsilencevad

The only Silence Remover via VAD filter on GitHub, akin to Adobe Premiere Pro Text

ai python pytorch silences vad

Last synced: 01 Oct 2024

https://github.com/OpenVoiceOS/ovos-vad-plugin-silero

OVOS plugin for voice activity detection using Silero VAD

openvoiceos ovos plugin vad voice-activity-detection

Last synced: 04 Aug 2024