awesome-keyword-spotting
This repository is a curated list of awesome resources for speech keyword spotting (wake-up word detection).
https://github.com/zycv/awesome-keyword-spotting
Open Source Code
Others
- GitHub: A depthwise separable convolutional neural network for keyword spotting on an embedded system
- GitHub: Wav2KWS: Transfer Learning from Speech Representations for Keyword Spotting (State-of-the-Art)
- GitHub: Mining Effective Negative Training Samples for Keyword Spotting
- GitHub: Hello Edge: Keyword Spotting on Microcontrollers
- GitHub: Few-Shot Keyword Spotting in Any Language
- GitHub: Learning Efficient Representations for Keyword Spotting with Triplet Loss
- GitHub: The 2020 Personalized Voice Trigger Challenge: Open Database, Evaluation Metrics and the Baseline Systems
- GitHub: MicroNets: Neural Network Architectures for Deploying TinyML Applications on Commodity Microcontrollers
- GitHub: Neural ODE with Temporal Convolution and Time Delay Neural Networks for Small-Footprint Keyword Spotting
- GitHub: Few-Shot Keyword Spotting With Prototypical Networks
- Region Proposal Network Based Small-Footprint Keyword Spotting
- GitHub: Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
- GitHub: Benchmarking Keyword Spotting Efficiency on Neuromorphic Hardware
- Official code: Improving reverberant speech training using diffuse acoustic simulation
Software
Datasets
Others
- Region Proposal Network Based Small-Footprint Keyword Spotting
- Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition
- http://download.tensorflow.org/data/speech_commands_v0.02.tar.gz
- http://download.tensorflow.org/data/speech_commands_test_set_v0.02.tar.gz
- MobvoiHotwords
- A far-field text-dependent speaker verification database for AISHELL Speaker Verification Challenge 2019
- HI-MIA
Publications
2021
- Learning Efficient Representations for Keyword Spotting with Triplet Loss
- Wav2KWS: Transfer Learning from Speech Representations for Keyword Spotting
- BBS-KWS: The Mandarin Keyword Spotting System Won the Video Keyword Wakeup Challenge
- Implicit Acoustic Echo Cancellation for Keyword Spotting and Device-Directed Speech Detection
- WaveSense: Efficient Temporal Convolutions with Spiking Neural Networks for Keyword Spotting
- End-to-end Keyword Spotting using Xception-1d
- Multi-task Voice Activated Framework using Self-supervised Learning
- Lightweight dynamic filter for keyword spotting
- Audiomer: A Convolutional Transformer for Keyword Spotting
- Behavior of Keyword Spotting Networks Under Noisy Conditions
- A Separable Temporal Convolution Neural Network with Attention for Small-Footprint Keyword Spotting
- Text Anchor Based Metric Learning for Small-footprint Keyword Spotting
- Multi-task Learning with Cross Attention for Keyword Spotting
- AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data
- An Integrated Framework for Two-pass Personalized Voice Trigger
- Zero-Shot Federated Learning with New Classes for Audio Classification
- Broadcasted Residual Learning for Efficient Keyword Spotting
- Encoder-Decoder Neural Architecture Optimization for Keyword Spotting
- Teaching keyword spotters to spot new keywords with limited examples
- Noisy student-teacher training for robust keyword spotting
- A Streaming End-to-End Framework For Spoken Language Understanding
- Streaming Transformer for Hardware Efficient Voice Trigger Detection and False Trigger Mitigation
- Efficient Keyword Spotting through long-range interactions with Temporal Lambda Networks
- End-to-end Keyword Spotting using Neural Architecture Search and Quantization
- The DKU System Description for The Interspeech 2021 Auto-KWS Challenge
- Few-Shot Keyword Spotting in Any Language
- Keyword Transformer: A Self-Attention Model for Keyword Spotting
- The 2020 Personalized Voice Trigger Challenge: Open Database, Evaluation Metrics and the Baseline Systems
Others
- Convolutional Recurrent Neural Networks for Small-Footprint Keyword Spotting
- A Cascade Architecture for Keyword Spotting on Mobile Devices
- Multiple-Instance, Cascaded Classification for Keyword Spotting in Narrow-Band Audio
- An experimental analysis of the power consumption of convolutional neural networks for keyword spotting
- Hello Edge: Keyword Spotting on Microcontrollers
- Deep residual learning for small-footprint keyword spotting
- Streaming small-footprint keyword spotting using sequence-to-sequence models
- Small-footprint keyword spotting using deep neural network and connectionist temporal classifier
- Compressed Time Delay Neural Network for Small-Footprint Keyword Spotting
- Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
- Trainable Frontend For Robust and Far-Field Keyword Spotting
- Low Resource High Accuracy Keyword Spotting
- Online keyword spotting with a character-level recurrent neural network
- Structured Transforms for Small-Footprint Deep Learning
- Small-footprint keyword spotting using deep neural networks
2022
- Efficient dynamic filter for robust and low computational feature extraction
- Improving Feature Generalizability with Multitask Learning in Class Incremental Learning
- Understanding Audio Features via Trainable Basis Functions
- Depth Pruning with Auxiliary Networks for TinyML
- AB/BA analysis: A framework for estimating keyword spotting recall improvement while maintaining audio privacy
- Production federated keyword spotting via distillation, filtering, and joint federated-centralized training
- Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness
- Delta Keyword Transformer: Bringing Transformers to the Edge through Dynamically Pruned Multi-Head Self-Attention
- On the Efficiency of Integrating Self-supervised Learning and Meta-learning for User-defined Few-shot Keyword Spotting
- Target-aware Neural Architecture Search and Deployment for Keyword Spotting
- Learning Decoupling Features Through Orthogonality Regularization
- Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting
- BiFSMN: Binary Neural Network for Keyword Spotting
- A Fast Network Exploration Strategy to Profile Low Energy Consumption for Keyword Spotting
- Progressive Continual Learning for Spoken Keyword Spotting
- ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting
2020
- Optimize what matters: Training DNN-HMM Keyword Spotting Model Using End Metric
- Training Wake Word Detection with Synthesized Speech Data on Confusion Words
- IEEE SLT 2021 Alpha-mini Speech Challenge: Open Datasets, Tracks, Rules and Baselines
- A depthwise separable convolutional neural network for keyword spotting on an embedded system
- Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution
- MicroNets: Neural Network Architectures for Deploying TinyML Applications on Commodity Microcontrollers
- Neural Architecture Search For Keyword Spotting
- Seeing wake words: Audio-visual keyword spotting
- AutoKWS: Keyword Spotting with Differentiable Architecture Search
- Hardware Aware Training for Efficient Keyword Spotting on General Purpose and Specialized Hardware
- Neural ODE with Temporal Convolution and Time Delay Neural Networks for Small-Footprint Keyword Spotting
- WSRNet: Joint Spotting and Recognition of Handwritten Words
- Domain Aware Training for Far-field Small-footprint Keyword Spotting
- Very Fast Keyword Spotting System with Real Time Factor Below 0.01
- Few-Shot Keyword Spotting With Prototypical Networks
- Exploring Filterbank Learning for Keyword Spotting
- Mining Effective Negative Training Samples for Keyword Spotting
- Multi-Task Network for Noise-Robust Keyword Spotting and Speaker Verification using CTC-based Soft VAD and Global Query Attention
- Streaming keyword spotting on mobile devices
- Metric Learning for Keyword Spotting
- End-to-End Multi-Look Keyword Spotting
- Depthwise Separable Convolutional ResNet with Squeeze-and-Excitation Blocks for Small-footprint Keyword Spotting
- Phoneme boundary detection using learnable segmental features - Bar-Ilan University & Facebook Inc., 2020.02
- Small-Footprint Open-Vocabulary Keyword Spotting with Quantized LSTM Networks
- Training Keyword Spotters with Limited and Synthesized Speech Data
- Learning to detect keyword parts and whole by smoothed max pooling
- Multi-Task Learning for Speaker Verification and Voice Trigger Detection
- Performance-Oriented Neural Architecture Search
- Training Keyword Spotting Models on Non-IID Data with Federated Learning
2019
- Small-footprint keyword spotting with graph convolutional network
- Predicting detection filters for small footprint open-vocabulary keyword spotting
- Temporal feedback convolutional recurrent neural networks for keyword spotting
- Small-footprint keyword spotting on raw audio data with sinc-convolutions
- Orthogonality constrained multi-head attention for keyword spotting
- Query-by-example on-device keyword spotting
- Adversarial example detection by classification for deep speech recognition
- Sub-band Convolutional Neural Networks for Small-footprint Spoken Term Classification
- Improving reverberant speech training using diffuse acoustic simulation
- Multi-layer Attention Mechanism for Speech Keyword Recognition
- A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting
- Keyword Spotting for Hearing Assistive Devices Robust to External Speakers
- Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
- SpeechYOLO: Detection and Localization of Speech Objects - Bar-Ilan University, 2019.04
- Ternary hybrid neural-tree networks for highly constrained IoT applications
- Stochastic Adaptive Neural Architecture Search for Keyword Spotting
- Region Proposal Network Based Small-Footprint Keyword Spotting
- An In-Vehicle Keyword Spotting System with Multi-Source Fusion for Vehicle Applications
- Efficient keyword spotting using dilated convolutions and gating
- End-to-end streaming keyword spotting
- Prototypical metric transfer learning for continuous speech keyword spotting with limited training data
- A Channel-Pruned and Weight-Binarized Convolutional Neural Network for Keyword Spotting
2018
- Benchmarking Keyword Spotting Efficiency on Neuromorphic Hardware
- Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks
- Efficient Voice Trigger Detection for Low Resource Hardware
- Sequence-to-sequence models for small-footprint keyword spotting
- End-to-end Models with auditory attention in Multi-channel Keyword Spotting
- Hierarchical Neural Network Architecture In Keyword Spotting
- Feature exploration for almost zero-resource ASR-free keyword spotting using a multilingual bottleneck extractor and correspondence autoencoders
- DONUT: CTC-based Query-by-Example Keyword Spotting
- JavaScript Convolutional Neural Networks for Keyword Spotting in the Browser: An Experimental Analysis
- Data augmentation for robust keyword spotting under playback interference
- Sequence discriminative training for deep learning based acoustic keyword spotting
- Weight-importance sparse training in keyword spotting
- Efficient keyword spotting using time delay neural networks
- Zero-shot keyword spotting for visual speech recognition in-the-wild
- ASR-free CNN-DTW keyword spotting using multilingual bottleneck features for almost zero-resource languages
- Resource-Efficient Neural Architect
- Visually grounded cross-lingual keyword spotting in speech
- Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition
- Developing far-field speaker system via teacher-student learning
- An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling
- Speech recognition: keyword spotting through image recognition
- Attention-based End-to-End Models for Small-Footprint Keyword Spotting
- Federated learning for keyword spotting
Challenge
Others
- The 2020 Personalized Voice Trigger Challenge (PVTC2020)
- AutoSpeech 2020 Challenge
Survey
Programming Languages
Keywords
- keyword-spotting: 8
- speech-recognition: 6
- wake-word-detection: 4
- hotword-detection: 3
- voice-recognition: 3
- kws: 2
- on-device: 2
- voice-control: 2
- handsfree: 1
- prototypical-networks: 1
- few-shot-recognition: 1
- audio: 1
- pytorch: 1
- deep-learning: 1
- query-by-example: 1
- keyword-search: 1
- few-shot-learning: 1
- python: 1
- microcontrollers: 1
- machine-learning: 1
- deep-neural-networks: 1
- cmsis-nn: 1
- arm: 1
- transfer-learning: 1
- state-of-the-art: 1
- speech-commands: 1
- raspberry-pi: 1
- embedded-systems: 1
- voice-user-interface: 1
- voice-interface: 1
- voice-commands: 1
- voice-command: 1
- voice-assistant: 1
- speech-recoginition: 1
- nlu: 1
- natural-language-understanding: 1
- stt: 1
- speech-to-text: 1
- speech: 1
- node: 1
- alexa: 1
- wake-word-engine: 1
- wake-word: 1
- voice-activation: 1
- trigger-word-detection: 1
- keyword-spotter: 1
- hotword-detector: 1
- hotword: 1
- fine-tuning: 1