An open API service indexing awesome lists of open source software.

https://github.com/dmitryryumin/interspeech-2023-24-papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
https://github.com/dmitryryumin/interspeech-2023-24-papers

acoustic adaptation asr audio-signals interspeech interspeech2023 interspeech2024 language-modeling lexical-analysis linguistic-analysis machine-translation prosody self-supervised-learning signal-processing speech-analysis speech-production speech-recognition speech-synthesis speech-technology transmission

Last synced: 2 months ago
JSON representation

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

Awesome Lists containing this project

README

        


INTERSPEECH-2023-24-Papers


General Information


Awesome


Conference

Version
License: MIT



Repository Size and Activity

GitHub repo size
GitHub commit activity (branch)



Contribution Statistics

GitHub contributors
GitHub closed issues
GitHub issues
GitHub closed pull requests
GitHub pull requests



Other Metrics

GitHub last commit
GitHub watchers
GitHub forks
GitHub Repo stars
Visitors



Application


App




Progress Status


Main






---

INTERSPEECH 2024 Papers: A complete collection of influential and exciting research papers from the [*INTERSPEECH 2024*](https://interspeech2024.org/) conference. Explore the latest advances in speech and language processing. Code included. :star: the repository to support the advancement of speech technology!



INTERSPEECH 2024

---

> [!TIP]
[*The PDF version of the INTERSPEECH 2024 Conference Programme*](https://drive.google.com/file/d/1w_F9STjblCMANAZXO8l5Yy6vfDSNYFIw/view), comprises a list of all accepted full papers, their presentation order, as well as the designated presentation times.

---



Other collections of the best AI conferences




> [!important]
> Conference table will be up to date all the time.



Conference
Year


2023
2024


Computer Vision (CV)


CVPR



ICCV
 



ECCV




WACV
:heavy_minus_sign:
 


FG
:heavy_minus_sign:



Speech/Signal Processing (SP/SigProc)


ICASSP



INTERSPEECH



ISMIR
 
:heavy_minus_sign:


Natural Language Processing (NLP)


EMNLP




Machine Learning (ML)


AAAI
:heavy_minus_sign:



ICLR
:heavy_minus_sign:



ICML
:heavy_minus_sign:



NeurIPS
:heavy_minus_sign:

---

## Contributors






> [!NOTE]
> Contributions to improve the completeness of this list are greatly appreciated. If you come across any overlooked papers, please **feel free to [*create pull requests*](https://github.com/DmitryRyumin/INTERSPEECH-2023-24-Papers/pulls), [*open issues*](https://github.com/DmitryRyumin/INTERSPEECH-2023-24-Papers/issues) or contact me via [*email*](mailto:[email protected])**. Your participation is crucial to making this repository even better.

---

## [Papers-2024](https://www.isca-archive.org/interspeech_2024/) (`In progress`)


App



Section
Papers








L2 Speech, Bilingualism and Code-Switching


Papers


Preprints


Open Code


Videos




Speaker Diarization


Papers


Preprints


Open Code


Videos




Speech and Audio Analysis and Representations


Papers


Preprints


Open Code


Videos




Acoustic Event Detection, Segmentation and Classification


Papers


Preprints


Open Code


Videos




Detection and Classification of Bioacoustic Signals


Papers


Preprints


Open Code


Videos


---

## [Papers-2023](https://www.isca-archive.org/interspeech_2023/)


App



Section
Papers







Resources for Spoken Language Processing


Papers


Preprints


Open Code




Speech Synthesis: Prosody and Emotion


Papers


Preprints


Open Code




Statistical Machine Translation


Papers


Preprints


Open Code




Self-Supervised Learning in ASR


Papers


Preprints


Open Code




Prosody


Papers


Preprints


Open Code




Speech Production


Papers


Preprints


Open Code




Dysarthric Speech Assessment


Papers


Preprints


Open Code




Speech Coding: Transmission


Papers


Preprints


Open Code




Speech Recognition: Signal Processing, Acoustic Modeling, Robustness, Adaptation


Papers


Preprints


Open Code




Analysis of Speech and Audio Signals


Papers


Preprints


Open Code




Speech Recognition: Architecture, Search, and Linguistic Components


Papers


Preprints


Open Code




Speech Recognition: Technologies and Systems for New Applications


Papers


Preprints


Open Code




Lexical and Language Modeling for ASR


Papers


Preprints


Open Code




Language Identification and Diarization


Papers


Preprints


Open Code




Speech Quality Assessment


Papers


Preprints


Open Code




Feature Modeling for ASR


Papers


Preprints


Open Code




Interfacing Speech Technology and Phonetics


Papers


Preprints


Open Code




Speech Synthesis: Multilinguality


Papers


Preprints


Open Code




Speech Emotion Recognition


Papers


Preprints


Open Code




Spoken Dialog Systems and Conversational Analysis


Papers


Preprints


Open Code




Speech Coding and Enhancement


Papers


Preprints


Open Code




Paralinguistics


Papers


Preprints


Open Code




Speech Enhancement and Denoising


Papers


Preprints


Open Code




Speech Synthesis: Evaluation


Papers


Preprints


Open Code




End-to-End Spoken Dialog Systems


Papers


Preprints


Open Code




Biosignal-enabled Spoken Communication


Papers


Preprints


Open Code




Neural-based Speech and Acoustic Analysis


Papers


Preprints


Open Code




DiGo - Dialog for Good: Speech and Language Technology for Social Good


Papers


Preprints


Open Code




Spoken Language Processing: Translation, Information Retrieval, Summarization, Resources, and Evaluation


Papers


Preprints


Open Code




Speech, Voice, and Hearing Disorders


Papers


Preprints


Open Code




Spoken Term Detection and Voice Search


Papers


Preprints


Open Code




Models for Streaming ASR


Papers


Preprints


Open Code




Source Separation


Papers


Preprints


Open Code




Speech Perception


Papers


Preprints


Open Code




Phonetics and Phonology: Languages and Varieties


Papers


Preprints


Open Code




Speaker and Language Identification


Papers


Preprints


Open Code




Speech Synthesis and Voice Conversion


Papers


Preprints


Open Code




Speech and Language in Health: from Remote Monitoring to Medical Conversations


Papers


Preprints


Open Code




Novel Transformer Models for ASR


Papers


Preprints


Open Code




Speaker Recognition


Papers


Preprints


Open Code




Cross-lingual and Multilingual ASR


Papers


Preprints


Open Code




Voice Conversion


Papers


Preprints


Open Code




Pathological Speech Analysis


Papers


Preprints


Open Code




Multimodal Speech Emotion Recognition


Papers


Preprints


Open Code




Phonetics, Phonology, and Prosody


Papers


Preprints


Open Code




Speech Coding: Privacy


Papers


Preprints


Open Code




Analysis of Neural Speech Representations


Papers


Preprints


Open Code




End-to-end ASR


Papers


Preprints


Open Code




Spoken Language Understanding, Summarization, and Information Retrieval


Papers


Preprints


Open Code




Invariant and Robust Pre-trained Acoustic Models


Papers


Preprints


Open Code




Speech Synthesis: Representation Learning


Papers


Preprints


Open Code




Speech Perception, Production, and Acquisition


Papers


Preprints


Open Code




Acoustic Model Adaptation for ASR


Papers


Preprints


Open Code




Speech Synthesis: Expressivity


Papers


Preprints


Open Code




Multi-modal Systems


Papers


Preprints


Open Code




Question Answering from Speech


Papers


Preprints


Open Code




Multi-talker Methods in Speech Processing


Papers


Preprints


Open Code




Sociophonetics


Papers


Preprints


Open Code




Speaker and Language Diarization


Papers


Preprints


Open Code




Anti-Spoofing for Speaker Verification


Papers


Preprints


Open Code




Speech Coding: Intelligibility


Papers


Preprints


Open Code




New Computational Strategies for ASR Training and Inference


Papers


Preprints


Open Code




MERLIon CCS Challenge: Multilingual Everyday Recordings - Language Identification On Code-Switched Child-Directed Speech


Papers


Preprints


Open Code




Health-Related Speech Analysis


Papers


Preprints


Open Code




Automatic Audio Classification and Audio Captioning


Papers


Preprints


Open Code




Speech Synthesis


Papers


Preprints


Open Code




Speech Synthesis: Controllability and Adaptation


Papers


Preprints


Open Code




Search Methods and Decoding Algorithms for ASR


Papers


Preprints


Open Code




Speech Signal Analysis


Papers


Preprints


Open Code




Connecting Speech-science and Speech-technology for Children's Speech


Papers


Preprints


Open Code




Dialog Management


Papers


Preprints


Open Code




Speech Activity Detection and Modeling


Papers


Preprints


Open Code




Multilingual Models for ASR


Papers


Preprints


Open Code




Speech Enhancement and Bandwidth Expansion


Papers


Preprints


Open Code




Articulation


Papers


Preprints


Open Code




Neural Processing of Speech and Language: Encoding and Decoding the Diverse Auditory Brain


Papers


Preprints


Open Code




Perception of Paralinguistics


Papers


Preprints


Open Code




Technologies for Child Speech Processing


Papers


Preprints


Open Code




Speech Synthesis: Multilinguality; Evaluation


Papers


Preprints


Open Code




Show and Tell: Health Applications and Emotion Recognition


Papers


Preprints


Open Code




Show and Tell: Speech Tools, Speech Enhancement, Speech Synthesis


Papers


Preprints


Open Code




Show and Tell: Language Learning and Educational Resources


Papers


Preprints


Open Code




Show and Tell: Media and Commercial Applications


Papers


Preprints


Open Code


---

## Key Terms

> To be added soon

---

## Star History



Star History Chart