https://github.com/DmitryRyumin/ICASSP-2023-24-Papers
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
https://github.com/DmitryRyumin/ICASSP-2023-24-Papers
asr denoising domain-adaptation face-recognition generative-models icassp icassp2023 icassp2024 image-generation keyword-spotting language-modeling multimodal-learning music-generation self-supervised-learning semantic-segmentation signal-processing signal-restoration speech-recognition spoken-language-understanding vad
Last synced: 11 months ago
JSON representation
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
- Host: GitHub
- URL: https://github.com/DmitryRyumin/ICASSP-2023-24-Papers
- Owner: DmitryRyumin
- License: mit
- Created: 2023-08-01T09:17:13.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2024-10-30T01:47:26.000Z (over 1 year ago)
- Last Synced: 2024-10-30T03:56:24.418Z (over 1 year ago)
- Topics: asr, denoising, domain-adaptation, face-recognition, generative-models, icassp, icassp2023, icassp2024, image-generation, keyword-spotting, language-modeling, multimodal-learning, music-generation, self-supervised-learning, semantic-segmentation, signal-processing, signal-restoration, speech-recognition, spoken-language-understanding, vad
- Language: Python
- Homepage:
- Size: 8.8 MB
- Stars: 388
- Watchers: 29
- Forks: 17
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
General Information
Repository Size and Activity
Contribution Statistics
Other Metrics
GitHub Actions
Application
Progress Status
Main
---
ICASSP 2024 Papers: A complete collection of influential and exciting research papers from the [*ICASSP 2024*](https://2024.ieeeicassp.org/) conference. Explore the latest advancements in acoustics, speech and signal processing. Code included. :star: the repository to support the advancement of audio and signal processing!
---
> [!TIP]
[*Online version of the ICASSP 2024 Conference Technical Program*](https://2024.ieeeicassp.org/program-schedule/), which lists all accepted full papers along with their presentation mode and time.
---
Other collections of the best AI conferences
> [!important]
> Conference table will be up to date all the time.
Conference
Year
2023
2024
Computer Vision (CV)
CVPR
ICCV
ECCV
WACV
:heavy_minus_sign:
FG
:heavy_minus_sign:
Speech/Signal Processing (SP/SigProc)
ICASSP
INTERSPEECH
ISMIR
:heavy_minus_sign:
Natural Language Processing (NLP)
EMNLP
Machine Learning (ML)
AAAI
:heavy_minus_sign:
ICLR
:heavy_minus_sign:
ICML
:heavy_minus_sign:
NeurIPS
:heavy_minus_sign:
---
## Contributors
> [!NOTE]
> Contributions to improve the completeness of this list are greatly appreciated. If you come across any overlooked papers, please **feel free to [*create pull requests*](https://github.com/DmitryRyumin/ICASSP-2023-24-Papers/pulls), [*open issues*](https://github.com/DmitryRyumin/ICASSP-2023-24-Papers/issues) or contact me via [*email*](mailto:neweraairesearch@gmail.com)**. Your participation is crucial to making this repository even better.
---
## Papers 
Section
Papers
Main
Audio-Visual Speech Processing
Vision and Language
Acoustic Signal Processing
Deep Learning Techniques
Speech Enhancement and Separation - Diffusion and other Probabilistic Models
ASPS Lecture
Distributed and Federated Learning
Transfer Learning
Voice Conversion
Graph Neural Networks
Language Resources, Metrics and Systems
Watermarking and Data Hiding
Signal and Information Processing over Graphs
Integrated Sensing and Communications
Audio Events Detection and Classification; Music Information Retrieval
Language Understanding and Computational Semantics - NLP Tasks
Physiological and Wearable Signal Processing
Speech Enhancement; Music Information Retrieval
Multimodal Medical Image Fusion and Analysis
Sparse/Low-Dimensional Signal Processing
Robust and Sustainable Machine Learning
Machine Learning for Image and Video Processing
Deep Learning Generalization
Distributed Processing and Federated Learning
Biological Image Analysis
Learning from Multimodal Data
Biometrics
Detection and Classification
Multimedia Coding
Anonymisation, Data Privacy and Hiding
Quality Assessment and Anomaly Detection
Signal Filtering, Reconstruction, Restoration and Enhancement
Speech Emotion Recognition and Analysis
Deep Generative Models
Context and LLM Speech Recognition
Music Information Retrieval
Multimodal Processing: Vision + Language
Environmental Sound Synthesis and Generation
Biomedical and Biological Image Processing
DoA Estimation
Tracking
Machine Learning for Communications
Image and Video Processing for Watermarking and Security
Self-Supervised Learning for Speech Processing
Deep Learning for Image and Video Processing
Image, Video, and 3D Content Generation
Classification of Acoustic Scenes and Events
Reinforcement Learning
Subspace and Manifold Learning
Active Noise Control and Echo Cancellation; Source Separation
Machine Learning, Detection and Classification
Machine Learning for Audio, Speech and Music Processing
Multimedia Generation and Synthesis
Medical Image Detection and Segmentation
Multimedia Forensics and Cybersecurity
Estimation Theory and Methods
Emerging Methods for Biomedical Image and Signal Processing
Text to Speech Generation
Audio Classification, Detection and Localization
Self-Supervised and Semi-Supervised Learning
Multichannel/Multimodal Speech Recognition
Speaker Verification
Speaker Diarization
Adversarial Machine Learning
Machine Learning Methods for Language
SPED: Signal Processing Education
Multimedia Quality of Experience
Domain-Enriched Learning for Medical Image Processing
Speech Enhancement and Separation
Image Denoising
ASPS Poster
ASR - New Algorithms and Approaches
Data Mining and Big Data
Language Understanding and Computational Semantics - Machine Learning
Explainable and Interpretable Machine Learning
Neuroimaging and Brain/Human-Computer Interfaces
Localization, DOA Estimation, Spatial Audio Recording and Reproduction
Perception and Processing for Autonomous Systems and Applications
Computational Imaging