Projects in Awesome Lists tagged with audio-visual-learning
A curated list of projects in awesome lists tagged with audio-visual-learning.
https://github.com/ali-vilab/dreamtalk
Official implementation of the paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
audio-visual-learning face-animation talking-head video-generation
Last synced: 15 May 2025
https://opennlplab.github.io/AVSBench/
[ECCV 2022] & [IJCV 2024] Official implementation of the paper: Audio-Visual Segmentation (with Semantics)
audio-visual-learning audio-visual-segmentation multi-modal-segmentation segmentation-benchmark
Last synced: 16 Mar 2025
https://github.com/OpenNLPLab/AVSBench
[ECCV 2022] & [IJCV 2024] Official implementation of the paper: Audio-Visual Segmentation (with Semantics)
audio-visual-learning audio-visual-segmentation multi-modal-segmentation segmentation-benchmark
Last synced: 14 Jul 2025
https://github.com/yapengtian/ave-eccv18
Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018
audio-visual-events audio-visual-learning ave-dataset eccv-2018
Last synced: 06 Apr 2025
https://github.com/alvinliu0/HA2G
[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"
audio-visual-learning co-speech-gesture cvpr2022
Last synced: 23 Apr 2025
https://github.com/ttgeng233/UnAV
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)
audio-visual-events audio-visual-learning multi-modal-learning
Last synced: 09 May 2025
https://github.com/ttgeng233/unav
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)
audio-visual-events audio-visual-learning multi-modal-learning
Last synced: 26 Sep 2025
https://github.com/praveena2j/jointcrossattentional-av-fusion
ABAW3 (CVPRW): A Joint Cross-Attention Model for Audio-Visual Fusion in Dimensional Emotion Recognition
affective-computing attention-model audio-visual-learning emotion emotion-recognition multimodal-learning
Last synced: 30 Jul 2025
https://github.com/praveena2j/joint-cross-attention-for-audio-visual-fusion
IEEE T-BIOM: "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"
affective-computing attention attention-model audio-visual-learning emotion-recognition multimodal-learning
Last synced: 12 Apr 2025
https://github.com/praveena2j/cross-attentional-av-fusion
FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition
affective-computing attention-model audio-visual-learning emotion emotion-recognition multimodal-learning
Last synced: 12 Apr 2025
https://github.com/praveena2j/recurrentjointattentionwithlstms
ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"
affective-computing attention-model audio-visual-learning emotion-recognition multimodal-learning
Last synced: 12 Apr 2025
https://github.com/praveena2j/rjcaforspeakerverification
[FG 2024] "Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention"
attention attention-model audio-visual-learning multimodal-learning speaker-verification
Last synced: 24 Sep 2025
https://github.com/praveena2j/dynamic-crossattention
IEEE ICME: "Cross-Attention is not always needed: Dynamic Cross-Attention for Audio-Visual Dimensional Emotion Recognition"
affective-computing ai attention attention-mechanism attention-model audio-visual-learning computer-vision cross-attention emotion-recognition multimodal-learning
Last synced: 16 Oct 2025