Projects in Awesome Lists by haoheliu
A curated list of projects in awesome lists by haoheliu.
https://github.com/haoheliu/audioldm
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Last synced: 13 May 2025
https://github.com/haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Last synced: 27 Mar 2025
https://github.com/haoheliu/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Last synced: 14 May 2025
https://github.com/haoheliu/voicefixer
General Speech Restoration
declipping denoise dereverberation mel speech speech-analysis speech-enhancement speech-processing speech-synthesis super-resolution tts vocoder
Last synced: 14 May 2025
https://github.com/haoheliu/audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
Last synced: 07 Apr 2025
https://github.com/haoheliu/voicefixer_main
General Speech Restoration
machine-learning speech speech-analysis speech-enhancement speech-processing speech-synthesis speech-to-text tts
Last synced: 06 Apr 2025
https://github.com/haoheliu/audioldm-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
audiogeneration diffusion-models
Last synced: 12 Apr 2025
https://github.com/haoheliu/AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
audiogeneration diffusion-models
Last synced: 09 Dec 2024
https://github.com/haoheliu/semanticodec-inference
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.
Last synced: 09 Apr 2025
https://github.com/haoheliu/ssr_eval
Evaluation and Benchmarking of Speech Super-resolution Methods
Last synced: 09 Apr 2025
https://github.com/haoheliu/2021-ismir-mss-challenge-cws-presunet
Music Source Separation: training, evaluation, and inference pipelines, plus the pretrained models we used for the 2021 ISMIR MDX Challenge.
Last synced: 14 May 2025
https://github.com/haoheliu/torchsubband
PyTorch implementation of subband decomposition
deep-learning music-source-separation signal-processing speech-enhancement speech-processing speech-recognition
Last synced: 13 Mar 2025
https://github.com/haoheliu/subband-music-separation
PyTorch: Channel-wise subband (CWS) input for better voice and accompaniment separation
Last synced: 14 May 2025
https://github.com/haoheliu/diffres-python
Learning differentiable temporal resolution on time-series data.
Last synced: 14 May 2025
https://github.com/haoheliu/dcase_2022_task_5
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
Last synced: 14 May 2025
https://github.com/haoheliu/courseproject_compiler
Java implementation of the NWPU Compiler course project (Northwestern Polytechnical University, Principles of Compilers, pilot class)
compiler-design coursework mips recursive-descent
Last synced: 14 May 2025
https://github.com/haoheliu/key-word-spotting-dnn-gru-dscnn
Keyword spotting with GRU/DNN/DSCNN
dnn gru key-word-spotting tensorflow tensorflow-experiments
Last synced: 09 Feb 2025
https://github.com/haoheliu/channel-wise-subband-input
Demos for the paper: Channel-wise Subband Input for Better Voice and Accompaniment Separation on High Resolution Music
Last synced: 13 Mar 2025
https://github.com/haoheliu/demopage-voicefixer
Voicefixer is a speech restoration model that handles noise, reverberation, low resolution (2kHz~44.1kHz), and clipping (0.1-1.0 threshold) distortion simultaneously.
Last synced: 13 Mar 2025
https://github.com/haoheliu/ai-paper-digest
Digest of the daily arXiv AI papers from three domains: CV, NLP, and Sound
Last synced: 13 May 2025
https://github.com/haoheliu/classification_mnist
PyTorch CNN classification on MNIST
Last synced: 13 Mar 2025
https://github.com/haoheliu/courseproject_computerarchitecture
NWPU computer architecture course project
Last synced: 13 Mar 2025