Projects in Awesome Lists by X-LANCE
A curated list of projects in awesome lists by X-LANCE.
https://github.com/x-lance/anitalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Last synced: 15 May 2025
https://github.com/X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Last synced: 11 Apr 2025
https://github.com/x-lance/slam-llm
Speech, Language, Audio, Music Processing with Large Language Model
audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing
Last synced: 15 May 2025
https://github.com/X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing
Last synced: 06 Jan 2025
https://github.com/x-lance/voiceflow-tts
[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
conditional-flow-matching generative-models probabilistic-models rectified-flow-matching speech-synthesis text-to-speech tts
Last synced: 06 Apr 2025
https://github.com/X-LANCE/text2sql-lgesql
[ACL 2021] This is the project containing source code and pre-trained models for the ACL 2021 long paper "LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations".
database heterogeneous-graph-neural-network natural-language-interface semantic-parsing structured-prediction text-to-sql
Last synced: 10 Dec 2024
https://github.com/x-lance/text2sql-lgesql
[ACL 2021] This is the project containing source code and pre-trained models for the ACL 2021 long paper "LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations".
database heterogeneous-graph-neural-network natural-language-interface semantic-parsing structured-prediction text-to-sql
Last synced: 02 May 2025
https://github.com/x-lance/storytts
[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
Last synced: 01 Mar 2025
https://github.com/x-lance/unicats-ctx-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
self-supervised-speech semantic-token speech-synthesis unicats vocoder vocoding
Last synced: 15 Dec 2024
https://github.com/x-lance/unicats-ctx-txt2vec
[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
acoustic-model ctx-txt2vec speech-synthesis text-to-speech tts unicats vq-diffusion
Last synced: 13 Apr 2025
https://github.com/x-lance/websrc-baseline
[EMNLP 2021] The baseline code for WebSRC dataset.
Last synced: 01 Mar 2025
https://github.com/x-lance/msdwild
[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
Last synced: 01 Mar 2025
https://github.com/x-lance/mobile-env
A Universal Platform for Training and Evaluation of Mobile Interaction
decision-making information-ui infoui interaction-platform nlp rl-environments rl-platform
Last synced: 09 Apr 2025
https://github.com/X-LANCE/Mobile-Env
A Universal Platform for Training and Evaluation of Mobile Interaction
decision-making information-ui infoui interaction-platform nlp rl-environments rl-platform
Last synced: 22 Apr 2025
https://github.com/x-lance/text2sql-gpt
[EMNLP 2023 Findings] ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought
Last synced: 11 Jan 2025
https://github.com/x-lance/public_talks
Materials of public talks given by SJTU X-LANCE members
Last synced: 01 Mar 2025
https://github.com/x-lance/weblm
[WSDM 2024] Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
Last synced: 01 Mar 2025
https://github.com/x-lance/suzhou-tutorials
A brief tutorial and startup scripts for the Suzhou clusters, for members of speechlab
Last synced: 02 May 2025
https://github.com/x-lance/meta-gui-baseline
[EMNLP 2022] The baseline code for META-GUI dataset
Last synced: 02 May 2025
https://github.com/x-lance/text2sql-multiturn-gpt
[NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
Last synced: 11 Jan 2025
https://github.com/x-lance/mivs_birgat
[ICASSP 2024] A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames
Last synced: 11 Jan 2025
https://github.com/x-lance/medical-dataset
[ACL 2023 Findings] CSS: A Large-scale Cross-schema Chinese Text-to-SQL Medical Dataset
Last synced: 11 Jan 2025
https://github.com/x-lance/websrc
[EMNLP 2021] WebSRC: A dataset for web-based structural machine reading comprehension.
Last synced: 02 May 2025
https://github.com/x-lance/mbs
[COLING 2024] Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind
Last synced: 11 Jan 2025
https://github.com/x-lance/d4
[EMNLP 2022] D4: a Chinese Dialogue Dataset for Depression-Diagnosis-Oriented Chat
Last synced: 01 Mar 2025
https://github.com/x-lance/speechlab-sjtu.github.io
Home page for speechlab.
Last synced: 01 Mar 2025
https://github.com/x-lance/meta-gui-leaderboard
[EMNLP 2022] Leaderboard of META-GUI
Last synced: 01 Mar 2025
https://github.com/x-lance/psyagents
An Open-Source Psychotherapy Simulation Platform with Interactive Agents
Last synced: 01 Mar 2025