Projects in Awesome Lists by THUNLP-MT
A curated list of projects in awesome lists by THUNLP-MT.
https://github.com/THUNLP-MT/MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
machine-translation reading-list
Last synced: 05 May 2025
https://github.com/thunlp-mt/mt-reading-list
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
machine-translation reading-list
Last synced: 24 Jan 2026
https://github.com/thunlp-mt/thumt
An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group
deep-learning machine-translation neural-machine-translation
Last synced: 12 Apr 2025
https://github.com/thunlp-mt/tg-reading-list
A text generation reading list maintained by Tsinghua Natural Language Processing Group.
code paper reading-list text-generation
Last synced: 24 Feb 2026
https://github.com/THUNLP-MT/TG-Reading-List
A text generation reading list maintained by Tsinghua Natural Language Processing Group.
code paper reading-list text-generation
Last synced: 22 Apr 2025
https://github.com/thunlp-mt/document-transformer
Improving the Transformer translation model with document-level context
document-level-translation neural-machine-translation
Last synced: 04 Mar 2026
https://github.com/thunlp-mt/stabletoolbench
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
Last synced: 07 Apr 2025
https://github.com/thunlp-mt/streamingbench
StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding
Last synced: 15 Mar 2026
https://github.com/thunlp-mt/dymean
This repo contains the codes for our paper "End-to-End Full-Atom Antibody Design"
antibody-design drug-discovery generative-ai
Last synced: 06 Apr 2025
https://github.com/thunlp-mt/mean
This repo contains the codes for our paper Conditional Antibody Design as 3D Equivariant Graph Translation.
antibody-design drug-discovery generative-ai
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/mask-align
Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021
machine-translation self-supervised-learning word-alignment
Last synced: 13 Jul 2025
https://github.com/thunlp-mt/pepglad
Codes for our paper "Full-Atom Peptide Design with Geometric Latent Diffusion" (NeurIPS 2024)
drug-discovery generative-ai peptide-design protein-design
Last synced: 28 Oct 2025
https://github.com/thunlp-mt/thucc
An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group
classical-chinese traditional-chinese
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/ps-vae
This repo contains the codes for our paper: Molecule Generation by Principal Subgraph Mining and Assembling.
drug-discovery fragment-based-drug-discovery molecule-generation
Last synced: 07 Jul 2025
https://github.com/thunlp-mt/get
This repo contains the codes for our paper "Generalist Equivariant Transformer Towards 3D Molecular Interaction Learning" (ICML 2024).
affinity-prediction graph-transformer unified-molecules
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/modelcompose
Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/skr
Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)
Last synced: 04 Sep 2025
https://github.com/thunlp-mt/plm4mt
Code for our work "MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators" in ACL 2022
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/escapecraft
Official repo for EscapeCraft (a 3D environment for room escape) and benchmark MM-Escape. This work was accepted by ICCV 2025.
Last synced: 11 Sep 2025
https://github.com/thunlp-mt/directquote
A Dataset for Direct Quotation Extraction and Attribution in News Articles.
Last synced: 05 Oct 2025
https://github.com/thunlp-mt/mt-toolkit-list
A list of machine translation open-source toolkits maintained by Tsinghua Natural Language Processing Group
machine-translation open-source-toolkits
Last synced: 24 Feb 2026
https://github.com/thunlp-mt/trice
Code for our paper "Transfer Learning for Sequence Generation: from Single-source to Multi-source" in ACL 2021.
automatic-post-editing finetuning machine-translation multi-source-translation natural-language-processing
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/pr4nmt
Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization
neu neural-machine-translation prior-knowledge-integration
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/l2copy4ape
Learning to Copy for Automatic Post-Editing (EMNLP 2019)
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/codis
Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/promptgating4mctg
This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023).
Last synced: 04 Sep 2025
https://github.com/thunlp-mt/pgra
Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks
Last synced: 04 Sep 2025
https://github.com/thunlp-mt/ubilexat
An Unsupervised Bilingual Lexicon Inducer From Non-Parallel Data by Adversarial Training
bilingual-lexicon-extraction bilingual-word-embedding
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/fiig
Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions (EMNLP 2023 Findings)
Last synced: 08 Nov 2025
https://github.com/thunlp-mt/museg
Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".
Last synced: 23 Aug 2025
https://github.com/thunlp-mt/tran
This is the repo for our work “Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation” (EMNLP 2023).
Last synced: 04 Sep 2025
https://github.com/thunlp-mt/selfsupervisedqe
Self-Supervised Quality Estimation for Machine Translation
Last synced: 30 Jun 2025
https://github.com/thunlp-mt/adamms
Official Repository for "AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization" [CVPR2025]
Last synced: 20 Feb 2026
https://github.com/thunlp-mt/symbol2language
Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models
chain-of-thought large-language-models nlp reasoning symbolic-expressions
Last synced: 26 Jan 2026
https://github.com/thunlp-mt/ubilexemd
An Unsupervised Bilingual Lexicon Inducer From Non-Parallel Data by Earth Mover's Distance Minimization
bilingual-lexicon-extraction bilingual-word-embedding
Last synced: 18 Jul 2025
https://github.com/thunlp-mt/bilex
A Bilingual Lexicon Inducer From Non-Parallel Data
bilingual-lexicon-extraction bilingual-word-embedding
Last synced: 14 Apr 2025
https://github.com/thunlp-mt/rogo
This repo contains the codes for our work “Restricted orthogonal gradient projection for continual learning”.
Last synced: 10 Oct 2025
https://github.com/thunlp-mt/voting4sc
Modeling Voting for System Combination in Machine Translation (IJCAI 2020)
Last synced: 26 Oct 2025
https://github.com/thunlp-mt/extagents
Public code repository for "Scaling External Knowledge Input Beyond the Context Length of LLMs via Multi-Agent Collaboration"
Last synced: 28 Jul 2025
https://github.com/thunlp-mt/metaranking
Official code repo for our work "Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement".
Last synced: 19 Mar 2026
https://github.com/thunlp-mt/align
This is the official repo for paper "Agent-Environment Alignment via Automated Interface Generation"
Last synced: 07 Jul 2025
https://github.com/thunlp-mt/transformer-dmb
Codes for our paper "Dynamic Multi-Branch Layers for On-Device Neural Machine Translation" in TASLP
Last synced: 24 Feb 2026
https://github.com/thunlp-mt/mt-dataset-list
A list of machine translation datasets maintained by Tsinghua Natural Language Processing Group
Last synced: 09 Feb 2026
https://github.com/thunlp-mt/ckd
Continual Knowledge Distillation for Neural Machine Translation
Last synced: 24 Feb 2026
https://github.com/thunlp-mt/collabuiagents
The official implementation of paper "Enhancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization".
Last synced: 11 Mar 2026
https://github.com/thunlp-mt/vat
This is the official repo for paper "Visual Abstract Thinking Empowers Multimodal Reasoning"
Last synced: 07 Oct 2025
https://github.com/thunlp-mt/cospace
Official Repository for "CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models" [CVPR2025]
Last synced: 19 Mar 2026