An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by THUNLP-MT

A curated list of projects in awesome lists by THUNLP-MT .

https://github.com/THUNLP-MT/MT-Reading-List

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

machine-translation reading-list

Last synced: 05 May 2025

https://github.com/thunlp-mt/mt-reading-list

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

machine-translation reading-list

Last synced: 24 Jan 2026

https://github.com/thunlp-mt/thumt

An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group

deep-learning machine-translation neural-machine-translation

Last synced: 12 Apr 2025

https://github.com/thunlp-mt/tg-reading-list

A text generation reading list maintained by Tsinghua Natural Language Processing Group.

code paper reading-list text-generation

Last synced: 24 Feb 2026

https://github.com/THUNLP-MT/TG-Reading-List

A text generation reading list maintained by Tsinghua Natural Language Processing Group.

code paper reading-list text-generation

Last synced: 22 Apr 2025

https://github.com/thunlp-mt/document-transformer

Improving the Transformer translation model with document-level context

document-level-translation neural-machine-translation

Last synced: 04 Mar 2026

https://github.com/thunlp-mt/stabletoolbench

A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.

llm tool-learning

Last synced: 07 Apr 2025

https://github.com/thunlp-mt/streamingbench

StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

Last synced: 15 Mar 2026

https://github.com/thunlp-mt/dymean

This repo contains the codes for our paper "End-to-End Full-Atom Antibody Design"

antibody-design drug-discovery generative-ai

Last synced: 06 Apr 2025

https://github.com/thunlp-mt/mean

This repo contains the codes for our paper Conditional Antibody Design as 3D Equivariant Graph Translation.

antibody-design drug-discovery generative-ai

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/mask-align

Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021

machine-translation self-supervised-learning word-alignment

Last synced: 13 Jul 2025

https://github.com/thunlp-mt/pepglad

Codes for our paper "Full-Atom Peptide Design with Geometric Latent Diffusion" (NeurIPS 2024)

drug-discovery generative-ai peptide-design protein-design

Last synced: 28 Oct 2025

https://github.com/thunlp-mt/thucc

An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group

classical-chinese traditional-chinese

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/ps-vae

This repo contains the codes for our paper: Molecule Generation by Principal Subgraph Mining and Assembling.

drug-discovery fragment-based-drug-discovery molecule-generation

Last synced: 07 Jul 2025

https://github.com/thunlp-mt/scaffold

Scaffold Prompting to promote LMMs

Last synced: 16 Oct 2025

https://github.com/thunlp-mt/get

This repo contains the codes for our paper "Generalist Equivariant Transformer Towards 3D Molecular Interaction Learning" (ICML 2024).

affinity-prediction graph-transformer unified-molecules

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/modelcompose

Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/skr

Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)

Last synced: 04 Sep 2025

https://github.com/thunlp-mt/plm4mt

Code for our work "MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators" in ACL 2022

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/uce4bt

Last synced: 03 Mar 2026

https://github.com/thunlp-mt/escapecraft

Official repo for EscapeCraft (an 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.

Last synced: 11 Sep 2025

https://github.com/thunlp-mt/directquote

A Dataset for Direct Quotation Extraction and Attribution in News Articles.

Last synced: 05 Oct 2025

https://github.com/thunlp-mt/mt-toolkit-list

A list of machine translation open-source toolkits maintained by Tsinghua Natural Language Processing Group

machine-translation open-source-toolkits

Last synced: 24 Feb 2026

https://github.com/thunlp-mt/brote

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/trice

Code for our paper "Transfer Learning for Sequence Generation: from Single-source to Multi-source" in ACL 2021.

automatic-post-editing finetuning machine-translation multi-source-translation natural-language-processing

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/pr4nmt

Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization

neu neural-machine-translation prior-knowledge-integration

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/l2copy4ape

Learning to Copy for Automatic Post-Editing (EMNLP 2019)

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/codis

Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/promptgating4mctg

This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023).

Last synced: 04 Sep 2025

https://github.com/thunlp-mt/pgra

Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks

Last synced: 04 Sep 2025

https://github.com/thunlp-mt/ktnmt

Last synced: 03 Oct 2025

https://github.com/thunlp-mt/ubilexat

An Unsupervised Bilingual Lexicon Inducer From Non-Parallel Data by Adversarial Training

bilingual-lexicon-extraction bilingual-word-embedding

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/fiig

Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions (EMNLP 2023 Findings)

Last synced: 08 Nov 2025

https://github.com/thunlp-mt/museg

Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".

Last synced: 23 Aug 2025

https://github.com/thunlp-mt/tran

This is the repo for our work “Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation” (EMNLP 2023).

Last synced: 04 Sep 2025

https://github.com/thunlp-mt/selfsupervisedqe

Self-Supervised Quality Estimation for Machine Translation

Last synced: 30 Jun 2025

https://github.com/thunlp-mt/adamms

Official Repository for "AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization" [CVPR2025]

Last synced: 20 Feb 2026

https://github.com/thunlp-mt/symbol2language

Speak It Out: Solving Symbol-Related Problems with Symbol-to-Language Conversion for Language Models

chain-of-thought large-language-models nlp reasoning symbolic-expressions

Last synced: 26 Jan 2026

https://github.com/thunlp-mt/ubilexemd

An Unsupervised Bilingual Lexicon Inducer From Non-Parallel Data by Earth Mover's Distance Minimization

bilingual-lexicon-extraction bilingual-word-embedding

Last synced: 18 Jul 2025

https://github.com/thunlp-mt/bilex

A Bilingual Lexicon Inducer From Non-Parallel Data

bilingual-lexicon-extraction bilingual-word-embedding

Last synced: 14 Apr 2025

https://github.com/thunlp-mt/rogo

This repo contains the codes for our work “Restricted orthogonal gradient projection for continual learning”.

Last synced: 10 Oct 2025

https://github.com/thunlp-mt/deem

Last synced: 04 Feb 2026

https://github.com/thunlp-mt/voting4sc

Modeling Voting for System Combination in Machine Translation (IJCAI 2020)

Last synced: 26 Oct 2025

https://github.com/thunlp-mt/extagents

Public code repository for "Scaling External Knowledge Input Beyond the Context Length of LLMs via Multi-Agent Collaboration"

Last synced: 28 Jul 2025

https://github.com/thunlp-mt/actiview

Last synced: 04 Sep 2025

https://github.com/thunlp-mt/metaranking

Official code repo for our work "Meta Ranking: Less Capable Language Models are Capable for Single Response Judgement".

Last synced: 19 Mar 2026

https://github.com/thunlp-mt/align

This is the official repo for paper "Agent-Environment Alignment via Automated Interface Generation"

Last synced: 07 Jul 2025

https://github.com/thunlp-mt/ric

Last synced: 18 Oct 2025

https://github.com/thunlp-mt/transformer-dmb

Codes for our paper "Dynamic Multi-Branch Layers for On-Device Neural Machine Translation" in TASLP

Last synced: 24 Feb 2026

https://github.com/thunlp-mt/mt-dataset-list

A list machine translation datasets maintained by Tsinghua Natural Language Processing Group

Last synced: 09 Feb 2026

https://github.com/thunlp-mt/panda

Last synced: 24 Feb 2026

https://github.com/thunlp-mt/ckd

Continual Knowledge Distillation for Neural Machine Translation

Last synced: 24 Feb 2026

https://github.com/thunlp-mt/collabuiagents

The official implementation of paper "Enhancing Language Multi-Agent Learning with Credit Re-Assignment for Interactive Environment Generalization".

Last synced: 11 Mar 2026

https://github.com/thunlp-mt/vat

This is the official repo for paper "Visual Abstract Thinking Empowers Multimodal Reasoning"

Last synced: 07 Oct 2025

https://github.com/thunlp-mt/cospace

Official Repository for “CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models" [CVPR2025]

Last synced: 19 Mar 2026