Projects in Awesome Lists tagged with pretrained-language-model

https://github.com/wenge-research/yayi2

YAYI 2 是中科闻歌研发的新一代开源大语言模型，采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)

artificial-intelligence chat chinese gpt natural-language-generation pretrained-language-model yayi

Last synced: 30 Mar 2025

https://github.com/wenge-research/YAYI2

YAYI 2 是中科闻歌研发的新一代开源大语言模型，采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)

artificial-intelligence chat chinese gpt natural-language-generation pretrained-language-model yayi

Last synced: 09 May 2025

https://github.com/microsoft/torchscale

Foundation Architecture for (M)LLMs

computer-vision machine-learning multimodal natural-language-processing pretrained-language-model speech-processing transformer translation

Last synced: 14 May 2025

https://github.com/thudm/p-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

natural-language-processing p-tuning parameter-efficient-learning pretrained-language-model prompt-tuning

Last synced: 15 May 2025

https://github.com/THUDM/P-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

natural-language-processing p-tuning parameter-efficient-learning pretrained-language-model prompt-tuning

Last synced: 14 Mar 2025

https://github.com/thunlp/opendelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

deep-learning nlp nlp-library parameter-efficient-learning pretrained-language-model

Last synced: 15 May 2025

https://github.com/thunlp/OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

deep-learning nlp nlp-library parameter-efficient-learning pretrained-language-model

Last synced: 09 May 2025

https://github.com/xcfcode/Summarization-Papers

Summarization Papers

chatgpt natural-language-processing nlp pretrained-language-model summarization text-generation

Last synced: 05 May 2025

https://github.com/AndrewZhe/lawyer-llama

中文法律LLaMA (LLaMA for Chinese legel domain)

alpaca large-language-models legal-ai llama llm nlp plm pretrained-language-model pretrained-models

Last synced: 01 Apr 2025

https://github.com/allenai/dont-stop-pretraining

Code associated with the Don't Stop Pretraining ACL 2020 paper

natural-language-processing pretrained-language-model

Last synced: 13 Oct 2025

https://github.com/gaoisbest/NLP-Projects

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding

dialogue-systems information-extraction information-retrieval knowledge-graph machine-reading-comprehension network-embedding pretrained-language-model sentence2vec sequence-labeling text-classification text-generation word2vec

Last synced: 07 Apr 2025

https://github.com/OpenBMB/CPM-Live

Live Training for Open-source Big Models

deep-learning multi-task-learning natural-language-generation natural-language-processing natural-language-understanding nlp parameter-efficient-learning pretrained-language-model

Last synced: 22 Jul 2025

https://github.com/lyh-yf/mwptoolkit

MWPToolkit is an open-source framework for math word problem(MWP) solvers.

deep-learning graph-to-tree math-word-problem pretrained-language-model pytorch sequence-to-sequence sequence-to-tree

Last synced: 08 Oct 2025

https://github.com/thunlp/prompt-transferability

On Transferability of Prompt Tuning for Natural Language Processing

nlp parameter-efficient-learning parameter-efficient-tuning pretrained-language-model pretrained-language-models pretrained-models prompt prompt-tuning pytorch transfer-learning

Last synced: 25 Apr 2025

https://github.com/thunlp/Prompt-Transferability

On Transferability of Prompt Tuning for Natural Language Processing

nlp parameter-efficient-learning parameter-efficient-tuning pretrained-language-model pretrained-language-models pretrained-models prompt prompt-tuning pytorch transfer-learning

Last synced: 21 Jun 2025

https://github.com/sjtu-ipads/bamboo

Bamboo-7B Large Language Model

large-language-models llm powerinfer pretrained-language-model pretrained-models sparse-llm

Last synced: 14 Oct 2025

https://github.com/EagleW/Scientific-Inspiration-Machines-Optimized-for-Novelty

Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty

acl2024 gpt4 hypothesis-generation llm pretrained-language-model pytorch retrieval-augmented-generation text-generation

Last synced: 14 Oct 2025

https://github.com/franxyao/poincareprobe

Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces

bert bert-embeddings bert-model bertology hyperbolic hyperbolic-embeddings hyperbolic-geometry pretrained-language-model probing probing-tasks

Last synced: 30 Apr 2025

https://github.com/heraclex12/nlp2sparql

Translate Natural Language Processing to SPARQL Query and vice versa

bert2bert knowledge-base machine-translation pretrained-language-model question-answering sparql spbert

Last synced: 19 Apr 2025

https://github.com/ganjinzero/coder

CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]

embeddings medical multi-language nlp pretrained-language-model umls

Last synced: 25 Oct 2025

https://github.com/ganjinzero/biobart

BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]

biomedical generative pretrained-language-model

Last synced: 13 Jul 2025

https://github.com/thunlp/cokebert

CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models

bert knowledge-graph nlp pretrained-language-model pytorch

Last synced: 25 Apr 2025

https://github.com/engineeringsoftware/coditt5

CoditT5: Pretraining for Source Code and Natural Language Editing

machine-learning pretrained-language-model software-engineering

Last synced: 12 May 2025

https://github.com/wxl1999/unicrs

[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".

conversation conversational-ai conversational-bots dialog dialogue dialogue-systems pretrained-language-model pretrained-models pretraining prompt prompt-tuning prompts recommendation recommendation-system recommender-system

Last synced: 26 Oct 2025

https://github.com/yingyuankai/AiSpace

AiSpace: Better practices for deep learning model development and deployment For Tensorflow 2.0

aispace albert-chinese bert chinese clue cmrc2018 dureader electra ernie lr-finder pretrained-language-model pretrained-models roberta-chinese swa tensorflow2 xlnet

Last synced: 18 Mar 2025

https://github.com/rucaibox/elmer

This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation

non-autoregressive-translation pretrained-language-model text-generation

Last synced: 17 Sep 2025

https://github.com/megagonlabs/cocosum

:coconut: Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)

co-decoding comparative-opinion-summarization natural-language-generation opinion-summarization pretrained-language-model pytorch-lightning text-generation

Last synced: 14 Jun 2025

https://github.com/clarifai/examples

Examples for Clarifai Python SDK and Integrations. Give the repo a star ⭐

clarifai clarifai-python computer-vision examples generative-ai llm natural-language-processing pretrained-language-model pretrained-models python

Last synced: 13 Apr 2025

https://github.com/arianhosseini/negation-learning

code for our paper "Understanding by Understanding Not: Modeling Negation in Language Models"

bert language-model negation pretrained-language-model pytorch transformer

Last synced: 15 Mar 2025

https://github.com/imkett/resee

[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation

dataset dialogue-systems emnlp2023 multimodal-dialogue pretrained-language-model transformers visual-dialogue

Last synced: 21 Mar 2025

https://github.com/wxl1999/cfcrs

[KDD23] Official PyTorch implementation for "Improving Conversational Recommendation Systems via Counterfactual Data Simulation".

conversation conversational-ai conversational-bots conversational-recommendation conversational-recommender-system data-augmentation data-augmentation-strategies data-augmentations dialog dialogue dialogue-systems pretrained-language-model pretrained-models pretraining recommendation recommendation-system recommender-system

Last synced: 10 Oct 2025

https://github.com/imsanko/image_caption_generator_with_transformers

This repository contains code for generating captions for images using a Transformer-based model. The model used is the `VisionEncoderDecoderModel` from the Hugging Face Transformers library, specifically the `nlpconnect/vit-gpt2-image-captioning` model.

collaboration ipython-notebook jypyternotebook llm pretrained-language-model pthon3 transfromers

Last synced: 27 Oct 2025

https://github.com/eric11eca/reckoning-metakg

RECKONING is a bi-level learning algorithm that improves language models' reasoning ability by folding contextual knowledge into parametric knowledge through back-propagation.

bilevel-optimization complex-reasoning meta-learning pretrained-language-model question-answering

Last synced: 14 Apr 2025

https://github.com/damo-nlp-sg/peerda

Source code of "PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks" (ACL23)

data-augmentation machine-reading-comprehension pretrained-language-model

Last synced: 03 May 2025

https://github.com/xcollab/huggingface

This repository provides an overview of Hugging Face's Transformers library, a powerful tool for natural language processing (NLP) and machine learning tasks.

bert bert-model gpt gpt-models huggingface huggingface-transformers llm llms models pretrained-language-model pretrained-models python transformer transformers-models

Last synced: 10 Apr 2025

https://github.com/apsinghanalytics/finragify_app

An LLM app leveraging RAG with LangChain and GPT-4 mini to analyze earnings call transcripts, assess company performance, using natural language queries (NLP), FAISS (vector database), and Hugging Face re-ranking models.

aws-ec2 cloud-application docker-container earnings-transcripts faiss-vector-database finance fine-tuning gpt-4o-mini huggingface-models langchain-python large-language-model natural-language-processing pretrained-language-model prompt-engineering question-answering-system reranking retrieval-augmented-generation stocks vector-embeddings

Last synced: 05 Apr 2025

https://github.com/sreeeswaran/train-your-llm

This repository contains code and resources for training, fine-tuning, and deploying large language models using Hugging Face's Transformers library.

artificial-intelligence deep-learning language-model large-language-model large-language-models llm llm-training llms machine-learning model-training nlp pretrained-language-model pretrained-models training

Last synced: 13 Jul 2025

https://github.com/cai991108/machine-learning-and-language-model

This project explores GPT-2 and Llama models through pre-training, fine-tuning, and Chain-of-Thought (CoT) prompting. It includes memory-efficient optimizations (SGD, LoRA, BAdam) and evaluations on math datasets (GSM8K, NumGLUE, StimulEq, SVAMP).

chainofthought finetune-llm gpt2 llama llm llm-inference pretrained-language-model

Last synced: 13 Nov 2025

https://github.com/shreydan/masked-language-modeling

Transformers Pre-Training with MLM objective — implemented encoder-only model and trained from scratch on Wikipedia dataset.

masked-language-models nlp pretrained-language-model pytorch transformers

Last synced: 15 May 2025

https://github.com/zobayerakib/transfer-learning-for-nlp-with-tensorflow-hub

This project demonstrates the use of various pre-trained models for transfer learning in NLP using TensorFlow Hub.

fine-tuning natural-language-processing nlp pretrained-language-model pretrained-models quora-insincere-questions-classification tensorboard-visualizations tensorflowhub transfer-learning

Last synced: 29 Dec 2025

https://github.com/hojat72elect/imdb_storyline_summaries_database

The database IMDB storylines and their summaries

computational-linguistics computer-science csv data-science dataset machine-learning natural-language-processing nlp pretrained-language-model python science tplm

Last synced: 04 Oct 2025

https://github.com/snigdho8869/text-summarizer-flask

This repository contains a Flask-based web application that utilizes the BART, GPT-2 pretrained models for text summarization.

abstractive-summarization bart deep-learning fine-tuning gpt-2 huggingface huggingface-transformers machine-learning natural-language-processing nlp pretrained-language-model pretrained-models text-generation text-summarization transformers

Last synced: 24 Jun 2025

https://github.com/20101301-alina-hasan/robust-fake-review-detection-using-uncertainty-aware-lstm-and-bert

Our study utilizes BERT and LSTM models alongside Monte Carlo Dropout (MCD) on the Yelp Labelled Dataset. MCD bolsters robustness by introducing uncertainty through neuron dropout. The BERT-embedded MCD achieves an impressive 91.75% accuracy, surpassing the LSTM model.

artificial-intelligence bert fake-review-detection language-model long-short-term-memory lstm natural-language-preprocessing neural-network pretrained-language-model yelp-dataset

Last synced: 17 Jul 2025

https://github.com/am-ankitgit/complete-deep-learning-algorithms

deep-learning machine-learning

artificial-intelligence backprogation cnn-lstm cnn-model cnn-text-classification deep-learning deep-neural-networks forward-propagation keras-classification-models keras-tensorflow keras-tuner lstm-neural-networks pretrained-language-model pretrained-models python rnn-tensorflow tensorflow tensorflow2

Last synced: 20 Jun 2025