Projects in Awesome Lists tagged with pretrained-language-model
A curated list of projects in awesome lists tagged with pretrained-language-model .
https://github.com/wenge-research/yayi2
YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)
artificial-intelligence chat chinese gpt natural-language-generation pretrained-language-model yayi
Last synced: 30 Mar 2025
https://github.com/wenge-research/YAYI2
YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)
artificial-intelligence chat chinese gpt natural-language-generation pretrained-language-model yayi
Last synced: 09 May 2025
https://github.com/microsoft/torchscale
Foundation Architecture for (M)LLMs
computer-vision machine-learning multimodal natural-language-processing pretrained-language-model speech-processing transformer translation
Last synced: 14 May 2025
https://github.com/thudm/p-tuning-v2
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
natural-language-processing p-tuning parameter-efficient-learning pretrained-language-model prompt-tuning
Last synced: 15 May 2025
https://github.com/THUDM/P-tuning-v2
An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
natural-language-processing p-tuning parameter-efficient-learning pretrained-language-model prompt-tuning
Last synced: 14 Mar 2025
https://github.com/thunlp/opendelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
deep-learning nlp nlp-library parameter-efficient-learning pretrained-language-model
Last synced: 15 May 2025
https://github.com/thunlp/OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
deep-learning nlp nlp-library parameter-efficient-learning pretrained-language-model
Last synced: 09 May 2025
https://github.com/xcfcode/Summarization-Papers
Summarization Papers
chatgpt natural-language-processing nlp pretrained-language-model summarization text-generation
Last synced: 05 May 2025
https://github.com/AndrewZhe/lawyer-llama
中文法律LLaMA (LLaMA for Chinese legel domain)
alpaca large-language-models legal-ai llama llm nlp plm pretrained-language-model pretrained-models
Last synced: 01 Apr 2025
https://github.com/allenai/dont-stop-pretraining
Code associated with the Don't Stop Pretraining ACL 2020 paper
natural-language-processing pretrained-language-model
Last synced: 13 Oct 2025
https://github.com/gaoisbest/NLP-Projects
word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding
dialogue-systems information-extraction information-retrieval knowledge-graph machine-reading-comprehension network-embedding pretrained-language-model sentence2vec sequence-labeling text-classification text-generation word2vec
Last synced: 07 Apr 2025
https://github.com/OpenBMB/CPM-Live
Live Training for Open-source Big Models
deep-learning multi-task-learning natural-language-generation natural-language-processing natural-language-understanding nlp parameter-efficient-learning pretrained-language-model
Last synced: 22 Jul 2025
https://github.com/lyh-yf/mwptoolkit
MWPToolkit is an open-source framework for math word problem(MWP) solvers.
deep-learning graph-to-tree math-word-problem pretrained-language-model pytorch sequence-to-sequence sequence-to-tree
Last synced: 08 Oct 2025
https://github.com/thunlp/prompt-transferability
On Transferability of Prompt Tuning for Natural Language Processing
nlp parameter-efficient-learning parameter-efficient-tuning pretrained-language-model pretrained-language-models pretrained-models prompt prompt-tuning pytorch transfer-learning
Last synced: 25 Apr 2025
https://github.com/thunlp/Prompt-Transferability
On Transferability of Prompt Tuning for Natural Language Processing
nlp parameter-efficient-learning parameter-efficient-tuning pretrained-language-model pretrained-language-models pretrained-models prompt prompt-tuning pytorch transfer-learning
Last synced: 21 Jun 2025
https://github.com/sjtu-ipads/bamboo
Bamboo-7B Large Language Model
large-language-models llm powerinfer pretrained-language-model pretrained-models sparse-llm
Last synced: 14 Oct 2025
https://github.com/EagleW/Scientific-Inspiration-Machines-Optimized-for-Novelty
Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty
acl2024 gpt4 hypothesis-generation llm pretrained-language-model pytorch retrieval-augmented-generation text-generation
Last synced: 14 Oct 2025
https://github.com/franxyao/poincareprobe
Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces
bert bert-embeddings bert-model bertology hyperbolic hyperbolic-embeddings hyperbolic-geometry pretrained-language-model probing probing-tasks
Last synced: 30 Apr 2025
https://github.com/heraclex12/nlp2sparql
Translate Natural Language Processing to SPARQL Query and vice versa
bert2bert knowledge-base machine-translation pretrained-language-model question-answering sparql spbert
Last synced: 19 Apr 2025
https://github.com/ganjinzero/coder
CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
embeddings medical multi-language nlp pretrained-language-model umls
Last synced: 25 Oct 2025
https://github.com/ganjinzero/biobart
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]
biomedical generative pretrained-language-model
Last synced: 13 Jul 2025
https://github.com/thunlp/cokebert
CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models
bert knowledge-graph nlp pretrained-language-model pytorch
Last synced: 25 Apr 2025
https://github.com/engineeringsoftware/coditt5
CoditT5: Pretraining for Source Code and Natural Language Editing
machine-learning pretrained-language-model software-engineering
Last synced: 12 May 2025
https://github.com/wxl1999/unicrs
[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".
conversation conversational-ai conversational-bots dialog dialogue dialogue-systems pretrained-language-model pretrained-models pretraining prompt prompt-tuning prompts recommendation recommendation-system recommender-system
Last synced: 26 Oct 2025
https://github.com/yingyuankai/AiSpace
AiSpace: Better practices for deep learning model development and deployment For Tensorflow 2.0
aispace albert-chinese bert chinese clue cmrc2018 dureader electra ernie lr-finder pretrained-language-model pretrained-models roberta-chinese swa tensorflow2 xlnet
Last synced: 18 Mar 2025
https://github.com/rucaibox/elmer
This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation
non-autoregressive-translation pretrained-language-model text-generation
Last synced: 17 Sep 2025
https://github.com/megagonlabs/cocosum
:coconut: Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)
co-decoding comparative-opinion-summarization natural-language-generation opinion-summarization pretrained-language-model pytorch-lightning text-generation
Last synced: 14 Jun 2025
https://github.com/clarifai/examples
Examples for Clarifai Python SDK and Integrations. Give the repo a star ⭐
clarifai clarifai-python computer-vision examples generative-ai llm natural-language-processing pretrained-language-model pretrained-models python
Last synced: 13 Apr 2025
https://github.com/arianhosseini/negation-learning
code for our paper "Understanding by Understanding Not: Modeling Negation in Language Models"
bert language-model negation pretrained-language-model pytorch transformer
Last synced: 15 Mar 2025
https://github.com/imkett/resee
[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation
dataset dialogue-systems emnlp2023 multimodal-dialogue pretrained-language-model transformers visual-dialogue
Last synced: 21 Mar 2025
https://github.com/wxl1999/cfcrs
[KDD23] Official PyTorch implementation for "Improving Conversational Recommendation Systems via Counterfactual Data Simulation".
conversation conversational-ai conversational-bots conversational-recommendation conversational-recommender-system data-augmentation data-augmentation-strategies data-augmentations dialog dialogue dialogue-systems pretrained-language-model pretrained-models pretraining recommendation recommendation-system recommender-system
Last synced: 10 Oct 2025
https://github.com/imsanko/image_caption_generator_with_transformers
This repository contains code for generating captions for images using a Transformer-based model. The model used is the `VisionEncoderDecoderModel` from the Hugging Face Transformers library, specifically the `nlpconnect/vit-gpt2-image-captioning` model.
collaboration ipython-notebook jypyternotebook llm pretrained-language-model pthon3 transfromers
Last synced: 27 Oct 2025
https://github.com/eric11eca/reckoning-metakg
RECKONING is a bi-level learning algorithm that improves language models' reasoning ability by folding contextual knowledge into parametric knowledge through back-propagation.
bilevel-optimization complex-reasoning meta-learning pretrained-language-model question-answering
Last synced: 14 Apr 2025
https://github.com/damo-nlp-sg/peerda
Source code of "PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks" (ACL23)
data-augmentation machine-reading-comprehension pretrained-language-model
Last synced: 03 May 2025
https://github.com/xcollab/huggingface
This repository provides an overview of Hugging Face's Transformers library, a powerful tool for natural language processing (NLP) and machine learning tasks.
bert bert-model gpt gpt-models huggingface huggingface-transformers llm llms models pretrained-language-model pretrained-models python transformer transformers-models
Last synced: 10 Apr 2025
https://github.com/apsinghanalytics/finragify_app
An LLM app leveraging RAG with LangChain and GPT-4 mini to analyze earnings call transcripts, assess company performance, using natural language queries (NLP), FAISS (vector database), and Hugging Face re-ranking models.
aws-ec2 cloud-application docker-container earnings-transcripts faiss-vector-database finance fine-tuning gpt-4o-mini huggingface-models langchain-python large-language-model natural-language-processing pretrained-language-model prompt-engineering question-answering-system reranking retrieval-augmented-generation stocks vector-embeddings
Last synced: 05 Apr 2025
https://github.com/sreeeswaran/train-your-llm
This repository contains code and resources for training, fine-tuning, and deploying large language models using Hugging Face's Transformers library.
artificial-intelligence deep-learning language-model large-language-model large-language-models llm llm-training llms machine-learning model-training nlp pretrained-language-model pretrained-models training
Last synced: 13 Jul 2025
https://github.com/cai991108/machine-learning-and-language-model
This project explores GPT-2 and Llama models through pre-training, fine-tuning, and Chain-of-Thought (CoT) prompting. It includes memory-efficient optimizations (SGD, LoRA, BAdam) and evaluations on math datasets (GSM8K, NumGLUE, StimulEq, SVAMP).
chainofthought finetune-llm gpt2 llama llm llm-inference pretrained-language-model
Last synced: 13 Nov 2025
https://github.com/shreydan/masked-language-modeling
Transformers Pre-Training with MLM objective — implemented encoder-only model and trained from scratch on Wikipedia dataset.
masked-language-models nlp pretrained-language-model pytorch transformers
Last synced: 15 May 2025
https://github.com/zobayerakib/transfer-learning-for-nlp-with-tensorflow-hub
This project demonstrates the use of various pre-trained models for transfer learning in NLP using TensorFlow Hub.
fine-tuning natural-language-processing nlp pretrained-language-model pretrained-models quora-insincere-questions-classification tensorboard-visualizations tensorflowhub transfer-learning
Last synced: 29 Dec 2025
https://github.com/hojat72elect/imdb_storyline_summaries_database
The database IMDB storylines and their summaries
computational-linguistics computer-science csv data-science dataset machine-learning natural-language-processing nlp pretrained-language-model python science tplm
Last synced: 04 Oct 2025
https://github.com/snigdho8869/text-summarizer-flask
This repository contains a Flask-based web application that utilizes the BART, GPT-2 pretrained models for text summarization.
abstractive-summarization bart deep-learning fine-tuning gpt-2 huggingface huggingface-transformers machine-learning natural-language-processing nlp pretrained-language-model pretrained-models text-generation text-summarization transformers
Last synced: 24 Jun 2025
https://github.com/20101301-alina-hasan/robust-fake-review-detection-using-uncertainty-aware-lstm-and-bert
Our study utilizes BERT and LSTM models alongside Monte Carlo Dropout (MCD) on the Yelp Labelled Dataset. MCD bolsters robustness by introducing uncertainty through neuron dropout. The BERT-embedded MCD achieves an impressive 91.75% accuracy, surpassing the LSTM model.
artificial-intelligence bert fake-review-detection language-model long-short-term-memory lstm natural-language-preprocessing neural-network pretrained-language-model yelp-dataset
Last synced: 17 Jul 2025
https://github.com/am-ankitgit/complete-deep-learning-algorithms
deep-learning machine-learning
artificial-intelligence backprogation cnn-lstm cnn-model cnn-text-classification deep-learning deep-neural-networks forward-propagation keras-classification-models keras-tensorflow keras-tuner lstm-neural-networks pretrained-language-model pretrained-models python rnn-tensorflow tensorflow tensorflow2
Last synced: 20 Jun 2025