An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with pretrained-language-model

A curated list of projects in awesome lists tagged with pretrained-language-model .

https://github.com/wenge-research/yayi2

YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)

artificial-intelligence chat chinese gpt natural-language-generation pretrained-language-model yayi

Last synced: 30 Mar 2025

https://github.com/wenge-research/YAYI2

YAYI 2 是中科闻歌研发的新一代开源大语言模型,采用了超过 2 万亿 Tokens 的高质量、多语言语料进行预训练。(Repo for YaYi 2 Chinese LLMs)

artificial-intelligence chat chinese gpt natural-language-generation pretrained-language-model yayi

Last synced: 09 May 2025

https://github.com/thudm/p-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

natural-language-processing p-tuning parameter-efficient-learning pretrained-language-model prompt-tuning

Last synced: 15 May 2025

https://github.com/THUDM/P-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

natural-language-processing p-tuning parameter-efficient-learning pretrained-language-model prompt-tuning

Last synced: 14 Mar 2025

https://github.com/thunlp/opendelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

deep-learning nlp nlp-library parameter-efficient-learning pretrained-language-model

Last synced: 15 May 2025

https://github.com/thunlp/OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

deep-learning nlp nlp-library parameter-efficient-learning pretrained-language-model

Last synced: 09 May 2025

https://github.com/allenai/dont-stop-pretraining

Code associated with the Don't Stop Pretraining ACL 2020 paper

natural-language-processing pretrained-language-model

Last synced: 13 Oct 2025

https://github.com/gaoisbest/NLP-Projects

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding

dialogue-systems information-extraction information-retrieval knowledge-graph machine-reading-comprehension network-embedding pretrained-language-model sentence2vec sequence-labeling text-classification text-generation word2vec

Last synced: 07 Apr 2025

https://github.com/lyh-yf/mwptoolkit

MWPToolkit is an open-source framework for math word problem(MWP) solvers.

deep-learning graph-to-tree math-word-problem pretrained-language-model pytorch sequence-to-sequence sequence-to-tree

Last synced: 08 Oct 2025

https://github.com/EagleW/Scientific-Inspiration-Machines-Optimized-for-Novelty

Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty

acl2024 gpt4 hypothesis-generation llm pretrained-language-model pytorch retrieval-augmented-generation text-generation

Last synced: 14 Oct 2025

https://github.com/heraclex12/nlp2sparql

Translate Natural Language Processing to SPARQL Query and vice versa

bert2bert knowledge-base machine-translation pretrained-language-model question-answering sparql spbert

Last synced: 19 Apr 2025

https://github.com/ganjinzero/coder

CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]

embeddings medical multi-language nlp pretrained-language-model umls

Last synced: 25 Oct 2025

https://github.com/ganjinzero/biobart

BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]

biomedical generative pretrained-language-model

Last synced: 13 Jul 2025

https://github.com/thunlp/cokebert

CokeBERT: Contextual Knowledge Selection and Embedding towards Enhanced Pre-Trained Language Models

bert knowledge-graph nlp pretrained-language-model pytorch

Last synced: 25 Apr 2025

https://github.com/engineeringsoftware/coditt5

CoditT5: Pretraining for Source Code and Natural Language Editing

machine-learning pretrained-language-model software-engineering

Last synced: 12 May 2025

https://github.com/wxl1999/unicrs

[KDD22] Official PyTorch implementation for "Towards Unified Conversational Recommender Systems via Knowledge-Enhanced Prompt Learning".

conversation conversational-ai conversational-bots dialog dialogue dialogue-systems pretrained-language-model pretrained-models pretraining prompt prompt-tuning prompts recommendation recommendation-system recommender-system

Last synced: 26 Oct 2025

https://github.com/yingyuankai/AiSpace

AiSpace: Better practices for deep learning model development and deployment For Tensorflow 2.0

aispace albert-chinese bert chinese clue cmrc2018 dureader electra ernie lr-finder pretrained-language-model pretrained-models roberta-chinese swa tensorflow2 xlnet

Last synced: 18 Mar 2025

https://github.com/rucaibox/elmer

This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation

non-autoregressive-translation pretrained-language-model text-generation

Last synced: 17 Sep 2025

https://github.com/megagonlabs/cocosum

:coconut: Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)

co-decoding comparative-opinion-summarization natural-language-generation opinion-summarization pretrained-language-model pytorch-lightning text-generation

Last synced: 14 Jun 2025

https://github.com/arianhosseini/negation-learning

code for our paper "Understanding by Understanding Not: Modeling Negation in Language Models"

bert language-model negation pretrained-language-model pytorch transformer

Last synced: 15 Mar 2025

https://github.com/imkett/resee

[EMNLP'23 Oral] ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue PyTorch Implementation

dataset dialogue-systems emnlp2023 multimodal-dialogue pretrained-language-model transformers visual-dialogue

Last synced: 21 Mar 2025

https://github.com/imsanko/image_caption_generator_with_transformers

This repository contains code for generating captions for images using a Transformer-based model. The model used is the `VisionEncoderDecoderModel` from the Hugging Face Transformers library, specifically the `nlpconnect/vit-gpt2-image-captioning` model.

collaboration ipython-notebook jypyternotebook llm pretrained-language-model pthon3 transfromers

Last synced: 27 Oct 2025

https://github.com/eric11eca/reckoning-metakg

RECKONING is a bi-level learning algorithm that improves language models' reasoning ability by folding contextual knowledge into parametric knowledge through back-propagation.

bilevel-optimization complex-reasoning meta-learning pretrained-language-model question-answering

Last synced: 14 Apr 2025

https://github.com/damo-nlp-sg/peerda

Source code of "PeerDA: Data Augmentation via Modeling Peer Relation for Span Identification Tasks" (ACL23)

data-augmentation machine-reading-comprehension pretrained-language-model

Last synced: 03 May 2025

https://github.com/xcollab/huggingface

This repository provides an overview of Hugging Face's Transformers library, a powerful tool for natural language processing (NLP) and machine learning tasks.

bert bert-model gpt gpt-models huggingface huggingface-transformers llm llms models pretrained-language-model pretrained-models python transformer transformers-models

Last synced: 10 Apr 2025

https://github.com/apsinghanalytics/finragify_app

An LLM app leveraging RAG with LangChain and GPT-4 mini to analyze earnings call transcripts, assess company performance, using natural language queries (NLP), FAISS (vector database), and Hugging Face re-ranking models.

aws-ec2 cloud-application docker-container earnings-transcripts faiss-vector-database finance fine-tuning gpt-4o-mini huggingface-models langchain-python large-language-model natural-language-processing pretrained-language-model prompt-engineering question-answering-system reranking retrieval-augmented-generation stocks vector-embeddings

Last synced: 05 Apr 2025

https://github.com/sreeeswaran/train-your-llm

This repository contains code and resources for training, fine-tuning, and deploying large language models using Hugging Face's Transformers library.

artificial-intelligence deep-learning language-model large-language-model large-language-models llm llm-training llms machine-learning model-training nlp pretrained-language-model pretrained-models training

Last synced: 13 Jul 2025

https://github.com/cai991108/machine-learning-and-language-model

This project explores GPT-2 and Llama models through pre-training, fine-tuning, and Chain-of-Thought (CoT) prompting. It includes memory-efficient optimizations (SGD, LoRA, BAdam) and evaluations on math datasets (GSM8K, NumGLUE, StimulEq, SVAMP).

chainofthought finetune-llm gpt2 llama llm llm-inference pretrained-language-model

Last synced: 13 Nov 2025

https://github.com/shreydan/masked-language-modeling

Transformers Pre-Training with MLM objective — implemented encoder-only model and trained from scratch on Wikipedia dataset.

masked-language-models nlp pretrained-language-model pytorch transformers

Last synced: 15 May 2025

https://github.com/20101301-alina-hasan/robust-fake-review-detection-using-uncertainty-aware-lstm-and-bert

Our study utilizes BERT and LSTM models alongside Monte Carlo Dropout (MCD) on the Yelp Labelled Dataset. MCD bolsters robustness by introducing uncertainty through neuron dropout. The BERT-embedded MCD achieves an impressive 91.75% accuracy, surpassing the LSTM model.

artificial-intelligence bert fake-review-detection language-model long-short-term-memory lstm natural-language-preprocessing neural-network pretrained-language-model yelp-dataset

Last synced: 17 Jul 2025