An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with roberta

A curated list of projects in awesome lists tagged with roberta .

https://github.com/microsoft/lora

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

adaptation deberta deep-learning gpt-2 gpt-3 language-model lora low-rank pytorch roberta

Last synced: 12 May 2025

https://github.com/microsoft/LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

adaptation deberta deep-learning gpt-2 gpt-3 language-model lora low-rank pytorch roberta

Last synced: 27 Mar 2025

https://github.com/ymcui/chinese-bert-wwm

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

bert bert-wwm bert-wwm-ext chinese-bert nlp pytorch rbt roberta roberta-wwm tensorflow

Last synced: 17 Nov 2025

https://github.com/ymcui/Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

bert bert-wwm bert-wwm-ext chinese-bert nlp pytorch rbt roberta roberta-wwm tensorflow

Last synced: 28 Mar 2025

https://github.com/jessevig/bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

bert gpt2 machine-learning natural-language-processing neural-network nlp pytorch roberta transformer transformers visualization

Last synced: 13 May 2025

https://github.com/cluebenchmark/clue

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

albert benchmark bert chinese chineseglue corpus dataset glue language-model nlu pretrained-models pytorch roberta tensorflow transformers

Last synced: 14 May 2025

https://github.com/CLUEbenchmark/CLUE

中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

albert benchmark bert chinese chineseglue corpus dataset glue language-model nlu pretrained-models pytorch roberta tensorflow transformers

Last synced: 28 Mar 2025

https://github.com/brightmart/albert_zh

A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型

albert bert chinese-corpus pre-trained pre-trained-model pytorch roberta tensorflow xlnet

Last synced: 14 May 2025

https://github.com/guillaume-be/rust-bert

Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)

bart bert deep-learning electra gpt gpt-2 language-generation machine-learning ner nlp question-answering roberta rust rust-lang sentiment-analysis transformer translation

Last synced: 14 May 2025

https://github.com/brightmart/roberta_zh

RoBERTa中文预训练模型: RoBERTa for Chinese

bert chinese gpt2 pre-trained pre-trained-language-models roberta

Last synced: 15 May 2025

https://github.com/deepset-ai/farm

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch

Last synced: 11 Apr 2025

https://github.com/deepset-ai/FARM

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch

Last synced: 03 Apr 2025

https://github.com/tencent/turbotransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

albert bert decoder gpt2 gpu huggingface-transformers inference machine-translation nlp pytorch roberta transformer

Last synced: 15 May 2025

https://github.com/cluebenchmark/cluener2020

CLUENER2020 中文细粒度命名实体识别 Fine Grained Named Entity Recognition

albert bert chinese chinese-ner chinesener dataset fine-grained-ner named-entity-recognition ner roberta seq2seq sequence-labeling sequence-to-sequence

Last synced: 08 Apr 2025

https://github.com/Tencent/TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

albert bert decoder gpt2 gpu huggingface-transformers inference machine-translation nlp pytorch roberta transformer

Last synced: 19 Mar 2025

https://github.com/920232796/bert_seq2seq

pytorch实现 Bert 做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持t5模型,支持GPT2进行文章续写。

autotitle bert crf gpt2 ner pytorch roberta seq2seq t5-model text-classification unilm

Last synced: 15 May 2025

https://github.com/cluebenchmark/cluecorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

albert bert chinese chinese-corpus corpus datasets nlp pretrain roberta

Last synced: 26 Jan 2026

https://github.com/CLUEbenchmark/CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

albert bert chinese chinese-corpus corpus datasets nlp pretrain roberta

Last synced: 09 May 2025

https://github.com/grammarly/gector

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

bert grammatical-error-correction natural-language-processing nlp roberta sequence-labeling text-simplification transformers xlnet

Last synced: 21 Feb 2026

https://github.com/explosion/curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

albert bert camembert dolly2 falcon gptneox llama llm llms nlp pytorch roberta transformer transformers xlm-roberta

Last synced: 23 Oct 2025

https://github.com/asyml/texar-pytorch

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/

bert casl-project data-processing deep-learning dialog-systems gpt-2 machine-learning machine-translation natural-language-processing python pytorch roberta texar texar-pytorch text-data text-generation xlnet

Last synced: 08 Oct 2025

https://github.com/rinnakk/japanese-pretrained-models

Code for producing Japanese pretrained models provided by rinna Co., Ltd.

gpt2 japanese nlp roberta

Last synced: 09 Apr 2025

https://github.com/HHousen/TransformerSum

Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.

albert automatic-summarization bert distilbert extractive-summarization machine-learning pytorch-lightning roberta summarization summarization-dataset text-summarization transformer-models

Last synced: 29 Apr 2025

https://github.com/brightmart/xlnet_zh

中文预训练XLNet模型: Pre-Trained Chinese XLNet_Large

bert language-model pre-train roberta xlnet

Last synced: 20 Jun 2025

https://github.com/ipieter/robbert

A Dutch RoBERTa-based language model

bert bert-model language-model nlp nlp-resources roberta transformers

Last synced: 05 Mar 2026

https://github.com/iPieter/RobBERT

A Dutch RoBERTa-based language model

bert bert-model language-model nlp nlp-resources roberta transformers

Last synced: 10 May 2025

https://github.com/brightmart/multi-label_classification

transform multi-label classification as sentence pair task, with more training data and information

bert multi-label-classification roberta sentence-pair-classification text-classification

Last synced: 15 Apr 2025

https://github.com/jaketae/koclip

KoCLIP: Korean port of OpenAI CLIP, in Flax

flax jax openai-clip roberta vision-transformer

Last synced: 13 May 2025

https://github.com/mim-solutions/bert_for_longer_texts

BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them to BERT, intermediate results are pooled. The implementation allows fine-tuning.

bert deep-learning machine-learning natural-language-processing nlp pytorch roberta text-classification transfer-learning transformers

Last synced: 07 Apr 2026

https://github.com/microsoft/AdaMix

This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.12410).

adapter bert dart glue gpt-2 nlg nlp nlu parameter-efficient pytorch roberta webnlg

Last synced: 21 Jun 2025

https://github.com/microsoft/adamix

This is the implementation of the paper AdaMix: Mixture-of-Adaptations for Parameter-efficient Model Tuning (https://arxiv.org/abs/2205.12410).

adapter bert dart glue gpt-2 nlg nlp nlu parameter-efficient pytorch roberta webnlg

Last synced: 10 Apr 2025

https://github.com/tae898/erc

The official implementation of "EmoBERTa: Speaker-Aware Emotion Recognition in Conversation with RoBERTa"

bert deep-learning emotion-recognition huggingface nlp pytorch roberta transformers

Last synced: 04 Oct 2025

https://github.com/lonepatient/clue_pytorch

CLUE baseline pytorch CLUE的pytorch版本基线

albert bert chinese classification clue ernie glue pytorch roberta xlnet

Last synced: 12 Oct 2025

https://github.com/YJiangcm/SST-2-sentiment-analysis

Use BiLSTM_attention, BERT, ALBERT, RoBERTa, XLNet model to classify the SST-2 data set based on pytorch

albert bert bilstm-attention google-colab pytorch roberta xlnet

Last synced: 19 Nov 2025

https://github.com/xiongma/roberta-wwm-base-distill

this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large

bert distillation natural-language-processing pretrained-models roberta tensorflow

Last synced: 13 Apr 2025

https://github.com/920232796/bert_seq2seq_ddp

bert_seq2seq的DDP版本,支持bert、roberta、nezha、t5、gpt2等模型,支持seq2seq、ner、关系抽取等任务,无需添加额外代码,轻松启动DDP多卡训练。

bert data-parallel ddp gpt2 nezha pytorch roberta t5 transformer

Last synced: 13 Apr 2025

https://github.com/hritik5102/fake-news-classification-model

✨ Fake news classification using source adaptive framework - BE Project 🎓The repository contains Detailed Documentation of the project, Classification pipeline, Architecture, System Interface Design, Tech stack used.

bert bilstm deep-learning fake-news fake-news-classification fake-news-dataset fake-news-detection glove machine-learning natural-language-processing openai-gpt2 python roberta

Last synced: 07 May 2025

https://github.com/roim1998/apt

[ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference

bert efficient-deep-learning llama2 llm llm-finetuning peft peft-fine-tuning-llm pruning roberta t5

Last synced: 16 May 2025

https://github.com/danieldk/oxidized-transformers

Modular Rust transformer/LLM library using Candle

bert llama llama2 llm roberta rust transformers

Last synced: 28 Jul 2025

https://github.com/oxidized-transformers/oxidized-transformers

Modular Rust transformer/LLM library using Candle

bert llama llama2 llm roberta rust transformers

Last synced: 25 Oct 2025

https://github.com/adapter-hub/efficient-task-transfer

Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021

adapters bert nlp roberta transfer-learning transformers

Last synced: 05 Aug 2025

https://github.com/ermlab/politbert

Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good model.

nlp polish roberta text-corpus

Last synced: 07 Apr 2026

https://github.com/Ermlab/PoLitBert

Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good model.

nlp polish roberta text-corpus

Last synced: 26 Apr 2025

https://github.com/nipunsadvilkar/roberta-base-mr

RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week

flax huggingface jax marathi roberta transformers

Last synced: 18 Apr 2025

https://github.com/declare-lab/team

Our EMNLP 2022 paper on MCQA

deberta mcq question-answering roberta transformers

Last synced: 14 Apr 2025

https://github.com/m3hrdadfi/zabanshenas

Zabanshenas is a solution for identifying the most likely language of a piece of written text. Demo (👇 )

attention-is-all-you-need detected-languages langdetect language-detector languages langugage-recognition pytorch-implementation roberta transformer transformers

Last synced: 09 Jul 2025

https://github.com/ztjhz/minilm

Small Model Is All You Need - NTU SC4001 Neural Network & Deep Learning Project

bert deep-learning deepspeed gpt2 llama llm neural-network nlp ntu roberta sc4001 wandb

Last synced: 30 Jul 2025

https://github.com/lifeweb-ir/lm

Lifeweb AI team Language Models

language-model mobilebert persian-nlp roberta

Last synced: 31 Jan 2026

https://github.com/vremold/dtem

Developer Technical Expertise Mining

github gnn graphcodebert roberta

Last synced: 30 Apr 2025

https://github.com/haozhg/lmd

Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models

bert deep-learning language-models multilingual-bert natural-language-processing nlp pretrained-models python pytorch roberta transformers xlm-roberta

Last synced: 17 Jan 2026

https://github.com/smappnyu/smaberta

Wrapper for stable version of RoBERTa language models

huggingface nlp roberta transfer-learning

Last synced: 10 Apr 2025

https://github.com/jaketae/auto-tagger

Fine-tuning and zero-shot learning with transformers to automatically tag my study blog posts

bart bert multilabel-classification nli nlp pytorch roberta tagging transformer zero-shot-learning

Last synced: 11 Oct 2025

https://github.com/Joppewouts/belabBERT

🤧belabBERT: Repository for a new Dutch language model based on the RoBERTa architecture

bert language-model nlp roberta

Last synced: 10 May 2025

https://github.com/cyberagentailab/japanese-nli-model

This repository provides the code for Japanese NLI model, a fine-tuned masked language model.

bert japanese natural-language-processing natural-language-understanding nli nlp roberta sentence-transformers transformers

Last synced: 21 Sep 2025

https://github.com/minnesotanlp/Quantifying-Annotation-Disagreement

Official implementation of Wan et al's paper "Everyone's Voice Matters: Quantifying Annotation Disagreement Using Demographic Information" (AAAI 2023)

aaai ai annotation natural-language-processing nlp roberta

Last synced: 08 Apr 2025

https://github.com/louisbrulenaudet/lemone-embed

All-in-one repo for the Lemone-embed project, a series of fine-tuned embedding models for Tax retrieval augmented generation (RAG).

e5 embedding-models embeddings fine-tuning huggingface lemone lemone-embed open-source rag roberta roberta-model sentence-transformers similarity-search tax taxation

Last synced: 22 Apr 2025

https://github.com/deepmancer/roberta-adapter-fine-tuning

Fine-tuning a RoBERTa model for sentiment analysis on the IMDB movie reviews dataset using the Adapter method and PyTorch Transformers

adapter-finetuning bert huggingface imdb-dataset llm pytorch roberta roberta-model sentiment-analysis sentiment-classification transformers

Last synced: 01 Apr 2025

https://github.com/vremold/linkso

A weakly supervised approach to link Stack Overflow posts to programming language textbooks. Mainly rely on matching in 3 extracted features: keyphrase, semantics and code elements

python roberta stackoverflow textbook

Last synced: 30 Apr 2026

https://github.com/muhammadadyl/SarcasmDetection

This project used in my dissertation for detecting sarcasm in twitter dataset using Sate-of-the-Art Transformer

bert-model detection distilbert distilroberta nlp pytorch pytorch-transformers roberta sarcasm transformers twitter xlnet

Last synced: 11 Mar 2025

https://github.com/dsfsi/zabantu-beta

ZaBantu is a fleet of light-weight Masked Language Models for Southern Bantu Languages

low-resource-languages nlp roberta sotho tshivenda tsonga xlm-roberta zulu

Last synced: 16 Jun 2026

https://github.com/walidbosso/roberta_fine_tuning_sentiment_analysis

Welcome to our RoBERTa Sentiment Analysis project! In this repository, we explore the world of Natural Language Processing (NLP) by fine-tuning a RoBERTa Transformer for sentiment analysis.

bert classification fine-tuning huggingface machine machine-learning movies-reviews neural-network nlp-machine-learning numpy pandas pytorch roberta roberta-model roberta-tokenizer sentiment-analysis transformer

Last synced: 19 Apr 2026

https://github.com/subhadarship/text-clustering

Clustering text data (data mining fall 2019)

bert clustering glove-embeddings lda nlp roberta topic-modeling visualization

Last synced: 18 Mar 2025

https://github.com/vidhi1290/text-classification-with-transformers-roberta-and-xlnet-model

This project explores advanced NLP techniques using RoBERTa and XLNet models to classify human emotions from textual data. By leveraging robust training and optimization methods, it aims to enhance emotion detection accuracy in applications like social media analysis and customer support.

autoencoder bert ktrain matplotlib natural-language-processing nlp numpy pandas python roberta seaborn sklearn tensorflow transformers xlnet

Last synced: 06 Jan 2026

https://github.com/kuberwastaken/orcus

A Dual Model AI Based Comment-Analyzer and Flag-Generator for Social Media

artificial-intelligence comments gpt-2 linkedin orcus roberta

Last synced: 26 Feb 2026

https://github.com/tikquuss/eulascript

Machine learning (ML) solution that review end-user license agreements (EULA) for terms and conditions that are unacceptable to the government

albert bert distilbert eula huggingface ktrain pandas roberta xlnet

Last synced: 28 Apr 2026