Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/m3hrdadfi/zabanshenas

Zabanshenas is a solution for identifying the most likely language of a piece of written text.

attention-is-all-you-need detected-languages langdetect language-detector languages langugage-recognition pytorch-implementation roberta transformer transformers

Last synced: 27 Jun 2024

https://github.com/grammarly/gector

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

bert grammatical-error-correction natural-language-processing nlp roberta sequence-labeling text-simplification transformers xlnet

Last synced: 23 Jun 2024

https://github.com/Tencent/TurboTransformers

A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT2, decoders, etc.) on CPU and GPU.

albert bert decoder gpt2 gpu huggingface-transformers inference machine-translation nlp pytorch roberta transformer

Last synced: 22 Jun 2024

https://github.com/HHousen/TransformerSum

Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.

albert automatic-summarization bert distilbert extractive-summarization machine-learning pytorch-lightning roberta summarization summarization-dataset text-summarization transformer-models

Last synced: 16 Jun 2024

https://github.com/rinnakk/japanese-pretrained-models

Code for producing Japanese pretrained models provided by rinna Co., Ltd.

gpt2 japanese nlp roberta

Last synced: 16 Jun 2024

https://github.com/vipulraheja/IteraTeR

Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)

bart bert iterative-text-editing iterative-text-revision natural-language-processing nlp pegasus roberta text-editing text-revision transformer transformers writing-assistant writing-systems

Last synced: 11 Jun 2024

https://github.com/guillaume-be/rust-bert

Rust-native, ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2, ...)

bart bert deep-learning electra gpt gpt-2 language-generation machine-learning ner nlp question-answering roberta rust rust-lang sentiment-analysis transformer translation

Last synced: 09 Jun 2024

https://github.com/minnesotanlp/Quantifying-Annotation-Disagreement

Official implementation of Wan et al.'s paper "Everyone's Voice Matters: Quantifying Annotation Disagreement Using Demographic Information" (AAAI 2023)

aaai ai annotation natural-language-processing nlp roberta

Last synced: 08 Jun 2024

https://github.com/deepset-ai/FARM

🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch

Last synced: 07 Jun 2024

https://github.com/brightmart/albert_zh

A Lite BERT for Self-Supervised Learning of Language Representations: large-scale pre-trained Chinese ALBERT models

albert bert chinese-corpus pre-trained pre-trained-model pytorch roberta tensorflow xlnet

Last synced: 06 Jun 2024

https://github.com/brightmart/xlnet_zh

Pre-trained Chinese XLNet model: Pre-Trained Chinese XLNet_Large

bert language-model pre-train roberta xlnet

Last synced: 06 Jun 2024

https://github.com/brightmart/roberta_zh

Pre-trained Chinese RoBERTa model: RoBERTa for Chinese

bert chinese gpt2 pre-trained pre-trained-language-models roberta

Last synced: 06 Jun 2024

https://github.com/iPieter/RobBERT

A Dutch RoBERTa-based language model

bert bert-model language-model nlp nlp-resources roberta transformers

Last synced: 16 May 2024

https://github.com/Joppewouts/belabBERT

🤧belabBERT: Repository for a new Dutch language model based on the RoBERTa architecture

bert language-model nlp roberta

Last synced: 16 May 2024

https://github.com/Ermlab/PoLitBert

Polish RoBERTa model trained on Polish literature, Wikipedia, and OSCAR. The major assumption is that quality text will give a good model.

nlp polish roberta text-corpus

Last synced: 27 Apr 2024

https://github.com/jessevig/bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

bert gpt2 machine-learning natural-language-processing neural-network nlp pytorch roberta transformer transformers visualization

Last synced: 20 Apr 2024

https://github.com/asyml/texar-pytorch

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/

bert casl-project data-processing deep-learning dialog-systems gpt-2 machine-learning machine-translation natural-language-processing python pytorch roberta texar texar-pytorch text-data text-generation xlnet

Last synced: 19 Apr 2024

https://github.com/microsoft/LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

adaptation deberta deep-learning gpt-2 gpt-3 language-model lora low-rank pytorch roberta

Last synced: 16 Apr 2024

https://github.com/CLUEbenchmark/CLUECorpus2020

Large-scale pre-training corpus for Chinese (100 GB)

albert bert chinese chinese-corpus corpus datasets nlp pretrain roberta

Last synced: 04 Apr 2024

https://github.com/ymcui/Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)

bert bert-wwm bert-wwm-ext chinese-bert nlp pytorch rbt roberta roberta-wwm tensorflow

Last synced: 31 Mar 2024

https://github.com/dbiir/UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

albert bart bert chinese classification clue elmo fine-tuning gpt gpt-2 model-zoo natural-language-processing ner pegasus pre-training pytorch roberta t5 unilm xlm-roberta

Last synced: 31 Mar 2024

https://github.com/lonePatient/awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pre-trained models, large models, multimodal models, and large language models

bert chinese dataset ernie gpt gpt-2 large-language-models llm multimodel nezha nlp nlu-nlg pangu pretrained-models roberta simbert xlnet

Last synced: 31 Mar 2024

https://github.com/explosion/curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

albert bert camembert dolly2 falcon gptneox llama llm llms nlp pytorch roberta transformer transformers xlm-roberta

Last synced: 21 Mar 2024

https://github.com/CLUEbenchmark/CLUE

Chinese Language Understanding Evaluation Benchmark (CLUE): datasets, baselines, pre-trained models, corpus, and leaderboard

albert benchmark bert chinese chineseglue corpus dataset glue language-model nlu pretrained-models pytorch roberta tensorflow transformers

Last synced: 13 Mar 2024