Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/m3hrdadfi/zabanshenas
Zabanshenas is a solution for identifying the most likely language of a piece of written text.
attention-is-all-you-need detected-languages langdetect language-detector languages langugage-recognition pytorch-implementation roberta transformer transformers
Last synced: 27 Jun 2024
![](https://github.com/m3hrdadfi.png)
https://github.com/grammarly/gector
Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)
bert grammatical-error-correction natural-language-processing nlp roberta sequence-labeling text-simplification transformers xlnet
Last synced: 23 Jun 2024
![](https://github.com/grammarly.png)
https://github.com/Tencent/TurboTransformers
A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT2, decoders, etc.) on CPU and GPU.
albert bert decoder gpt2 gpu huggingface-transformers inference machine-translation nlp pytorch roberta transformer
Last synced: 22 Jun 2024
![](https://github.com/Tencent.png)
https://github.com/HHousen/TransformerSum
Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
albert automatic-summarization bert distilbert extractive-summarization machine-learning pytorch-lightning roberta summarization summarization-dataset text-summarization transformer-models
Last synced: 16 Jun 2024
![](https://github.com/HHousen.png)
https://github.com/rinnakk/japanese-pretrained-models
Code for producing Japanese pretrained models provided by rinna Co., Ltd.
Last synced: 16 Jun 2024
![](https://github.com/rinnakk.png)
https://github.com/liuyukid/transformers-ner
PyTorch named entity recognition with transformers
adversarial-training albert bert camembert crf distilbert electra fgm ner pgd pytorch roberta softmax span transformers xlm xlmroberta
Last synced: 16 Jun 2024
![](https://github.com/liuyukid.png)
https://github.com/lhao499/language-quantized-autoencoders
Language Quantized AutoEncoders
autoencoders bert large-language-models multimodal roberta vq vqvae
Last synced: 14 Jun 2024
![](https://github.com/lhao499.png)
https://github.com/CLUEbenchmark/CLUEPretrainedModels
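A collection of high-quality Chinese pretrained models: state-of-the-art large models, the fastest small models, and models specialized for similarity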
albert bert chinese corpus dataset distillation pretrained-models roberta semantic-similarity sentence-analysis sentence-classification sentence-pairs text-classification
Last synced: 13 Jun 2024
![](https://github.com/CLUEbenchmark.png)
https://github.com/clue-ai/PromptCLUE
PromptCLUE: a zero-shot learning model supporting all Chinese tasks
bert chinese few-shot-learning gpt-3 multitask-learning pretrained-models prompt-tuning roberta t5-model transfer-learning zero-shot-learning
Last synced: 13 Jun 2024
![](https://github.com/clue-ai.png)
https://github.com/microsoft/DeBERTa
The implementation of DeBERTa
bert deeplearning language-model natural-language-understanding representation-learning roberta self-attention transformer-encoder
Last synced: 11 Jun 2024
![](https://github.com/microsoft.png)
https://github.com/vipulraheja/IteraTeR
Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)
bart bert iterative-text-editing iterative-text-revision natural-language-processing nlp pegasus roberta text-editing text-revision transformer transformers writing-assistant writing-systems
Last synced: 11 Jun 2024
![](https://github.com/vipulraheja.png)
https://github.com/guillaume-be/rust-bert
Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
bart bert deep-learning electra gpt gpt-2 language-generation machine-learning ner nlp question-answering roberta rust rust-lang sentiment-analysis transformer translation
Last synced: 09 Jun 2024
![](https://github.com/guillaume-be.png)
https://github.com/minnesotanlp/Quantifying-Annotation-Disagreement
Official implementation of Wan et al.'s paper "Everyone's Voice Matters: Quantifying Annotation Disagreement Using Demographic Information" (AAAI 2023)
aaai ai annotation natural-language-processing nlp roberta
Last synced: 08 Jun 2024
![](https://github.com/minnesotanlp.png)
https://github.com/VinAIResearch/BERTweet
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
bert bertweet bertweet-covid19 covid covid-19 covid19 english english-tweets fairseq irony-detection language-model named-entity-recognition ner part-of-speech-tagging python3 roberta sentiment-analysis text-classification transformers
Last synced: 07 Jun 2024
![](https://github.com/VinAIResearch.png)
https://github.com/nguyenvulebinh/vietnamese-roberta
A Robustly Optimized BERT Pretraining Approach for Vietnamese
bert bert-embeddings fairseq natural-language-processing pretrained-models pytorch roberta sentencepiece transformer vietnamese vietnamese-nlp
Last synced: 07 Jun 2024
![](https://github.com/nguyenvulebinh.png)
https://github.com/deepset-ai/FARM
Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch
Last synced: 07 Jun 2024
![](https://github.com/deepset-ai.png)
https://github.com/VinAIResearch/PhoBERT
PhoBERT: Pre-trained language models for Vietnamese (EMNLP-2020 Findings)
bert bert-embeddings deep-learning fairseq language-models named-entity-recognition natural-language-inference ner nli part-of-speech-tagging phobert pos-tagging python3 rdrsegmenter roberta transformers transformers-library vietnamese vietnamese-nlp vncorenlp
Last synced: 07 Jun 2024
![](https://github.com/VinAIResearch.png)
https://github.com/brightmart/albert_zh
A Lite BERT for Self-Supervised Learning of Language Representations: large-scale Chinese pretrained ALBERT models
albert bert chinese-corpus pre-trained pre-trained-model pytorch roberta tensorflow xlnet
Last synced: 06 Jun 2024
![](https://github.com/brightmart.png)
https://github.com/brightmart/xlnet_zh
Pre-trained Chinese XLNet model: XLNet_Large
bert language-model pre-train roberta xlnet
Last synced: 06 Jun 2024
![](https://github.com/brightmart.png)
https://github.com/brightmart/roberta_zh
RoBERTa for Chinese: Chinese pretrained RoBERTa models
bert chinese gpt2 pre-trained pre-trained-language-models roberta
Last synced: 06 Jun 2024
![](https://github.com/brightmart.png)
https://github.com/dpressel/mint
MinT: Minimal Transformer Library and Tutorials
bart bert gpt gpt2 opt pytorch roberta sentence-transformers t5 transformer transformer-models transformers tutorials
Last synced: 31 May 2024
![](https://github.com/dpressel.png)
https://github.com/fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
cc-news ccnews commoncrawl crawler data-gathering elasticsearch extract-articles extract-information extractor json news news-archive news-articles news-crawler news-extractor news-scraper news-websites nlp python roberta
Last synced: 29 May 2024
![](https://github.com/fhamborg.png)
https://github.com/iPieter/RobBERT
A Dutch RoBERTa-based language model
bert bert-model language-model nlp nlp-resources roberta transformers
Last synced: 16 May 2024
![](https://github.com/iPieter.png)
https://github.com/Joppewouts/belabBERT
🤧belabBERT: Repository for a new Dutch language model based on the RoBERTa architecture
bert language-model nlp roberta
Last synced: 16 May 2024
![](https://github.com/Joppewouts.png)
https://github.com/Ermlab/PoLitBert
Polish RoBERTa model trained on Polish literature, Wikipedia, and OSCAR. The major assumption is that quality text will give a good model.
nlp polish roberta text-corpus
Last synced: 27 Apr 2024
![](https://github.com/Ermlab.png)
https://github.com/ai-forever/model-zoo
NLP model zoo for Russian
bert nlp pytorch roberta roberta-model russian russian-language t5 t5-model transformers
Last synced: 22 Apr 2024
![](https://github.com/ai-forever.png)
https://github.com/jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
bert gpt2 machine-learning natural-language-processing neural-network nlp pytorch roberta transformer transformers visualization
Last synced: 20 Apr 2024
![](https://github.com/jessevig.png)
https://github.com/asyml/texar-pytorch
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
bert casl-project data-processing deep-learning dialog-systems gpt-2 machine-learning machine-translation natural-language-processing python pytorch roberta texar texar-pytorch text-data text-generation xlnet
Last synced: 19 Apr 2024
![](https://github.com/asyml.png)
https://github.com/microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
adaptation deberta deep-learning gpt-2 gpt-3 language-model lora low-rank pytorch roberta
Last synced: 16 Apr 2024
![](https://github.com/microsoft.png)
https://github.com/CLUEbenchmark/CLUECorpus2020
Large-scale pre-training corpus for Chinese (100 GB)
albert bert chinese chinese-corpus corpus datasets nlp pretrain roberta
Last synced: 04 Apr 2024
![](https://github.com/CLUEbenchmark.png)
https://github.com/ymcui/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
bert bert-wwm bert-wwm-ext chinese-bert nlp pytorch rbt roberta roberta-wwm tensorflow
Last synced: 31 Mar 2024
![](https://github.com/ymcui.png)
https://github.com/dbiir/UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
albert bart bert chinese classification clue elmo fine-tuning gpt gpt-2 model-zoo natural-language-processing ner pegasus pre-training pytorch roberta t5 unilm xlm-roberta
Last synced: 31 Mar 2024
![](https://github.com/dbiir.png)
https://github.com/lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
bert chinese dataset ernie gpt gpt-2 large-language-models llm multimodel nezha nlp nlu-nlg pangu pretrained-models roberta simbert xlnet
Last synced: 31 Mar 2024
![](https://github.com/lonePatient.png)
https://github.com/labteral/ernie
Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.
albert bert bert-as-service bert-embeddings bert-model bert-models distilbert huggingface huggingface-transformer keras natural-language-processing nlp roberta sentence-classification tensorflow tensorflow2 transformer-architecture transformer-tensorflow2 transformers
Last synced: 21 Mar 2024
![](https://github.com/labteral.png)
https://github.com/EricFillion/happy-transformer
Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.
ai artificial-intelligence bert deep-learning language-models machine-learning natural-language-processing nlp python question-answering roberta text-classification transformers
Last synced: 21 Mar 2024
![](https://github.com/EricFillion.png)
https://github.com/explosion/curated-transformers
🤖 A PyTorch library of curated Transformer models and their composable components
albert bert camembert dolly2 falcon gptneox llama llm llms nlp pytorch roberta transformer transformers xlm-roberta
Last synced: 21 Mar 2024
![](https://github.com/explosion.png)
https://github.com/CLUEbenchmark/CLUE
Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
albert benchmark bert chinese chineseglue corpus dataset glue language-model nlu pretrained-models pytorch roberta tensorflow transformers
Last synced: 13 Mar 2024
![](https://github.com/CLUEbenchmark.png)