Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/Beomi/KcBERT

🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean Comments 한국어 댓글로 프리트레이닝한 BERT 모델과 데이터셋

bert bert-model korean-nlp nlp transformers

Last synced: 02 Jul 2024

https://github.com/SKTBrain/KoBERT

Korean BERT pre-trained cased (KoBERT)

bert korean-nlp language-model nlp pytorch transformers

Last synced: 02 Jul 2024

https://github.com/thunlp/PromptPapers

Must-read papers on prompt-based tuning for pre-trained language models.

ai bert machine-learning nlp pre-trained-language-models prompt prompt-based prompt-learning prompt-toolkit

Last synced: 30 Jun 2024

https://github.com/ukairia777/tensorflow-nlp-tutorial

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.

bert bert-ner dpo huggingface keras-tutorial llama llm lora named-entity-recognition natural-language-processing nlp nlp-tutorial question-answering sft tensorflow trainer transformers

Last synced: 29 Jun 2024

https://github.com/LINs-lab/DynMoE

[Preprint] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models

bert mixture-of-experts moe multimodal-large-language-models phi-2 qwen stablelm vision-transformer

Last synced: 28 Jun 2024

https://github.com/PlayVoice/vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

aishell3 bert bert-vits bert-vits2 naturalspeech tts vits

Last synced: 28 Jun 2024

https://github.com/ymcui/cmrc2018

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)

bert natural-language-processing question-answering reading-comprehension

Last synced: 28 Jun 2024

https://github.com/mhezarei/ai-bot

2020 AI bot challenge (ai-bot.ir) repository. This program answers a given question with a specific format and subject.

bert nlp persian-nlp

Last synced: 27 Jun 2024

https://github.com/Lukas-Justen/Law-OMNI-BERT-Project

Directly applying advancements in transfer learning from BERT results in poor accuracy in domain-specific areas like law because of a word distribution shift from general domain corpora to domain-specific corpora. In our project, we will demonstrate how the pre-trained language model BERT can be adapted to additional domains, such as contract law or court judgments.

bert bert-model contracts language-model law legal-texts statistical-linguistics

Last synced: 26 Jun 2024

https://github.com/mczhuge/Kaleido-BERT

(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain.

bert e-commerce fashion multimodal pre-training vision-language

Last synced: 23 Jun 2024

https://github.com/grammarly/gector

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

bert grammatical-error-correction natural-language-processing nlp roberta sequence-labeling text-simplification transformers xlnet

Last synced: 23 Jun 2024

https://github.com/lonePatient/BERT-NER-Pytorch

Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)

adversarial-training albert bert chinese crf focal-loss labelsmoothing ner nlp pytorch softmax span

Last synced: 23 Jun 2024

https://github.com/zjunlp/openue

OpenUE是一个轻量级知识图谱抽取工具 (An Open Toolkit for Universal Extraction from Text published at EMNLP2020: https://aclanthology.org/2020.emnlp-demos.1.pdf)

bert event-extraction intent-classification named-entity-recognition natural-language-processing nlp nlp-extraction-tasks openue pytorch relation-extraction slot-filling triple-extraction

Last synced: 23 Jun 2024

https://github.com/autoliuweijie/K-BERT

Source code of K-BERT (AAAI2020)

aaai2020 bert k-bert nlp

Last synced: 23 Jun 2024

https://github.com/THUDM/CogQA

Source code and dataset for ACL 2019 paper "Cognitive Graph for Multi-Hop Reading Comprehension at Scale"

bert graph-neural-networks question-answering

Last synced: 23 Jun 2024

https://github.com/xv44586/ccf_2020_qa_match

ccf 2020 qa match competition top1

bert ccf keras top1

Last synced: 23 Jun 2024

https://github.com/songhaoyu/BoB

The released codes for ACL 2021 paper 'BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data'

acl2021 bert dialogue-model personachat

Last synced: 23 Jun 2024

https://github.com/EmreTaha/Unsupervised-Domain-Adaptation-with-BERT

Unsupervised domain adaptation with BERT for Amazon food product reviews sentiment analysis.

adversarial-learning amazon-food-reviews bert bert-model colab domain-adaptation nlp sentiment-analysis tensorflow unsupervised-learning

Last synced: 23 Jun 2024

https://github.com/FerdinandZhong/punctuator

A small seq2seq punctuator tool based on DistilBERT

bert bert-ner chinese-nlp deep-learning nlp punctuation pytorch seq2seq

Last synced: 22 Jun 2024

https://github.com/NVIDIA-Merlin/Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

bert gtp huggingface language-model nlp pytorch recommender-system recsys seq2seq session-based-recommendation tabular-data transformer xlnet

Last synced: 22 Jun 2024

https://github.com/JetRunner/BERT-of-Theseus

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

bert glue model-compression nlp transformers

Last synced: 22 Jun 2024

https://github.com/Tencent/TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

albert bert decoder gpt2 gpu huggingface-transformers inference machine-translation nlp pytorch roberta transformer

Last synced: 22 Jun 2024

https://github.com/PaddlePaddle/awesome-DeepLearning

深度学习入门课、资深课、特色课、学术案例、产业实践案例、深度学习知识百科及面试题库The course, case and knowledge of Deep Learning and AI

bert classification cnn detection dqn dssm dynabert gan nlp pose recommender-system reinforcement-learning rnn sarsa segmentation tinybert transformer video

Last synced: 22 Jun 2024

https://github.com/Jiakui/awesome-bert

bert nlp papers, applications and github resources, including the newst xlnet , BERT、XLNet 相关论文和 github 项目

bert google-bert nlp xlnet

Last synced: 21 Jun 2024

https://maartengr.github.io/KeyBERT/

Minimal keyword extraction with BERT

bert keyphrase-extraction keyword-extraction mmr

Last synced: 21 Jun 2024

https://maartengr.github.io/BERTopic/

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

bert ldavis machine-learning nlp sentence-embeddings topic topic-modeling topic-modelling topic-models transformers

Last synced: 21 Jun 2024

https://github.com/HHousen/TransformerSum

Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.

albert automatic-summarization bert distilbert extractive-summarization machine-learning pytorch-lightning roberta summarization summarization-dataset text-summarization transformer-models

Last synced: 16 Jun 2024

https://github.com/TheAtticusProject/cuad

CUAD (NeurIPS 2021)

bert legal-nlp

Last synced: 16 Jun 2024

https://github.com/km1994/NLP-Interview-Notes

该仓库主要记录 NLP 算法工程师相关的面试题

bert deel-learning ner nlp transformer

Last synced: 16 Jun 2024

https://github.com/naver/splade

SPLADE: sparse neural search (SIGIR21, SIGIR22)

bert information-retrieval nlp passage-retrieval sparse splade

Last synced: 16 Jun 2024

https://github.com/hppRC/bert-classification-tutorial

【2023年版】BERTによるテキスト分類

bert deep-learning japanese nlp python pytorch transformers

Last synced: 16 Jun 2024

https://github.com/yao8839836/kg-bert

KG-BERT: BERT for Knowledge Graph Completion

bert knowledge-graph

Last synced: 16 Jun 2024

https://github.com/PaddlePaddle/PaddleSlim

PaddleSlim is an open-source library for deep model compression and architecture search.

bert compression detection distillation ernie nas pruning quantization segmentation sparsity tensorrt transformer yolov5 yolov6 yolov7

Last synced: 15 Jun 2024

https://github.com/FranxYao/Language-Model-Pretraining-for-Text-Generation

LM pretraining for generation, reading list, resources, conference mappings.

bert bert-model gpt language-generation language-model pretrained-models text-generation

Last synced: 15 Jun 2024

https://github.com/whu-zqh/chatgpt-vs.-bert

🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT

bert chain-of-thought chatgpt in-context-learning natural-language-understanding

Last synced: 14 Jun 2024

https://github.com/shibing624/textgen

TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。

bart bert chatglm chatgpt gpt2 llama seq2seq t5 text-generation textgen xlnet

Last synced: 14 Jun 2024

https://github.com/cvi-szu/linly

Chinese-LLaMA 1&2、Chinese-Falcon 基础模型;ChatFlow中文对话模型;中文OpenLLaMA模型;NLP预训练/指令微调数据集

bert chatbot chatgpt chinese chinese-nlp gpt-3 language-model llama nlp zero-shot-learning

Last synced: 14 Jun 2024

https://github.com/denis2054/transformers-for-nlp-2nd-edition

Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus section with ChatGPT, GPT-3.5-turbo, GPT-4, and DALL-E including jump starting GPT-4, speech-to-text, text-to-speech, text to image generation with DALL-E, Google Cloud AI,HuggingGPT, and more

bert chatgpt chatgpt-api dall-e dall-e-api deep-learning gpt-3-5-turbo gpt-4 gpt-4-api huggingface-transformers machine-learning natural-language-processing nlp openai python pytorch roberta-model transformers trax

Last synced: 14 Jun 2024

https://github.com/onejune2018/awesome-llm-eval

Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.

awsome-list awsome-lists benchmark bert chatglm chatgpt dataset evaluation gpt3 large-language-model leaderboard llama llm llm-evaluation machine-learning nlp openai qwen rag

Last synced: 14 Jun 2024

https://github.com/MaartenGr/keyBERT

Minimal keyword extraction with BERT

bert keyphrase-extraction keyword-extraction mmr

Last synced: 13 Jun 2024

https://github.com/dmmiller612/bert-extractive-summarizer

Easy to use extractive text summarization with BERT

bert coreference extractive-summarization pytorch

Last synced: 13 Jun 2024

https://github.com/hiyouga/dual-contrastive-learning

Code for our paper "Dual Contrastive Learning: Text Classification via Label-Aware Data Augmentation"

bert contrastive-learning deep-learning natural-language-processing neural-networks text-classification transformers

Last synced: 13 Jun 2024

https://github.com/xuyige/BERT4doc-Classification

Code and source for paper ``How to Fine-Tune BERT for Text Classification?``

bert natural-language-processing text-classification

Last synced: 13 Jun 2024

https://github.com/SanghunYun/UDA_pytorch

UDA(Unsupervised Data Augmentation) implemented by pytorch

bert pytorch-implementation text-classification

Last synced: 13 Jun 2024

https://github.com/ymcui/PERT

PERT: Pre-training BERT with Permuted Language Model

bert nlp plm pre-trained-model pytorch tensorflow transformers

Last synced: 13 Jun 2024

https://github.com/Tencent/PatrickStar

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.

bert gpt nlp pretrained-models pytorch

Last synced: 13 Jun 2024

https://github.com/sunyilgdx/NSP-BERT

The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"

bert correference-resolution entity-linking entity-typing natural-language-inference nlp prompt-learning sentence-classification sentiment-analysis tensorflow text-classification zero-shot

Last synced: 13 Jun 2024

https://github.com/alibaba/EasyTransfer

EasyTransfer is designed to make the development of transfer learning in NLP applications easier.

bert knowledge-distillation nlp-applications transfer-learning

Last synced: 13 Jun 2024

https://github.com/ymcui/Chinese-ELECTRA

Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)

bert chinese chinese-electra electra language-model nlp pre-trained-model pytorch tensorflow

Last synced: 13 Jun 2024

https://github.com/extreme-bert/extreme-bert

ExtremeBERT is a toolkit that accelerates the pretraining of customized language models on customized datasets, described in the paper “ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT”.

bert deep-learning language-model language-models machine-learning natural-language-processing nlp python pytorch transformer

Last synced: 13 Jun 2024

https://github.com/Sleepychord/CogLTX

The source code of NeurIPS 2020 paper "CogLTX: Applying BERT to Long Texts"

bert pytorch

Last synced: 13 Jun 2024

https://github.com/ymcui/MacBERT

Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT)

bert language-model macbert nlp pytorch tensorflow transformers

Last synced: 13 Jun 2024

https://github.com/google-research/bigbird

Transformers for Longer Sequences

bert deep-learning longer-sequences nlp transformer

Last synced: 13 Jun 2024

https://github.com/vipulraheja/IteraTeR

Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)

bart bert iterative-text-editing iterative-text-revision natural-language-processing nlp pegasus roberta text-editing text-revision transformer transformers writing-assistant writing-systems

Last synced: 11 Jun 2024

https://github.com/cambridgeltl/sapbert

[NAACL'21 & ACL'21] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking.

acl2021 bert bionlp contrastive-learning language-model lexical-semantics machine-learning metric-learning naacl2021 nlp representation-learning

Last synced: 11 Jun 2024

https://github.com/datawhalechina/leedl-tutorial

《李宏毅深度学习教程》(李宏毅老师推荐👍),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

bert chatgpt cnn deep-learning diffusion gan leedl-tutorial machine-learning network-compression pruning reinforcement-learning rnn self-attention transfer-learning transformer tutorial

Last synced: 11 Jun 2024

https://github.com/ben1234560/AiLearning-Theory-Applying

快速上手Ai理论及应用实战:基础知识、Transformer、NLP、ML、DL、竞赛。含大量注释及数据集,力求每一位能看懂并复现。

ai bert dataming deep-learning kaggle-competition learning-by-doing machine-learning nlp

Last synced: 11 Jun 2024

https://github.com/Santosh-Gupta/ScientificSummarizationDataSets

Datasets I have created for scientific summarization, and a trained BertSum model

bert bertsum dataset extractive-summarization pointer-generator summarization tensor2tensor transformer

Last synced: 10 Jun 2024

https://github.com/UKPLab/gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

bert domain-adaptation information-retrieval nlp transformers vector-search

Last synced: 09 Jun 2024

https://github.com/sno2/bertml

Use common pre-trained ML models in Deno!

bert deno machine-learning nlp rust

Last synced: 09 Jun 2024

https://github.com/guillaume-be/rust-bert

Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)

bart bert deep-learning electra gpt gpt-2 language-generation machine-learning ner nlp question-answering roberta rust rust-lang sentiment-analysis transformer translation

Last synced: 09 Jun 2024

https://github.com/oneapi-src/vertical-search-engine

AI Starter Kit for Semantic Vertical Search Engines using Intel® Extension for Pytorch

ai-starter-kit bert deep-learning pytorch

Last synced: 08 Jun 2024

https://github.com/ShannonAI/service-streamer

Boosting your Web Services of Deep Learning Applications.

bert deep-learning model-deployment pytorch tensorflow web

Last synced: 08 Jun 2024

https://github.com/deepset-ai/FARM

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch

Last synced: 07 Jun 2024

https://github.com/MilaNLProc/contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

bert embeddings multilingual-models multilingual-topic-models neural-topic-models nlp nlp-library nlp-machine-learning text-as-data topic-coherence topic-modeling transformer

Last synced: 07 Jun 2024

https://github.com/graykode/xlnet-Pytorch

Simple XLNet implementation with Pytorch Wrapper

bert natural-language-processing nlp pytorch xlnet xlnet-pytorch

Last synced: 07 Jun 2024

https://github.com/graykode/ai-docstring

Visual Studio Code extension to quickly generate docstrings for python functions using AI(NLP) technology.

bert code-summarization docstrings nlp vs-code-extenstion

Last synced: 07 Jun 2024

https://github.com/nouhadziri/DialogEntailment

The implementation of the paper "Evaluating Coherence in Dialogue Systems using Entailment"

bert dialogue-evaluation evaluation-framework natural-language-inference

Last synced: 06 Jun 2024

https://github.com/lonePatient/bert-sentence-similarity-pytorch

This repo contains a PyTorch implementation of a pretrained BERT model for sentence similarity task.

bert nlp pytorch sentence-similarity text-classification

Last synced: 06 Jun 2024

https://github.com/imgarylai/bert-embedding

🔡 Token level embeddings from BERT model on mxnet and gluonnlp

bert gluonnlp mxnet natural-language-processing nlp word-embeddings

Last synced: 06 Jun 2024

https://github.com/xu-song/bert-as-language-model

BERT as language model, fork from https://github.com/google-research/bert

bert language-model tensorflow

Last synced: 06 Jun 2024

https://github.com/yuanxiaosc/Deep_dynamic_contextualized_word_representation

TensorFlow code and pre-trained models for A Dynamic Word Representation Model Based on Deep Context. It combines the idea of BERT model and ELMo's deep context word representation.

bert elmo nlp transformer

Last synced: 06 Jun 2024

https://github.com/YC-wind/embedding_study

中文预训练模型生成字向量学习,测试BERT,ELMO的中文效果

bert chinese elmo elmo-tutorial embeddings z-w

Last synced: 06 Jun 2024

https://github.com/yuanxiaosc/BERT-for-Sequence-Labeling-and-Text-Classification

This is the template code to use BERT for sequence lableing and text classification, in order to facilitate BERT for more tasks. Currently, the template code has included conll-2003 named entity identification, Snips Slot Filling and Intent Prediction.

atis-dataset bert conll-2003 sequence-labeling snips-dataset template-project text-classification

Last synced: 06 Jun 2024

https://github.com/GaoQ1/rasa_chatbot_cn

building a chinese dialogue system based on the newest version of rasa(基于最新版本rasa搭建的对话系统)

bert chinese demo intent-classification policy python rasa rasa-chatbot rasa-core rasa-nlu rasa-nlu-gao rasa-x slot-filling tensorflow train-dialogue transformer

Last synced: 06 Jun 2024

https://github.com/GaoQ1/rasa_nlu_gq

turn natural language into structured data(支持中文,自定义了N种模型,支持不同的场景和任务)

bert bilstm-idcnn jieba natural-language nlp nlu rasa rasa-nlu rasa-nlu-gao tensorflow

Last synced: 06 Jun 2024

https://github.com/GaoQ1/rasa-bert-finetune

支持rasa-nlu 的bert finetune

bert finetune rasa rasa-nlu

Last synced: 06 Jun 2024

https://github.com/ianycxu/GCN-with-BERT

Graph Convolutional Networks (GCN) with BERT for Coreference Resolution Task [Pytorch][DGL]

bert bert-model coreference-resolution gcn gnn graph-convolutional-networks graph-neural-networks nlp pytorch

Last synced: 06 Jun 2024