Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers process, understand, and generate human language. In 1950, Alan Turing published the article "Computing Machinery and Intelligence", which proposed a measure of machine intelligence now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and many other natural-language tasks.

https://github.com/QData/TextAttack

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

adversarial-attacks adversarial-examples adversarial-machine-learning data-augmentation machine-learning natural-language-processing nlp security

Last synced: 03 Nov 2024

https://github.com/qdata/textattack

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

adversarial-attacks adversarial-examples adversarial-machine-learning data-augmentation machine-learning natural-language-processing nlp security

Last synced: 29 Oct 2024

https://github.com/intellabs/nlp-architect

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

bert deep-learning deeplearning dynet nlp nlu pytorch quantization tensorflow transformers

Last synced: 26 Sep 2024

https://github.com/IntelLabs/nlp-architect

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

bert deep-learning deeplearning dynet nlp nlu pytorch quantization tensorflow transformers

Last synced: 30 Oct 2024

https://github.com/NervanaSystems/nlp-architect

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

bert deep-learning deeplearning dynet nlp nlu pytorch quantization tensorflow transformers

Last synced: 18 Aug 2024

https://github.com/li-plus/chatglm.cpp

C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)

chatglm chatglm2 chatglm3 codegeex2-6b glm4 large-language-models nlp

Last synced: 14 Oct 2024

https://github.com/textlint/textlint

The pluggable natural language linter for text and markdown.

javascript lint linter markdown natural-language nlp textlint

Last synced: 28 Oct 2024

https://github.com/eugeneyan/ml-surveys

📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, etc.

computer-vision deep-learning embeddings machine-learning nlp recommender-system reinforcement-learning survey

Last synced: 14 Oct 2024

https://github.com/huggingface/knockknock

🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code

computer-vision cv deep-learning machine-learning natural-language-processing neural-networks nlp nlproc python python36 train

Last synced: 14 Oct 2024

https://github.com/TeamHG-Memex/eli5

A library for debugging/inspecting machine learning classifiers and explaining their predictions

crfsuite data-science explanation inspection lightgbm machine-learning nlp python scikit-learn xgboost

Last synced: 09 Nov 2024

https://github.com/teamhg-memex/eli5

A library for debugging/inspecting machine learning classifiers and explaining their predictions

crfsuite data-science explanation inspection lightgbm machine-learning nlp python scikit-learn xgboost

Last synced: 10 Oct 2024

https://github.com/bigscience-workshop/promptsource

Toolkit for creating, sharing and using natural language prompts.

machine-learning natural-language-processing nlp

Last synced: 15 Oct 2024

https://github.com/thisandagain/sentiment

AFINN-based sentiment analysis for Node.js.

afinn analysis javascript nlp sentiment sentiment-analysis

Last synced: 13 Nov 2024

https://github.com/baidu/familia

A Toolkit for Industrial Topic Modeling

lda nlp sentence-lda topic-modeling topic-models twe

Last synced: 09 Oct 2024

https://github.com/baidu/Familia

A Toolkit for Industrial Topic Modeling

lda nlp sentence-lda topic-modeling topic-models twe

Last synced: 06 Nov 2024

https://github.com/datawhalechina/Daily-interview

Interview experiences compiled by Datawhale members, covering machine learning, CV, NLP, recommendation, development, and more. Stars are welcome.

cv interview-questions machine-learning nlp

Last synced: 12 Nov 2024

https://github.com/go-ego/gse

Efficient multilingual NLP and text segmentation in Go; supports English, Chinese, Japanese, and others.

chinese english go gse hmm hmm-viterbi-algorithm japanese jieba nlp segment trie

Last synced: 29 Oct 2024

https://github.com/datawhalechina/daily-interview

Interview experiences compiled by Datawhale members, covering machine learning, CV, NLP, recommendation, development, and more. Stars are welcome.

cv interview-questions machine-learning nlp

Last synced: 15 Oct 2024

https://github.com/readbeyond/aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

alignment audio cli dtw espeak espeak-ng festival ffmpeg forced-alignment linux macos nlp python smil speech srt text text-to-speech tts windows

Last synced: 13 Oct 2024

https://github.com/km1994/NLP-Interview-Notes

This repository mainly collects interview questions for NLP algorithm engineers.

bert deel-learning ner nlp transformer

Last synced: 06 Nov 2024

https://github.com/km1994/nlp-interview-notes

This repository mainly collects interview questions for NLP algorithm engineers.

bert deel-learning ner nlp transformer

Last synced: 14 Oct 2024

https://github.com/guillaume-be/rust-bert

Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)

bart bert deep-learning electra gpt gpt-2 language-generation machine-learning ner nlp question-answering roberta rust rust-lang sentiment-analysis transformer translation

Last synced: 12 Oct 2024

https://github.com/adapter-hub/adapters

A Unified Library for Parameter-Efficient and Modular Transfer Learning

adapters bert lora natural-language-processing nlp parameter-efficient-learning parameter-efficient-tuning pytorch transformers

Last synced: 29 Oct 2024

https://github.com/blmoistawinde/HarvestText

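A text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.) using unsupervised or weakly supervised methods.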

dependency-parser gitee harvesttext keyword-extraction named-entity-recognition new-word-discovery nlp pyhanlp sentiment-analysis text-cleaning text-segmentation text-summarization unsupervised

Last synced: 27 Oct 2024

https://github.com/blmoistawinde/harvesttext

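A text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.) using unsupervised or weakly supervised methods.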

dependency-parser gitee harvesttext keyword-extraction named-entity-recognition new-word-discovery nlp pyhanlp sentiment-analysis text-cleaning text-segmentation text-summarization unsupervised

Last synced: 15 Oct 2024

https://github.com/brikerman/kashgari

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

bert bert-model gpt-2 machine-learning named-entity-recognition ner nlp nlp-framework seq2seq sequence-labeling text-classification text-labeling transfer-learning

Last synced: 14 Oct 2024

https://github.com/BrikerMan/Kashgari

Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

bert bert-model gpt-2 machine-learning named-entity-recognition ner nlp nlp-framework seq2seq sequence-labeling text-classification text-labeling transfer-learning

Last synced: 30 Oct 2024

https://github.com/curiousily/Getting-Things-Done-with-Pytorch

Jupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch. Topics: Face detection with Detectron 2, Time Series anomaly detection with LSTM Autoencoders, Object Detection with YOLO v5, Build your first Neural Network, Time Series forecasting for Coronavirus daily cases, Sentiment Analysis with BER

anomaly-detection bert computer-vision coronavirus deep-learning face-detection face-recognition lstm machine-learning nlp object-detection pytorch sentiment-analysis time-series time-series-anomaly-detection time-series-forecasting transfer-learning transformer tutorial yolo

Last synced: 06 Nov 2024

https://github.com/curiousily/getting-things-done-with-pytorch

Jupyter Notebook tutorials on solving real-world problems with Machine Learning & Deep Learning using PyTorch. Topics: Face detection with Detectron 2, Time Series anomaly detection with LSTM Autoencoders, Object Detection with YOLO v5, Build your first Neural Network, Time Series forecasting for Coronavirus daily cases, Sentiment Analysis with BER

anomaly-detection bert computer-vision coronavirus deep-learning face-detection face-recognition lstm machine-learning nlp object-detection pytorch sentiment-analysis time-series time-series-anomaly-detection time-series-forecasting transfer-learning transformer tutorial yolo

Last synced: 14 Oct 2024

https://github.com/RasaHQ/rasa_core

Rasa Core is now part of the Rasa repo: An open source machine learning framework to automate text- and voice-based conversations

bot bot-framework botkit bots chatbot chatbot-framework conversational-agents conversational-ai conversational-bots machine-learning machine-learning-library nlp rasa

Last synced: 06 Nov 2024

https://github.com/xusenlinzy/api-for-open-llm

OpenAI-style API for open large language models; use LLMs just like ChatGPT. Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, etc. A unified backend interface for open-source large models.

baichuan chatglm code-llama docker internlm langchain llama llama2 llms nlp openai qwen sqlcoder xverse

Last synced: 10 Oct 2024

https://github.com/google-research/electra

ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

deep-learning nlp tensorflow

Last synced: 15 Oct 2024

https://github.com/duoergun0729/nlp

By Dou Ge (兜哥): an open-source introductory book on NLP

ai fasttext nlp security word2vec

Last synced: 15 Oct 2024

https://github.com/modelscope/data-juicer

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

chinese data-analysis data-science data-visualization dataset gpt gpt-4 instruction-tuning large-language-models llama llava llm llms multi-modal nlp opendata pre-training pytorch sora streamlit

Last synced: 13 Oct 2024

https://github.com/TigerResearch/TigerBot

TigerBot: A multi-language multi-task LLM

chinese data llama2 llm nlp

Last synced: 05 Nov 2024

https://github.com/tigerresearch/tigerbot

TigerBot: A multi-language multi-task LLM

chinese data llama2 llm nlp

Last synced: 10 Oct 2024

https://github.com/crownpku/information-extraction-chinese

Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT

chinese-nlp information-extraction named-entity-recognition nlp relation-extraction

Last synced: 15 Oct 2024

https://github.com/crownpku/Information-Extraction-Chinese

Chinese Named Entity Recognition with IDCNN/biLSTM+CRF, and Relation Extraction with biGRU+2ATT

chinese-nlp information-extraction named-entity-recognition nlp relation-extraction

Last synced: 07 Aug 2024

https://github.com/huggingface/course

The Hugging Face course on Transformers

deep-learning hacktoberfest nlp transformers

Last synced: 05 Nov 2024

https://github.com/chartbeat-labs/textacy

NLP, before and after spaCy

natural-language-processing nlp python spacy

Last synced: 14 Oct 2024

https://github.com/NLP-LOVE/Introduction-NLP

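Detailed notes on "Introduction to Natural Language Processing" (《自然语言处理入门》), the new book by the author of HanLP. A conscientious work: rather than a dry list of formulas, the book explains algorithms and models in plain, accessible language. Starting from basic concepts, it progressively covers the algorithmic principles and engineering implementations of several popular problems: Chinese word segmentation, part-of-speech tagging, named entity recognition, information extraction, text clustering, text classification, and syntactic parsing.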

ai deep-learning mechine-learing nlp

Last synced: 06 Nov 2024

https://github.com/datawhalechina/learn-nlp-with-transformers

We want to create a repo to illustrate the usage of transformers in Chinese

bert nlp transformer

Last synced: 09 Oct 2024

https://github.com/nlp-love/introduction-nlp

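Detailed notes on "Introduction to Natural Language Processing" (《自然语言处理入门》), the new book by the author of HanLP. A conscientious work: rather than a dry list of formulas, the book explains algorithms and models in plain, accessible language. Starting from basic concepts, it progressively covers the algorithmic principles and engineering implementations of several popular problems: Chinese word segmentation, part-of-speech tagging, named entity recognition, information extraction, text clustering, text classification, and syntactic parsing.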

ai deep-learning mechine-learing nlp

Last synced: 15 Oct 2024

https://github.com/chiphuyen/lazynlp

Library to scrape and clean web pages to create massive datasets.

artificial-intelligence data-science language-model natural-language-processing nlp open python text-mining

Last synced: 15 Nov 2024

https://github.com/DerwenAI/pytextrank

Python implementation of TextRank algorithms ("textgraphs") for phrase extraction

graph-algorithms machine-learning natural-language natural-language-processing nlp python spacy spacy-extension summarization textgraphs textrank

Last synced: 29 Oct 2024

https://github.com/derwenai/pytextrank

Python implementation of TextRank algorithms ("textgraphs") for phrase extraction

graph-algorithms machine-learning natural-language natural-language-processing nlp python spacy spacy-extension summarization textgraphs textrank

Last synced: 29 Oct 2024

https://github.com/harderthenharder/transformers_tasks

⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

information-extraction nlp reinforcement-learning text-classification text-generation text-matching transformers

Last synced: 14 Oct 2024

https://github.com/TingFree/NLPer-Arsenal

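A collection of NLP competition strategy implementations, baselines for various tasks, competition experience posts (current, past, and practice competitions), NLP conference dates, popular media channels, GPU recommendations, and more; continuously updated.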

baselines gpu nlp nlp-competition nlp-conference nlp-media pytorch

Last synced: 03 Nov 2024

https://github.com/luopeixiang/named_entity_recognition

Chinese named entity recognition (with concrete implementations of several models: HMM, CRF, BiLSTM, BiLSTM+CRF)

bi-lstm bi-lstm-crf chinese-ner crf hmm named-entity-recognition ner nlp pytorch-ner pytorch-nlp sequence-labeling

Last synced: 14 Oct 2024

https://github.com/asappresearch/sru

Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)

deep-learning nlp pytorch recurrent-neural-networks

Last synced: 14 Oct 2024

https://github.com/freedomintelligence/medical_nlp

Medical NLP competitions, datasets, large models, and papers

collection datasets list medical models nlp

Last synced: 15 Oct 2024

https://github.com/lonePatient/BERT-NER-Pytorch

Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)

adversarial-training albert bert chinese crf focal-loss labelsmoothing ner nlp pytorch softmax span

Last synced: 16 Nov 2024

https://github.com/koth/kcws

Deep Learning Chinese Word Segmentation

chinese-text-segmentation deep-learning nlp pos-tagger tensorflow

Last synced: 14 Oct 2024

https://github.com/lonepatient/bert-ner-pytorch

Chinese NER(Named Entity Recognition) using BERT(Softmax, CRF, Span)

adversarial-training albert bert chinese crf focal-loss labelsmoothing ner nlp pytorch softmax span

Last synced: 14 Oct 2024

https://github.com/songyouwei/absa-pytorch

Aspect Based Sentiment Analysis, PyTorch Implementations.

aspect-based-sentiment-analysis attention bert natural-language-processing nlp sentiment-analysis sentiment-classification

Last synced: 14 Oct 2024

https://github.com/delip/pytorchnlpbook

Code and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://amzn.to/3JUgR2L

deep-learning deep-neural-networks natural-language-processing neural-machine-translation neural-networks nlp pytorch pytorch-nlp pytorch-tutorial

Last synced: 15 Oct 2024

https://github.com/jalammar/ecco

Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTa, T5, and T0).

explorables language-models natural-language-processing nlp pytorch visualization

Last synced: 15 Oct 2024

https://github.com/alibaba/AliceMind

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

bert deep-learning natural-language-processing nlp

Last synced: 27 Oct 2024

https://github.com/modeltc/lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

deep-learning gpt llama llm model-serving nlp openai-triton

Last synced: 10 Oct 2024

https://github.com/ModelTC/lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

deep-learning gpt llama llm model-serving nlp openai-triton

Last synced: 28 Oct 2024

https://github.com/songyouwei/ABSA-PyTorch

Aspect Based Sentiment Analysis, PyTorch Implementations.

aspect-based-sentiment-analysis attention bert natural-language-processing nlp sentiment-analysis sentiment-classification

Last synced: 02 Nov 2024

https://github.com/delip/PyTorchNLPBook

Code and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://amzn.to/3JUgR2L

deep-learning deep-neural-networks natural-language-processing neural-machine-translation neural-networks nlp pytorch pytorch-nlp pytorch-tutorial

Last synced: 03 Sep 2024

https://github.com/huggingface/setfit

Efficient few-shot learning with Sentence Transformers

few-shot-learning nlp sentence-transformers

Last synced: 15 Oct 2024

https://github.com/rguthrie3/DeepLearningForNLPInPytorch

An IPython Notebook tutorial on deep learning for natural language processing, including structure prediction.

deep-learning lstm neural-network nlp pytorch tutorial

Last synced: 02 Nov 2024

https://github.com/rguthrie3/deeplearningfornlpinpytorch

An IPython Notebook tutorial on deep learning for natural language processing, including structure prediction.

deep-learning lstm neural-network nlp pytorch tutorial

Last synced: 14 Oct 2024

https://github.com/GauravBh1010tt/DeepLearn

Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow, and scikit-learn.

audio-processing computer-vision deep-learning nlp

Last synced: 25 Oct 2024