Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article proposing a measure of machine intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced strong results in language modeling, parsing, and other natural-language tasks.

https://github.com/google-research/bert

TensorFlow code and pre-trained models for BERT

google natural-language-processing natural-language-understanding nlp tensorflow

Last synced: 30 Jul 2024

https://github.com/hankcs/HanLP

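Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, constituency parsing, semantic dependency parsing, semantic role labeling, coreference resolution, style transfer, semantic similarity, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin and simplified/traditional Chinese conversion, natural language processing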

dependency-parser hanlp named-entity-recognition natural-language-processing nlp pos-tagging semantic-parsing text-classification

Last synced: 31 Jul 2024

https://github.com/microsoft/unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

beit beit-3 bitnet deepnet document-ai foundation-models kosmos kosmos-1 layoutlm layoutxlm llm minilm mllm multimodal nlp pre-trained-model textdiffuser trocr unilm xlm-e

Last synced: 30 Jul 2024

https://github.com/huggingface/datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

computer-vision datasets deep-learning hacktoberfest machine-learning natural-language-processing nlp numpy pandas pytorch speech tensorflow

Last synced: 30 Jul 2024

https://github.com/ymcui/Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

alpaca alpaca-2 large-language-models llama llama-2 llm lora nlp plm pre-trained-language-models quantization

Last synced: 30 Jul 2024

https://github.com/RasaHQ/rasa_nlu

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

bot bot-framework botkit bots chatbot chatbots chatbots-framework conversation-driven-development conversational-agents conversational-ai conversational-bots machine-learning machine-learning-library mitie natural-language-processing nlp nlu rasa spacy wit

Last synced: 03 Aug 2024

https://github.com/RasaHQ/rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

bot bot-framework botkit bots chatbot chatbots chatbots-framework conversation-driven-development conversational-agents conversational-ai conversational-bots machine-learning machine-learning-library mitie natural-language-processing nlp nlu rasa spacy wit

Last synced: 30 Jul 2024

https://github.com/NLP-LOVE/ML-NLP

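This project covers the knowledge points and code implementations commonly tested in machine learning, deep learning, and NLP interviews. It is also the theoretical foundation that every algorithm engineer is expected to know.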

deep-learning machine-learning nlp

Last synced: 01 Aug 2024

https://github.com/dair-ai/ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

ai data-science deep-learning machine-learning natural-language-processing nlp

Last synced: 30 Jul 2024

https://github.com/graykode/nlp-tutorial

Natural Language Processing Tutorial for Deep Learning Researchers

attention bert natural-language-processing nlp paper pytorch tensorflow transformer tutorial

Last synced: 31 Jul 2024

https://github.com/flairNLP/flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

machine-learning named-entity-recognition natural-language-processing nlp pytorch semantic-role-labeling sequence-labeling word-embeddings

Last synced: 30 Jul 2024

https://github.com/deepset-ai/haystack

🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

ai bert chatgpt generative-ai gpt-3 information-retrieval language-model large-language-models machine-learning nlp python pytorch question-answering semantic-search squad summarization transformers

Last synced: 31 Jul 2024

https://github.com/ai4finance-foundation/fingpt

FinGPT: Open-source financial large language models. 🔥 The trained model is released on HuggingFace.

chatgpt finance fingpt fintech large-language-models machine-learning nlp prompt-engineering pytorch reinforcement-learning robo-advisor sentiment-analysis technical-analysis

Last synced: 02 Aug 2024

https://github.com/NVIDIA/DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

computer-vision deep-learning drug-discovery forecasting large-language-models mxnet nlp paddlepaddle pytorch recommender-systems speech-recognition speech-synthesis tensorflow tensorflow2 translation

Last synced: 31 Jul 2024

https://github.com/PaddlePaddle/PaddleHub

Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)

awesome deep-learning model nlp text2image vision

Last synced: 31 Jul 2024

https://github.com/botpress/botpress

The open-source hub to build & deploy GPT/LLM Agents ⚡️

agent ai botpress chatbot chatgpt gpt gpt-4 langchain llm nlp openai prompt

Last synced: 30 Jul 2024

https://github.com/PaddlePaddle/PaddleNLP

👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting a wide range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis, etc.

bert compression distributed-training document-intelligence embedding ernie information-extraction llama llm neural-search nlp paddlenlp pretrained-models question-answering search-engine semantic-analysis sentiment-analysis transformers uie

Last synced: 31 Jul 2024

https://github.com/allenai/allennlp

An open-source NLP research library, built on PyTorch.

data-science deep-learning natural-language-processing nlp python pytorch

Last synced: 30 Jul 2024

https://github.com/nlp-compromise/compromise

modest natural-language processing

named-entity-recognition nlp part-of-speech

Last synced: 29 Aug 2024

https://github.com/spencermountain/compromise

modest natural-language processing

named-entity-recognition nlp part-of-speech

Last synced: 31 Jul 2024

https://github.com/chiphuyen/stanford-tensorflow-tutorials

This repository contains code examples for the Stanford course TensorFlow for Deep Learning Research.

chatbot course-materials deep-learning machine-learning natural-language-processing nlp python stanford tensorflow tutorial

Last synced: 30 Jul 2024

https://stanfordnlp.github.io/CoreNLP/

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

named-entity-recognition natural-language-processing nlp nlp-parsing stanford-nlp

Last synced: 02 Aug 2024

https://github.com/stanfordnlp/CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

named-entity-recognition natural-language-processing nlp nlp-parsing stanford-nlp

Last synced: 31 Jul 2024

https://github.com/ymcui/Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)

bert bert-wwm bert-wwm-ext chinese-bert nlp pytorch rbt roberta roberta-wwm tensorflow

Last synced: 31 Jul 2024

https://github.com/Mooler0410/LLMsPracticalGuide

A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)

large-language-models natural-language-processing nlp survey

Last synced: 01 Aug 2024

https://github.com/sloria/TextBlob

Simple, Pythonic text processing: sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

natural-language-processing nlp nltk pattern python python-3

Last synced: 30 Jul 2024

https://github.com/huggingface/tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

bert gpt language-model natural-language-processing natural-language-understanding nlp transformers

Last synced: 31 Jul 2024

https://github.com/dair-ai/ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

ai data-science deeplearning machine-learning nlp

Last synced: 31 Jul 2024

https://github.com/jadore801120/attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

attention attention-is-all-you-need deep-learning natural-language-processing nlp pytorch

Last synced: 31 Jul 2024

https://github.com/Morizeyao/GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

chinese gpt-2 nlp text-generation transformer

Last synced: 31 Jul 2024

https://github.com/stanfordnlp/stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

artificial-intelligence corenlp deep-learning machine-learning named-entity-recognition natural-language-processing nlp python pytorch universal-dependencies

Last synced: 31 Jul 2024

https://github.com/ymcui/Chinese-LLaMA-Alpaca-2

Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models

64k alpaca alpaca-2 alpaca2 flash-attention large-language-models llama llama-2 llama2 llm nlp rlhf yarn

Last synced: 31 Jul 2024

https://github.com/thunlp/WantWords

An open-source online reverse dictionary.

natural-language-processing nlp reverse-dictionary word

Last synced: 31 Jul 2024

https://github.com/PaddlePaddle/models

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

computer-vision cv deep-learning models natural-language-processing neural-network nlp paddlepaddle recommendation speech

Last synced: 31 Jul 2024

https://github.com/NLPchina/ansj_seg

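Ansj word segmentation: a true Java implementation of ICT. Its segmentation quality and speed both exceed those of the open-source version of ICT. Chinese word segmentation, person name recognition, part-of-speech tagging, user-defined dictionaries.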

ansj chinese java nlp

Last synced: 31 Jul 2024

https://github.com/dragen1860/TensorFlow-2.x-Tutorials

TensorFlow 2.x tutorials and examples, including CNN, RNN, GAN, Auto-Encoders, FasterRCNN, GPT, BERT examples, etc. Introductory example code and hands-on tutorials for TF 2.0.

artificial-intelligence computer-vision deep-learning machine-learning neural-network nlp tensorflow tensorflow-2 tensorflow-examples tensorflow-tutorials

Last synced: 31 Jul 2024

https://github.com/jessevig/bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

bert gpt2 machine-learning natural-language-processing neural-network nlp pytorch roberta transformer transformers visualization

Last synced: 01 Aug 2024

https://github.com/PaddlePaddle/ERNIE

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

bert ernie language-understanding natural-language-processing nlp

Last synced: 01 Aug 2024

https://github.com/zihangdai/xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understanding

deep-learning nlp tensorflow

Last synced: 01 Aug 2024

https://github.com/modelscope/modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

cv deep-learning machine-learning multi-modal nlp python science speech

Last synced: 31 Jul 2024

https://github.com/axa-group/nlp.js

An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identification, and more

bot bots chatbot classifier conversational-ai entity-extraction hacktoberfest javascript natural-language-processing nlp nlu nodejs sentiment-analysis

Last synced: 31 Jul 2024

https://github.com/codertimo/BERT-pytorch

PyTorch implementation of Google AI's 2018 BERT

bert language-model nlp pytorch transformer

Last synced: 01 Aug 2024

https://github.com/MaartenGr/BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

bert ldavis machine-learning nlp sentence-embeddings topic topic-modeling topic-modelling topic-models transformers

Last synced: 01 Aug 2024

https://github.com/WooooDyy/LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

agent large-language-models llm nlp survey

Last synced: 31 Jul 2024

https://github.com/axa-group/Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

data document extraction hacktoberfest images nlp ocr parsr pdf python typescript

Last synced: 30 Jul 2024

https://github.com/clovaai/donut

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

computer-vision document-ai eccv-2022 multimodal-pre-trained-model nlp ocr

Last synced: 01 Aug 2024

https://github.com/vi3k6i5/flashtext

Extract keywords from sentences or replace keywords in sentences.

data-extraction keyword-extraction nlp search-in-text word2vec

Last synced: 31 Jul 2024

https://github.com/dsdanielpark/Bard-API

An unofficial Python package that returns responses from Google Bard using a cookie value.

ai-api api bard bard-api chatbot google google-bard google-bard-api google-bard-python google-maps-api googlebard llm nlp

Last synced: 31 Jul 2024

https://github.com/aisingapore/TagUI

Free RPA tool by AI Singapore

ai nlp opencv rpa tesseract

Last synced: 30 Jul 2024

https://github.com/kelaberetiv/TagUI

Free RPA tool by AI Singapore

ai nlp opencv rpa tesseract

Last synced: 04 Aug 2024

https://github.com/chatopera/Synonyms

🌿 Chinese synonyms: a toolkit for chatbots and intelligent question answering

chatbot nlp synonyms

Last synced: 01 Aug 2024

https://github.com/SkalskiP/courses

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

computer-vision deep-learning deep-neural-networks generative-model machine-learning mlops multimodal natural-language-processing nlp stable-diffusion transformers tutorial

Last synced: 31 Jul 2024

https://github.com/Nyandwi/machine_learning_complete

A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.

computer-vision data-analysis data-science data-visualization datascience deep-learning keras machine-learning matplotlib neural-networks nlp numpy open-source pandas python scikit-learn seaborn tensorflow

Last synced: 01 Aug 2024

https://github.com/spro/practical-pytorch

Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained

natural-language-generation natural-language-processing nlg nlp seq2seq

Last synced: 30 Jul 2024

https://github.com/SCIR-HI/Huatuo-Llama-Med-Chinese

Repo for BenTsao [original name: HuaTuo (华驼)], instruction-tuning large language models with Chinese medical knowledge.

aidoctor bloom chinese huozi llama llm medgpt medical medqa nlp

Last synced: 31 Jul 2024

https://github.com/trigaten/learn_prompting

Prompt Engineering, Generative AI, and LLM Guide by Learn Prompting | Join our discord for the largest Prompt Engineering learning community

chatgpt chatgpt-api deep-learning gpt-3 gpt-4 gpt-4-api gpt3 large-language-models llm machine-learning nlp openai-api prompt-engineering prompt-toolkit prompt-tuning prompting transformers

Last synced: 02 Aug 2024

https://github.com/dsgiitr/d2l-pytorch

This project reproduces the book Dive Into Deep Learning (https://d2l.ai/), adapting the code from MXNet into PyTorch.

book computer-vision d2l data-science deep-learning dive-into-deep-learning mxnet nlp pytorch pytorch-implmention

Last synced: 31 Jul 2024

https://github.com/errata-ai/vale

📝 A markup-aware linter for prose built with speed and extensibility in mind.

linter linting nlp vale

Last synced: 30 Jul 2024

https://github.com/shibing624/text2vec

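text2vec, text to vector. A text vector representation tool that converts text into vector matrices. It implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, and is ready to use out of the box.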

embeddings nlp sentence-embeddings similarity text-similarity text2vec word2vec

Last synced: 01 Aug 2024