Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article proposing a measure of machine intelligence, now called the Turing test. More recent techniques, such as deep learning, have produced state-of-the-art results in language modeling, parsing, and many other natural-language tasks.

https://github.com/graykode/commit-autosuggestions

A tool that uses AI to automatically recommend commit messages.

bert commit-autosuggestions natural-language nlp text-generation

Last synced: 13 Nov 2024

https://github.com/gutfeeling/beginner_nlp

A curated list of beginner resources in Natural Language Processing

natural-language-processing nlp nlp-resources

Last synced: 07 Aug 2024

https://github.com/thunlp/Few-NERD

Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"

deep-learning entity-typing few-shot-learning named-entity-recognition nlp

Last synced: 03 Aug 2024

https://github.com/towhee-io/examples

Analyze unstructured data with Towhee: reverse image search, reverse video search, audio classification, question-answering systems, molecular search, and more.

audio-classification cross-modal embeddings image-classification machine-learning nlp video-tagging

Last synced: 13 Nov 2024

https://github.com/neurocult/agency

🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.

agents ai artificial-general-intelligence artificial-intelligence artificial-neural-networks autonomous-agents chatgpt generative-ai go golang gpt language-models llm llmops machine-learning neural-network nlp openai rag vector-database

Last synced: 06 Nov 2024

https://github.com/qipeng/gcn-over-pruned-trees

Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)

dependency-parse-trees dependency-parsing information-extraction natural-language-processing nlp relation-extraction

Last synced: 02 Nov 2024

https://github.com/mb-14/gomarkov

Markov chains in golang

golang markov-chain nlp

Last synced: 13 Nov 2024

https://github.com/polm/fugashi

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.

cython-wrapper japanese mecab nlp tokenizer

Last synced: 11 Nov 2024

https://github.com/omarsar/nlp_highlights

The most important NLP highlights of 2018 (PDF Report)

analytics artificial-intelligence conversational-ai deep-learning health nlp technology

Last synced: 13 Oct 2024

https://github.com/dair-ai/nlp_fundamentals

📘 Contains a series of hands-on notebooks for learning the fundamentals of NLP

deep-learning education nlp

Last synced: 10 Nov 2024

https://github.com/kefirski/pytorch_RVAE

Recurrent Variational Autoencoder that generates sequential data, implemented with PyTorch

deep-learning nlp python pytorch vae

Last synced: 02 Nov 2024

https://github.com/neuml/tldrstory

📊 Semantic search for headlines and story text

machine-learning nlp python search txtai

Last synced: 01 Nov 2024

https://github.com/shamspias/customizable-gpt-chatbot

A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. It uses OpenAI's GPT-3.5, Pinecone, FAISS, and Celery.

artificial-intelligence autogpt chatbot conversational-ai data-preprocessing django django-rest-framework gpt-3 gpt-voice langchain langchain-python longchain machine-learning natural-language-processing nlp python voice-chat voice-recognition voice-to-text voice-transcription

Last synced: 06 Nov 2024

https://github.com/kakaobrain/word2word

Easy-to-use word-to-word translations for 3,564 language pairs.

bilingual-lexicon-extraction nlp opensubtitles translation

Last synced: 10 Nov 2024

https://github.com/nashex/gpt4-playground

Clone of OpenAI's ChatGPT and Playground environments to enable experimenting with API keys.

gpt4 gpt4-api nextjs nlp openai playground

Last synced: 09 Nov 2024

https://github.com/hit-scir/huozi

Huozi: a general-purpose large language model

fine-tuning large-language-models llm nlp

Last synced: 10 Nov 2024

https://github.com/planeshifter/node-word2vec

Node.js interface to the Google word2vec tool.

nlp word2vec

Last synced: 13 Nov 2024

https://github.com/dongjunlee/transformer-tensorflow

TensorFlow implementation of 'Attention Is All You Need (2017. 6)'

attention deep-learning experiments hb-experiment nlp tensorflow transformer translation

Last synced: 08 Nov 2024

https://github.com/keiffster/program-y

Python 3.x based AIML 2.0 Chatbot interpreter, framework, related programs and knowledge files

ai aiml aiml2 api chatbot framework nlp nlp-parsing python python3 tutorial virtual virtualassistant

Last synced: 29 Oct 2024

https://github.com/ymcui/pert

PERT: Pre-training BERT with Permuted Language Model

bert nlp plm pre-trained-model pytorch tensorflow transformers

Last synced: 28 Oct 2024

https://github.com/yuhaozhang/tacred-relation

PyTorch implementation of the position-aware attention model for relation extraction

information-extraction natural-language-processing nlp relation-extraction

Last synced: 08 Nov 2024

https://github.com/MAIF/melusine

📧 Melusine: Use Python to automate your email processing workflow

courriels datascience emails natural-language-processing nlp nlp-machine-learning python python3

Last synced: 03 Nov 2024

https://github.com/dccuchile/spanish-word-embeddings

Spanish word embeddings computed with different methods and from different corpora

fasttext-embeddings glove-embeddings nlp spanish word-embeddings word2vec-embeddinngs

Last synced: 05 Aug 2024

https://github.com/explosion/displacy

💥 displaCy.js: An open-source NLP visualiser for the modern web

css javascript natural-language-processing nlp spacy svg visualization

Last synced: 25 Sep 2024

https://github.com/momegas/megabots

🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵

chatbot faiss fastapi gpt-35-turbo gpt-4 information-retrieval langchain llama natural-language-processing nlp pinecone prompt-engineering python question-answering s3

Last synced: 11 Oct 2024

https://github.com/Koziev/NLP_Datasets

My NLP datasets for the Russian language

datasets nlp nlp-resources

Last synced: 13 Nov 2024

https://github.com/domluna/memn2n

End-To-End Memory Network using TensorFlow

memory-networks nlp tensorflow

Last synced: 26 Oct 2024

https://github.com/deepset-ai/covid-qa

API & Webapp to answer questions about COVID-19. Using NLP (Question Answering) and trusted data sources.

api corona covid-19 covid-data faq nlp question-answering search

Last synced: 06 Nov 2024

https://github.com/alibaba-edu/simple-effective-text-matching

Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

deep-learning nlp quora-question-pairs snli tensorflow

Last synced: 06 Nov 2024

https://github.com/thisandagain/troll

Language sentiment analysis and neural networks... for trolls.

javascript moderation neural-network nlp sentiment sentiment-analysis

Last synced: 26 Oct 2024

https://github.com/kyzhouhzau/nlpgnn

1. Use BERT, ALBERT, and GPT2 as TensorFlow 2.0 layers. 2. Implement GCN, GAN, GIN, and GraphSAGE based on message passing.

albert albert-ner bert bert-cls bert-ner bilstm-attention gan gcn gin gnn gpt2 graph-classfication graph-convolutional-networks graphsage message-passing nlp tensorflow2 textcnn textgcn tf2

Last synced: 14 Oct 2024

https://github.com/oswaldoludwig/Seq2seq-Chatbot-for-Keras

This repository contains a new generative chatbot model based on seq2seq modeling.

chatbot conversational-agents deep-learning dialogue dialogue-generation gan generative-adversarial-network glove keras nlp seq2seq

Last synced: 02 Nov 2024

https://github.com/xplip/pixel

Research code for pixel-based encoders of language (PIXEL)

deep-learning deep-neural-networks language-model machine-learning nlp pytorch

Last synced: 14 Nov 2024

https://github.com/davidmigloz/langchain_dart

Build LLM-powered Dart/Flutter applications.

ai dart flutter generative-ai llms nlp

Last synced: 03 Nov 2024

https://github.com/wuba/qa_match

A simple, effective toolkit for short text matching

58 ai deep-learning dssm lstm machine-learning nlp qabot qatools tensorflow

Last synced: 03 Aug 2024

https://github.com/shibing624/dialogbot

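dialogbot provides search-based, task-based, and generative dialogue models. It is a dialogue bot built on QA-based, task-based, and chat-based dialogue models; it supports web-retrieval QA, domain-knowledge QA, task-guided QA, and chit-chat QA, and works out of the box.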

chatbot deep-learning dialog dialogbot nlp qa question-answering

Last synced: 12 Nov 2024

https://github.com/machine-learning-apps/Issue-Label-Bot

Code For The Issue Label Bot, an App that automatically labels issues using machine learning, available on the GitHub Marketplace. This is also code for the blog article: "How to automate tasks on GitHub with machine learning for fun and profit"

bigquery bootstrap data-science deep-learning end-to-end-application flask gcp-cloud gharchive github-api-v3 github-app keras kubernetes machine-learning machine-learning-tutorials nlp production-machine-learning tensorflow

Last synced: 25 Oct 2024

https://github.com/yunwei37/covid-19-nlp-vis

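An interactive website for visualizing and analyzing COVID-19 outbreak data, built with Flask and pyecharts. It covers outbreak data collection, daily outbreak maps, curve charts, statistical analysis, situational awareness, algorithm design for predicting confirmed case counts, and NLP-based public opinion monitoring (deployed at http://covid.yunwei123.tech/).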

covid-19 flask maps nlp pyecharts visualization

Last synced: 26 Oct 2024

https://github.com/enoch3712/ExtractThinker

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

ai llm nlp ocr openai python

Last synced: 05 Nov 2024

https://github.com/discopy/discopy

The Python toolkit for computing with string diagrams.

category-theory diagrams nlp quantum-computing

Last synced: 09 Aug 2024

https://github.com/CogStack/OpenGPT

A framework for creating grounded, instruction-based datasets and training conversational domain-expert Large Language Models (LLMs).

chatgpt gpt-4 health healthcare huggingface llm medicine nlp opengpt

Last synced: 03 Aug 2024

https://github.com/drahnr/cargo-spellcheck

Checks all your documentation for spelling and grammar mistakes, using hunspell and an nlprule-based grammar checker

cargo cargo-plugin cargo-spellcheck grammar grammar-mistakes grammarchecker hacktoberfest hunspell languagetool nlp spellchecker spelling

Last synced: 13 Nov 2024

https://github.com/explosion/prodigy-openai-recipes

✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3

annotation-tool few-shot-learning gpt-3 nlp openai openai-api prodigy zero-shot-learning

Last synced: 25 Sep 2024

https://github.com/dpressel/dliss-tutorial

Tutorial for International Summer School on Deep Learning, 2019

deep-learning machine-learning nlp

Last synced: 26 Oct 2024

https://github.com/asahi417/lm-question-generation

Multilingual/multidomain question generation datasets, models, and python library for question generation.

bart nlp pytorch question-answering question-generation t5

Last synced: 04 Nov 2024

https://github.com/cli99/llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

analysis deep-learning language-model language-models machine-learning nlp transformers

Last synced: 06 Aug 2024

https://github.com/swhl/ai-competition-collections

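A collection of posts on AI competition experience and on training and testing tricks (gathers and organizes experience posts from various AI competitions)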

competition cv data-discovery graph-neural-networks knowledge-graph nlp recommender-system speech

Last synced: 01 Nov 2024

https://github.com/xiangking/ark-nlp

A private NLP coding package that quickly implements SOTA solutions.

bert nlp transfomer

Last synced: 06 Nov 2024

https://github.com/JetRunner/BERT-of-Theseus

⛵️ The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

bert glue model-compression nlp transformers

Last synced: 03 Nov 2024

https://github.com/SimGus/Chatette

A powerful dataset generator for Rasa NLU, inspired by Chatito

botkit chatbot chatbots chatito cli dataset-generation nlg nlp nlu parsing python rasa rasa-nlu sentence

Last synced: 31 Oct 2024

https://github.com/qiangsiwei/bert_distill

BERT distillation (distillation experiments based on BERT)

bert classification distillation nlp

Last synced: 02 Nov 2024

https://github.com/UKPLab/gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

bert domain-adaptation information-retrieval nlp transformers vector-search

Last synced: 05 Aug 2024

https://github.com/mcs07/chemdataextractor

Automatically extract chemical information from scientific documents

chemistry information-extraction natural-language-processing nlp python text-mining

Last synced: 07 Nov 2024

https://github.com/graykode/ai-docstring

Visual Studio Code extension to quickly generate docstrings for Python functions using AI (NLP) technology.

bert code-summarization docstrings nlp vs-code-extenstion

Last synced: 04 Nov 2024

https://github.com/xkzhangsan/xk-time

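xk-time is a toolkit for time conversion, time calculation, time formatting, time parsing, calendars, cron expressions, and time-related NLP. It uses Java 8 (JSR-310), is thread-safe and easy to use, offers more than 70 common date-formatting templates, supports both the Java 8 time classes and Date, and is lightweight with no third-party dependencies.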

calendar cron cron-java8 date datetimeformatter-formatter dateutil formatter java jsr-310 localdate localdatetime nlp time timeconvertion

Last synced: 04 Aug 2024

https://github.com/natasha/yargy

Rule-based fact extraction for the Russian language

earley-parser information-extraction morphology nlp python russian tomita tomita-parser

Last synced: 10 Nov 2024

https://github.com/alibaba-edu/simple-effective-text-matching-pytorch

A PyTorch implementation of the ACL 2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

deep-learning nlp pytorch quora-question-pairs snli

Last synced: 06 Nov 2024

https://github.com/GaoQ1/rasa_nlu_gq

Turn natural language into structured data (supports Chinese; includes a number of custom models for different scenarios and tasks)

bert bilstm-idcnn jieba natural-language nlp nlu rasa rasa-nlu rasa-nlu-gao tensorflow

Last synced: 02 Nov 2024

https://github.com/dair-ai/nlp_newsletter

📰 Natural language processing (NLP) newsletter

deep-learning machine-learning nlp

Last synced: 10 Nov 2024

https://github.com/hankcs/multi-criteria-cws

Simple Solution for Multi-Criteria Chinese Word Segmentation

bi-lstm-crf cws dynet multi-criteria-cws nlp

Last synced: 09 Nov 2024

https://github.com/phospho-app/phospho

Text analytics for LLM apps. PostHog for prompts. Extract evaluations, intents, and events from text messages. phospho leverages LLMs (OpenAI, MistralAI, Ollama, etc.)

ai analytics generative-ai llm nextjs nlp ollama python self-hosted typescript

Last synced: 13 Oct 2024

https://github.com/farukalamai/advanced-machine-learning-engineer-roadmap-2024

A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing, model development, deployment, and maintenance.

aws computer-vision data-analysis data-science data-visualization deep-learning git-github machine-learning machine-learning-roadmap mlops natural-language-processing neural-network nlp opencv pandas python pytorch statistics tensorflow yolo

Last synced: 07 Nov 2024

https://github.com/undertheseanlp/nlp-vietnamese-progress

Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for the most common Vietnamese NLP tasks.

nlp vietnamese-nlp

Last synced: 11 Nov 2024

https://github.com/kevinlu1248/pyate

PYthon Automated Term Extraction

ai nlp symbolic-ai term-extraction

Last synced: 28 Sep 2024

https://github.com/gagolews/stringi

Fast and portable character string processing in R (with the Unicode ICU)

icu icu4c natural-language-processing nlp r regex regexp string-manipulation stringi stringr text text-processing tidy-data unicode

Last synced: 26 Oct 2024

https://github.com/hankcs/hanlp-lucene-plugin

HanLP Chinese word segmentation plugin for Lucene, supporting Lucene-based systems including Solr

chinese-text-segmentation hanlp lucene nlp solr traditional-chinese

Last synced: 26 Oct 2024

https://github.com/daac-tools/vibrato

🎤 vibrato: Viterbi-based accelerated tokenizer

japanese morphological-analysis nlp rust segmentation tokenization tokenizer

Last synced: 07 Nov 2024

https://github.com/gentaiscool/code-switching-papers

A curated list of research papers and resources on code-switching

bilingual code-mixed code-mixing code-switch code-switching language nlp papers research speech

Last synced: 08 Nov 2024

https://github.com/jameshwade/gpttools

gpttools extends gptstudio for package development to help you document code, write tests, or even explain code

chatgpt nlp openai package-development rstats rstudio-addin

Last synced: 09 Nov 2024