Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

https://github.com/gutfeeling/beginner_nlp

A curated list of beginner resources in Natural Language Processing

natural-language-processing nlp nlp-resources

Last synced: 07 Aug 2024

https://github.com/graykode/commit-autosuggestions

A tool that AI automatically recommends commit messages.

bert commit-autosuggestions natural-language nlp text-generation

Last synced: 13 Nov 2024

https://github.com/thunlp/Few-NERD

Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"

deep-learning entity-typing few-shot-learning named-entity-recognition nlp

Last synced: 03 Aug 2024

https://github.com/towhee-io/examples

Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.

audio-classification cross-modal embeddings image-classification machine-learning nlp video-tagging

Last synced: 13 Nov 2024

https://github.com/neurocult/agency

🕵️‍♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.

agents ai artificial-general-intelligence artificial-intelligence artificial-neural-networks autonomous-agents chatgpt generative-ai go golang gpt language-models llm llmops machine-learning neural-network nlp openai rag vector-database

Last synced: 06 Nov 2024

https://github.com/qipeng/gcn-over-pruned-trees

Graph Convolution over Pruned Dependency Trees Improves Relation Extraction (authors' PyTorch implementation)

dependency-parse-trees dependency-parsing information-extraction natural-language-processing nlp relation-extraction

Last synced: 02 Nov 2024

https://github.com/mb-14/gomarkov

Markov chains in golang

golang markov-chain nlp

Last synced: 13 Nov 2024

https://github.com/omarsar/nlp_highlights

The most important NLP highlights of 2018 (PDF Report)

analytics artificial-intelligence conversational-ai deep-learning health nlp technology

Last synced: 13 Oct 2024

https://github.com/polm/fugashi

A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.

cython-wrapper japanese mecab nlp tokenizer

Last synced: 11 Nov 2024

https://github.com/dair-ai/nlp_fundamentals

📘 Contains a series of hands-on notebooks for learning the fundamentals of NLP

deep-learning education nlp

Last synced: 10 Nov 2024

https://github.com/yuhaozhang/tacred-relation

PyTorch implementation of the position-aware attention model for relation extraction

information-extraction natural-language-processing nlp relation-extraction

Last synced: 15 Nov 2024

https://github.com/kefirski/pytorch_RVAE

Recurrent Variational Autoencoder that generates sequential data implemented with pytorch

deep-learning nlp python pytorch vae

Last synced: 02 Nov 2024

https://github.com/shamspias/customizable-gpt-chatbot

A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery for seamless integration and performance.

artificial-intelligence autogpt chatbot conversational-ai data-preprocessing django django-rest-framework gpt-3 gpt-voice langchain langchain-python longchain machine-learning natural-language-processing nlp python voice-chat voice-recognition voice-to-text voice-transcription

Last synced: 06 Nov 2024

https://github.com/neuml/tldrstory

📊 Semantic search for headlines and story text

machine-learning nlp python search txtai

Last synced: 01 Nov 2024

https://github.com/kakaobrain/word2word

Easy-to-use word-to-word translations for 3,564 language pairs.

bilingual-lexicon-extraction nlp opensubtitles translation

Last synced: 10 Nov 2024

https://github.com/nashex/gpt4-playground

Clone of OpenAI's ChatGPT and Playground environments to enable experimenting with API keys.

gpt4 gpt4-api nextjs nlp openai playground

Last synced: 09 Nov 2024

https://github.com/hit-scir/huozi

活字通用大模型

fine-tuning large-language-models llm nlp

Last synced: 10 Nov 2024

https://github.com/dongjunlee/transformer-tensorflow

TensorFlow implementation of 'Attention Is All You Need (2017. 6)'

attention deep-learning experiments hb-experiment nlp tensorflow transformer translation

Last synced: 15 Nov 2024

https://github.com/Planeshifter/node-word2vec

Node.js interface to the Google word2vec tool.

nlp word2vec

Last synced: 02 Nov 2024

https://github.com/planeshifter/node-word2vec

Node.js interface to the Google word2vec tool.

nlp word2vec

Last synced: 13 Nov 2024

https://github.com/keiffster/program-y

Python 3.x based AIML 2.0 Chatbot interpreter, framework, related programs and knowledge files

ai aiml aiml2 api chatbot framework nlp nlp-parsing python python3 tutorial virtual virtualassistant

Last synced: 29 Oct 2024

https://github.com/DongjunLee/transformer-tensorflow

TensorFlow implementation of 'Attention Is All You Need (2017. 6)'

attention deep-learning experiments hb-experiment nlp tensorflow transformer translation

Last synced: 07 Nov 2024

https://github.com/ymcui/pert

PERT: Pre-training BERT with Permuted Language Model

bert nlp plm pre-trained-model pytorch tensorflow transformers

Last synced: 28 Oct 2024

https://github.com/ymcui/PERT

PERT: Pre-training BERT with Permuted Language Model

bert nlp plm pre-trained-model pytorch tensorflow transformers

Last synced: 03 Aug 2024

https://github.com/Nashex/gpt4-playground

Clone of OpenAI's ChatGPT and Playground environments to enable experimenting with API keys.

gpt4 gpt4-api nextjs nlp openai playground

Last synced: 04 Nov 2024

https://github.com/explosion/displacy

:boom: displaCy.js: An open-source NLP visualiser for the modern web

css javascript natural-language-processing nlp spacy svg visualization

Last synced: 25 Sep 2024

https://github.com/MAIF/melusine

📧 Melusine: Use python to automatize your email processing workflow

courriels datascience emails natural-language-processing nlp nlp-machine-learning python python3

Last synced: 03 Nov 2024

https://github.com/dccuchile/spanish-word-embeddings

Spanish word embeddings computed with different methods and from different corpora

fasttext-embeddings glove-embeddings nlp spanish word-embeddings word2vec-embeddinngs

Last synced: 05 Aug 2024

https://github.com/momegas/megabots

🤖 State-of-the-art, production ready LLM apps made mega-easy, so you don't have to build them from scratch 🤯 Create a bot, now 🫵

chatbot faiss fastapi gpt-35-turbo gpt-4 information-retrieval langchain llama natural-language-processing nlp pinecone prompt-engineering python question-answering s3

Last synced: 11 Oct 2024

https://github.com/Koziev/NLP_Datasets

My NLP datasets for Russian language

datasets nlp nlp-resources

Last synced: 13 Nov 2024

https://github.com/domluna/memn2n

End-To-End Memory Network using Tensorflow

memory-networks nlp tensorflow

Last synced: 26 Oct 2024

https://github.com/deepset-ai/covid-qa

API & Webapp to answer questions about COVID-19. Using NLP (Question Answering) and trusted data sources.

api corona covid-19 covid-data faq nlp question-answering search

Last synced: 06 Nov 2024

https://github.com/alibaba-edu/simple-effective-text-matching

Source code of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

deep-learning nlp quora-question-pairs snli tensorflow

Last synced: 06 Nov 2024

https://github.com/thisandagain/troll

Language sentiment analysis and neural networks... for trolls.

javascript moderation neural-network nlp sentiment sentiment-analysis

Last synced: 26 Oct 2024

https://github.com/oswaldoludwig/Seq2seq-Chatbot-for-Keras

This repository contains a new generative model of chatbot based on seq2seq modeling.

chatbot conversational-agents deep-learning dialogue dialogue-generation gan generative-adversarial-network glove keras nlp seq2seq

Last synced: 02 Nov 2024

https://github.com/kyzhouhzau/nlpgnn

1. Use BERT, ALBERT and GPT2 as tensorflow2.0's layer. 2. Implement GCN, GAN, GIN and GraphSAGE based on message passing.

albert albert-ner bert bert-cls bert-ner bilstm-attention gan gcn gin gnn gpt2 graph-classfication graph-convolutional-networks graphsage message-passing nlp tensorflow2 textcnn textgcn tf2

Last synced: 14 Oct 2024

https://github.com/xplip/pixel

Research code for pixel-based encoders of language (PIXEL)

deep-learning deep-neural-networks language-model machine-learning nlp pytorch

Last synced: 14 Nov 2024

https://github.com/davidmigloz/langchain_dart

Build LLM-powered Dart/Flutter applications.

ai dart flutter generative-ai llms nlp

Last synced: 03 Nov 2024

https://github.com/wuba/qa_match

A simple effective ToolKit for short text matching

58 ai deep-learning dssm lstm machine-learning nlp qabot qatools tensorflow

Last synced: 03 Aug 2024

https://github.com/shibing624/dialogbot

dialogbot, provide search-based dialogue, task-based dialogue and generative dialogue model. 对话机器人,基于问答型对话、任务型对话、聊天型对话等模型实现,支持网络检索问答,领域知识问答,任务引导问答,闲聊问答,开箱即用。

chatbot deep-learning dialog dialogbot nlp qa question-answering

Last synced: 12 Nov 2024

https://github.com/machine-learning-apps/Issue-Label-Bot

Code For The Issue Label Bot, an App that automatically labels issues using machine learning, available on the GitHub Marketplace. This is also code for the blog article: "How to automate tasks on GitHub with machine learning for fun and profit"

bigquery bootstrap data-science deep-learning end-to-end-application flask gcp-cloud gharchive github-api-v3 github-app keras kubernetes machine-learning machine-learning-tutorials nlp production-machine-learning tensorflow

Last synced: 25 Oct 2024

https://github.com/enoch3712/ExtractThinker

ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.

ai llm nlp ocr openai python

Last synced: 05 Nov 2024

https://github.com/discopy/discopy

The Python toolkit for computing with string diagrams.

category-theory diagrams nlp quantum-computing

Last synced: 09 Aug 2024

https://github.com/CogStack/OpenGPT

A framework for creating grounded instruction based datasets and training conversational domain expert Large Language Models (LLMs).

chatgpt gpt-4 health healthcare huggingface llm medicine nlp opengpt

Last synced: 03 Aug 2024

https://github.com/yunwei37/covid-19-nlp-vis

使用 flask + pyecharts 搭建的新冠肺炎疫情数据可视化交互分析网站平台,包含疫情数据获取、每日疫情地图、曲线图展示,数据统计分析、态势感知、确诊人数预测分析算法设计、NLP舆情监测等任务(部署在http://covid.yunwei123.tech/)

covid-19 flask maps nlp pyecharts visualization

Last synced: 26 Oct 2024

https://github.com/machine-learning-apps/issue-label-bot

Code For The Issue Label Bot, an App that automatically labels issues using machine learning, available on the GitHub Marketplace. This is also code for the blog article: "How to automate tasks on GitHub with machine learning for fun and profit"

bigquery bootstrap data-science deep-learning end-to-end-application flask gcp-cloud gharchive github-api-v3 github-app keras kubernetes machine-learning machine-learning-tutorials nlp production-machine-learning tensorflow

Last synced: 29 Sep 2024

https://github.com/drahnr/cargo-spellcheck

Checks all your documentation for spelling and grammar mistakes with hunspell and a nlprule based checker for grammar

cargo cargo-plugin cargo-spellcheck grammar grammar-mistakes grammarchecker hacktoberfest hunspell languagetool nlp spellchecker spelling

Last synced: 13 Nov 2024

https://github.com/HIT-SCIR/huozi

活字通用大模型

fine-tuning large-language-models llm nlp

Last synced: 08 Nov 2024

https://github.com/explosion/prodigy-openai-recipes

✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3

annotation-tool few-shot-learning gpt-3 nlp openai openai-api prodigy zero-shot-learning

Last synced: 25 Sep 2024

https://github.com/dpressel/dliss-tutorial

Tutorial for International Summer School on Deep Learning, 2019

deep-learning machine-learning nlp

Last synced: 26 Oct 2024

https://github.com/swhl/ai-competition-collections

AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验帖)

competition cv data-discovery graph-neural-networks knowledge-graph nlp recommender-system speech

Last synced: 15 Nov 2024

https://github.com/asahi417/lm-question-generation

Multilingual/multidomain question generation datasets, models, and python library for question generation.

bart nlp pytorch question-answering question-generation t5

Last synced: 04 Nov 2024

https://github.com/cli99/llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

analysis deep-learning language-model language-models machine-learning nlp transformers

Last synced: 06 Aug 2024

https://github.com/xiangking/ark-nlp

A private nlp coding package, which quickly implements the SOTA solutions.

bert nlp transfomer

Last synced: 06 Nov 2024

https://github.com/mcs07/chemdataextractor

Automatically extract chemical information from scientific documents

chemistry information-extraction natural-language-processing nlp python text-mining

Last synced: 14 Nov 2024

https://github.com/JetRunner/BERT-of-Theseus

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

bert glue model-compression nlp transformers

Last synced: 03 Nov 2024

https://github.com/SimGus/Chatette

A powerful dataset generator for Rasa NLU, inspired by Chatito

botkit chatbot chatbots chatito cli dataset-generation nlg nlp nlu parsing python rasa rasa-nlu sentence

Last synced: 31 Oct 2024

https://github.com/UKPLab/gpl

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577

bert domain-adaptation information-retrieval nlp transformers vector-search

Last synced: 05 Aug 2024

https://github.com/qiangsiwei/bert_distill

BERT distillation(基于BERT的蒸馏实验 )

bert classification distillation nlp

Last synced: 02 Nov 2024

https://github.com/xkzhangsan/xk-time

xk-time 是时间转换,时间计算,时间格式化,时间解析,日历,时间cron表达式和时间NLP等的工具,使用Java8(JSR-310),线程安全,简单易用,多达70几种常用日期格式化模板,支持Java8时间类和Date,轻量级,无第三方依赖。

calendar cron cron-java8 date datetimeformatter-formatter dateutil formatter java jsr-310 localdate localdatetime nlp time timeconvertion

Last synced: 04 Aug 2024

https://github.com/natasha/yargy

Rule-based facts extraction for Russian language

earley-parser information-extraction morphology nlp python russian tomita tomita-parser

Last synced: 10 Nov 2024

https://github.com/graykode/ai-docstring

Visual Studio Code extension to quickly generate docstrings for python functions using AI(NLP) technology.

bert code-summarization docstrings nlp vs-code-extenstion

Last synced: 04 Nov 2024

https://github.com/alibaba-edu/simple-effective-text-matching-pytorch

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

deep-learning nlp pytorch quora-question-pairs snli

Last synced: 06 Nov 2024

https://github.com/GaoQ1/rasa_nlu_gq

turn natural language into structured data(支持中文,自定义了N种模型,支持不同的场景和任务)

bert bilstm-idcnn jieba natural-language nlp nlu rasa rasa-nlu rasa-nlu-gao tensorflow

Last synced: 02 Nov 2024

https://github.com/dair-ai/nlp_newsletter

📰Natural language processing (NLP) newsletter

deep-learning machine-learning nlp

Last synced: 10 Nov 2024

https://github.com/hankcs/multi-criteria-cws

Simple Solution for Multi-Criteria Chinese Word Segmentation

bi-lstm-crf cws dynet multi-criteria-cws nlp

Last synced: 09 Nov 2024

https://github.com/phospho-app/phospho

Text analytics for LLM apps. PostHog for prompts. Extract evaluations, intents and events from text messages. phospho leverages LLM (OpenAI, MistralAI, Ollama, etc.)

ai analytics generative-ai llm nextjs nlp ollama python self-hosted typescript

Last synced: 13 Oct 2024

https://github.com/kevinlu1248/pyate

PYthon Automated Term Extraction

ai nlp symbolic-ai term-extraction

Last synced: 28 Sep 2024

https://github.com/undertheseanlp/nlp-vietnamese-progress

Repository to track the progress in Vietnamese Natural Language Processing, including the datasets and the current state-of-the-art for the most common Vietnamese NLP tasks.

nlp vietnamese-nlp

Last synced: 11 Nov 2024

https://github.com/gagolews/stringi

Fast and portable character string processing in R (with the Unicode ICU)

icu icu4c natural-language-processing nlp r regex regexp string-manipulation stringi stringr text text-processing tidy-data unicode

Last synced: 26 Oct 2024

https://github.com/hankcs/hanlp-lucene-plugin

HanLP中文分词Lucene插件,支持包括Solr在内的基于Lucene的系统

chinese-text-segmentation hanlp lucene nlp solr traditional-chinese

Last synced: 26 Oct 2024

https://github.com/daac-tools/vibrato

🎤 vibrato: Viterbi-based accelerated tokenizer

japanese morphological-analysis nlp rust segmentation tokenization tokenizer

Last synced: 07 Nov 2024

https://github.com/gentaiscool/code-switching-papers

A curated list of research papers and resources on code-switching

bilingual code-mixed code-mixing code-switch code-switching language nlp papers research speech

Last synced: 08 Nov 2024

https://github.com/jameshwade/gpttools

gpttools extends gptstudio for package development to help you document code, write tests, or even explain code

chatgpt nlp openai package-development rstats rstudio-addin

Last synced: 09 Nov 2024

https://github.com/sekwiatkowski/Komputation

Komputation is a neural network framework for the Java Virtual Machine written in Kotlin and CUDA C.

artificial-intelligence convolutional-neural-networks cuda framework gpu jvm kotlin machine-learning neural-networks nlp nvidia recurrent-neural-networks seq2seq

Last synced: 02 Nov 2024