Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

https://github.com/brolin59/trnlp

TÜRKÇE İÇİN DOĞAL DİL İŞLEME ARAÇLARI

dogal-dil-isleme morfoloji morfolojik-analiz nlp turkish-nlp turkish-sentence-tokenizer

Last synced: 02 Aug 2024

https://github.com/microsoft/browsecloud

A web app to create and browse text visualizations for automated customer listening.

bayesian-networks counting-grids nlp text-classification text-processing visualization

Last synced: 05 Aug 2024

https://github.com/kevincobain2000/jProcessing

Japanese Natural Langauge Processing Libraries

japanese nlp word-sense-disambiguation wsd

Last synced: 31 Jul 2024

https://github.com/erfanzar/EasyDeL

EasyDeL is an OpenSource Library to make your training faster and more Optimized With cool Options for training and serving Both in Python And Mojo🔥

easydel flax gpt jax machine-learning mojo nlp optax pytorch transformers

Last synced: 03 Aug 2024

https://github.com/RocketChat/hubot-natural

Natural Language Processing Chatbot for RocketChat

chatbot coffeescript hubot hubot-natural nlp nodejs rocketchat rocketchat-hubot

Last synced: 31 Jul 2024

https://github.com/EmilHvitfeldt/R-text-data

List of textual data sources to be used for text mining in R

data-science nlp rstats text-analysis text-analytics-in-r text-mining tidytext

Last synced: 05 Aug 2024

https://github.com/CLUEbenchmark/DataCLUE

DataCLUE: 数据为中心的NLP基准和工具包

ai chinese classification-algorithm data-centric human-in-the-loop nlp

Last synced: 03 Aug 2024

https://github.com/emres/turkish-deasciifier

Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs

deasciifier diacritics diacritics-reconstruction diacritics-restoration nlp nlp-library python turkish turkish-nlp

Last synced: 02 Aug 2024

https://github.com/kudoai/duckduckgpt

🐤 DuckDuckGo add-on that brings the magic of ChatGPT to search results (powered by GPT-4!)

ai artificial-intelligence bot chatbot chatgpt chatgpt3 ddg duckduckgo gpt gpt-3 gpt-4 greasemonkey javascript machine-learning nlp openai search userscripts web

Last synced: 31 Jul 2024

https://github.com/KudoAI/duckduckgpt

🐤 DuckDuckGo add-on that brings the magic of ChatGPT to search results (powered by GPT-4!)

ai artificial-intelligence bot chatbot chatgpt chatgpt3 ddg duckduckgo gpt gpt-3 gpt-4 greasemonkey javascript machine-learning nlp openai search userscripts web

Last synced: 31 Jul 2024

https://github.com/thunlp/OpenBackdoor

An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)

backdoor-attacks nlp

Last synced: 03 Aug 2024

https://github.com/Planeshifter/text-miner

text mining utilities for Node.js

nlp text-mining

Last synced: 02 Aug 2024

https://github.com/ianycxu/GCN-with-BERT

Graph Convolutional Networks (GCN) with BERT for Coreference Resolution Task [Pytorch][DGL]

bert bert-model coreference-resolution gcn gnn graph-convolutional-networks graph-neural-networks nlp pytorch

Last synced: 01 Aug 2024

https://github.com/MxDkl/pls

CLI to convert natural language to terminal commands

chatgpt cli llm nlp openai terminal

Last synced: 01 Aug 2024

https://github.com/datquocnguyen/RDRPOSTagger

A fast and accurate POS and morphological tagging toolkit (EACL 2014)

java nlp part-of-speech-tagger pos-tagger pos-tagging python3

Last synced: 31 Jul 2024

https://github.com/arian-askari/ChatGPT-RetrievalQA

A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on real human responses.

ai chatgpt chatgpt-information-retrieval chatgpt-ir data-augmentation dataset deep-learning gpt-3 gpt2 gpt3 information-retrieval information-retrieval-chatgpt ir ir-chatgpt machine-learning nlp openai python sequence-to-sequence text-retrieval

Last synced: 31 Jul 2024

https://github.com/eisenjulian/nlp_estimator_tutorial

Educational material on using the TensorFlow Estimator framework for text classification

estimator nlp tensorflow text-classification

Last synced: 03 Sep 2024

https://github.com/HKUST-KnowComp/MnemonicReader

A PyTorch implementation of Mnemonic Reader for the Machine Comprehension task

document-reader machine-comprehension mnemonic-reader nlp pytorch r-net squad

Last synced: 07 Aug 2024

https://github.com/hsinyuan-huang/FusionNet-NLI

An example for applying FusionNet to Natural Language Inference

deep-learning machine-comprehension nlp

Last synced: 07 Aug 2024

https://github.com/yagays/ja-timex

自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器

datetime nlp python regular-expression temporal time-parsing

Last synced: 01 Aug 2024

https://github.com/A-baoYang/alpaca-7b-chinese

Finetune LLaMA-7B with Chinese instruction datasets

alpaca chatgpt deep-learning fine-tuning instruction-following llm lora nlp pytorch

Last synced: 30 Jul 2024

https://github.com/Living-with-machines/DeezyMatch

A Flexible Deep Learning Approach to Fuzzy String Matching

deep-learning hacktoberfest hut23 hut23-96 machine-learning natural-language-processing nlp

Last synced: 31 Jul 2024

https://github.com/farach/huggingfaceR

Hugging Face state-of-the-art models in R

huggingface nlp r rstats

Last synced: 02 Aug 2024

https://github.com/comtravo/ctparse

Parse natural language time expressions in python

machine-learning nlp python python-library regular-expression time-parsing

Last synced: 02 Aug 2024

https://github.com/proycon/clam

Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your command line application, its input, output and parameters, and CLAM wraps around your application to form a fully fledged RESTful webservice.

nlp python rest webservice wrapper

Last synced: 01 Aug 2024

https://github.com/omarsar/pytorch_notebooks

A collection of PyTorch notebooks for learning and practicing deep learning

ai deeplearning machine-learning nlp notebook pytorch

Last synced: 01 Aug 2024

https://github.com/km1994/recommendation_advertisement_search

整理自然语言处理、推荐系统、搜索引擎等AI领域的入门笔记,论文学习笔记和面试资料(关于NLP那些你不知道的事、关于推荐系统那些你不知道的事、NLP百面百搭、推荐系统百面百搭、搜索引擎百面百搭)

advertisement nlp recommendation-system search-engine

Last synced: 02 Aug 2024

https://github.com/RevanthRameshkumar/CRD3

The repo containing the Critical Role Dungeons and Dragons Dataset.

acl2020 dataset dialogue-systems machine-learning nlp storytelling summarization

Last synced: 01 Aug 2024

https://github.com/AlekseyKorshuk/optimum-transformers

Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.

benchmark huggingface infinity natural-language-processing nlp onnx onnxruntime optimum pipeline transformers

Last synced: 07 Aug 2024

https://github.com/patil-suraj/onnx_transformers

Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.

inference nlp onnx onnxruntime transformers

Last synced: 02 Aug 2024

https://github.com/explosion/spacy-dev-resources

💫 Scripts, tools and resources for developing spaCy

natural-language-processing nlp python spacy

Last synced: 07 Aug 2024

https://github.com/alisafaya/Arabic-BERT

Arabic edition of BERT pretrained language models

arabic arabic-nlp bert bert-language-models language-model nlp transformer

Last synced: 03 Aug 2024

https://github.com/cosmoquester/2021-dialogue-summary-competition

[2021 훈민정음 한국어 음성•자연어 인공지능 경진대회] 대화요약 부문 알라꿍달라꿍 팀의 대화요약 학습 및 추론 코드를 공유하기 위한 레포입니다.

dialogue huggingface-transformers nlp pytorch-lightning summarization

Last synced: 02 Aug 2024

https://github.com/hliyan/jarvis

J.A.R.V.I.S - Just Another Rudimentary Verbal Instruction Shell

chatbot cli nlp

Last synced: 31 Jul 2024

https://github.com/minhpqn/nlp_100_drill_exercises

100 bài luyện tập xử lý ngôn ngữ tự nhiên

dependency-parsing exercises nlp nlp-tool

Last synced: 01 Aug 2024

https://github.com/proycon/colibri-core

Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` whi ch allows you to build, view, manipulate and query pattern models.

c-plus-plus computational-linguistics corpus library linguistics ngram ngrams nlp pattern-recognition python skipgram text-processing

Last synced: 31 Jul 2024

https://github.com/grid-parity-exchange/Egret

Tools for building power systems optimization problems

energy-system milp minlp nlp optimization power powerflow python snl-applications snl-science-libs

Last synced: 03 Aug 2024

https://github.com/suminb/hanja

한글, 한자 라이브러리

hangul hanja nlp python

Last synced: 03 Aug 2024

https://github.com/mallahyari/llm-hub

A curated collection of interesting applications, repos, and tutorials using large language models (LLM) like GPT-3

chatgpt deep-learning gpt-3 gpt-4 language-model llms nlp openai

Last synced: 31 Jul 2024

https://github.com/cocoa-ai/SentimentCoreMLDemo

😃 iOS11 demo application for sentiment polarity analysis.

coreml coreml-models ios machine-learning nlp sentiment-analysis sentiment-polarity swift swift4

Last synced: 03 Aug 2024

https://github.com/nullnull/simstring

A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.

nlp nlp-library python

Last synced: 05 Aug 2024

https://github.com/dmotz/emdash

📚🧙‍♂️ Wisdom indexer — use AI to organize text snippets so you can actually remember & learn from what you read

ai books ebook ebooks elm embeddings epub kindle kindle-clippings kindle-highlights literature ml nlp notes reading semantic-search

Last synced: 07 Sep 2024

https://github.com/dlab-berkeley/R-Deep-Learning

Workshop (6 hours): Deep learning in R using Keras. Building & training deep nets, image classification, transfer learning, text analysis, visualization

biomedical cloudml deep-learning keras nlp tensorflow

Last synced: 02 Aug 2024

https://github.com/graykode/toeicbert

TOEIC(Test of English for International Communication) solving using pytorch-pretrained-BERT model.

ai bert deep-learning lm mask nlp pytorch pytorch-pretrained toeic

Last synced: 01 Aug 2024

https://github.com/minibikini/paasaa

🔤 Natural language detection for Elixir

detect-language elixir language language-detection nlp

Last synced: 01 Aug 2024

https://github.com/ZhixiuYe/Intra-Bag-and-Inter-Bag-Attentions

Code for NAACL 2019 paper: Distant Supervision Relation Extraction with Intra-Bag and Inter-Bag Attentions

deeplearning distant-supervision nlp pytorch relation-extraction

Last synced: 01 Aug 2024

https://github.com/bayeru/chat-to-your-database

Chat to your database with AI. An experimental app to test the abilities of LLMs to query SQL databases using natural language.

chatgpt chatgpt-app database langchain langchain-typescript llm llms mysql natural-language-processing nlp openai postgres sql sqlite

Last synced: 10 Aug 2024

https://github.com/winkjs/wink-nlp-utils

NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.

bag-of-words natural-language-processing ngrams nlp phonetize sentence-boundary-detection stem stop-words tokenize

Last synced: 31 Jul 2024

https://github.com/ClipsAI/clipsai

Clips AI is an open-source Python library that automatically converts long videos into clips.

computer-vision nlp video-processing

Last synced: 01 Aug 2024

https://github.com/johnbumgarner/wordhoard

This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.

antonyms bag-of-words definitions dictionary homophones hypernyms hyponyms lexicon nlp python python3 synonyms text-analysis textual-analysis wordlists wordnet wordnets wordsearch

Last synced: 04 Aug 2024

https://github.com/DFKI-NLP/TRE

[AKBC 19] Improving Relation Extraction by Pre-trained Language Representations

information-extraction machine-learning multi-task-learning nlp relation-extraction transformer

Last synced: 01 Aug 2024

https://github.com/Nipun1212/Claude_api

Claude_api is a Python package that provides a convenient way to interact with Claude 2 from Anthropic.

anthropic anthropic-claude claude claude-ai claude-api nlp

Last synced: 02 Aug 2024

https://github.com/proycon/flat

FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm.

annotation-tool clariah clarin computational-linguistics folia javascript linguistic-annotation-framework linguistics nlp python web-application

Last synced: 03 Aug 2024

https://github.com/ahmedbesbes/media-agent

Scrape data from social media and chat with it using Langchain

langchain large-language-models llms nlp nlproc python tweepy

Last synced: 01 Aug 2024

https://github.com/ahmedbesbes/twitter-agent

Scrape data from social media and chat with it using Langchain

langchain large-language-models llms nlp nlproc python tweepy

Last synced: 22 Aug 2024

https://github.com/textlint-rule/sentence-splitter

Split {Japanese, English} text into sentences.

english japanese javascript nlp segement sentence

Last synced: 04 Aug 2024

https://mcgill-nlp.github.io/weblinx/

WebLINX is a benchmark for building web navigation agents with conversational capabilities

agent agents computer-vision llm multimodal navigation nlp web

Last synced: 03 Aug 2024

https://github.com/pooya-mohammadi/deep_utils

An open-source toolkit which is full of handy functions, including the most used models and utilities for deep-learning practitioners!

augmentation coco computer-vision cutmix deep-learning face-detection face-recognition machine-learning modelcheckpoint nlp object-detection python pytorch senet tensorflow utils vggface2 yolov5

Last synced: 02 Aug 2024

https://github.com/SunLemuria/OpenGPTAndBeyond

Open efforts to implement ChatGPT-like models and beyond.

alpaca chatbot chatglm chatgpt large-language-models llm nlp openai opensource

Last synced: 01 Aug 2024

https://github.com/SergeyShk/ruTS

Библиотека для извлечения статистик из текстов на русском языке.

computational-linguistics natural-language-processing nlp russian-specific text-analytics

Last synced: 07 Aug 2024

https://github.com/awslabs/speech-representations

Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)

deep-learning nlp speech-recognition

Last synced: 30 Jul 2024

https://github.com/deepset-ai/haystack-demos

Fully working applications that demonstrate how to use Haystack to implement common NLP use cases

nlp python question-answering semantic-search

Last synced: 01 Aug 2024

https://github.com/kudoai/bravegpt

🦁 Brave Search add-on that brings the magic of ChatGPT to search results (powered by GPT-4!)

ai artificial-intelligence brave brave-search chat chatbot chatgpt chatgpt3 gpt gpt-3 gpt-4 greasemonkey javascript machine-learning nlp openai search userscripts web websearch

Last synced: 31 Jul 2024

https://github.com/rerender2021/echo

A simple asr translator powered by avernakis react.

asr ave avernakis nlp offline translation

Last synced: 01 Aug 2024

https://github.com/KudoAI/bravegpt

🦁 Brave Search add-on that brings the magic of ChatGPT to search results (powered by GPT-4!)

ai artificial-intelligence brave brave-search chat chatbot chatgpt chatgpt3 gpt gpt-3 gpt-4 greasemonkey javascript machine-learning nlp openai search userscripts web websearch

Last synced: 01 Aug 2024

https://github.com/bnosac/ruimtehol

R package to Embed All the Things! using StarSpace

classification embeddings natural-language-processing nlp r similarity starspace text-mining

Last synced: 31 Jul 2024

https://github.com/leehanchung/lora-instruct

Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA

agi falcon gpt llama llm lora mpt nlp redpajama

Last synced: 09 Aug 2024

https://github.com/ben-aaron188/rgpt3

Making requests from R to the GPT models

chatgpt gpt3 llm nlp openai r

Last synced: 02 Aug 2024

https://chats-lab.github.io/KokoMind/

KokoMind: Can LLMs Understand Social Interactions?

chatgpt deep-learning gpt-4 language-model neural-network nlp

Last synced: 03 Aug 2024

https://github.com/lonePatient/BERT-chinese-text-classification-pytorch

This repo contains a PyTorch implementation of a pretrained BERT model for text classification.

bert chinese chinese-text-classification nlp pytorch text-classification

Last synced: 01 Aug 2024

https://github.com/harunzafer/nuve

Natural Language Processing Library for Turkish in C#

ngram-extraction nlp nuve turkish

Last synced: 02 Aug 2024

https://github.com/JDongian/python-jamo

Hangul syllable decomposition and synthesis using jamo.

hangul korean nlp python

Last synced: 03 Aug 2024

https://github.com/MoritzLaurer/GPT-google-sheets

Code and documentation for running generative LLMs like ChatGPT or GPT4 in google sheets without any coding knowledge. Transform unstructured text to structured data.

chatgpt gpt3 gpt4 nlp nlp-machine-learning

Last synced: 03 Aug 2024

https://github.com/adhikary97/Sharetape-Open-Source

Script that takes any long form video or podcast and outputs clips for social media

instagram-reels nlp podcast tiktok video-clipper video-clips youtube

Last synced: 04 Aug 2024

https://github.com/jmisilo/clip-gpt-captioning

CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.

computer-vision cv deep-learning image-caption image-caption-generator image-captioning machine-learning nlp python pytorch

Last synced: 01 Aug 2024

https://github.com/ropensci-archive/monkeylearn

:no_entry: ARCHIVED :no_entry: Accesses the Monkeylearn API for Text Classifiers and Extractors

classifier extractor monkeylearn nlp nlp-machine-learning peer-reviewed r r-package rstats

Last synced: 30 Jul 2024

https://github.com/IlyaGusev/tgcontest

Telegram Data Clustering contest solution by Mindful Squirrel

classification clustering cpp data-science document-similarity fasttext machine-learning nlp

Last synced: 01 Aug 2024

https://github.com/deeplearningturkiye/kelime_kok_ayirici

Derin Öğrenme Tabanlı - seq2seq - Türkçe için kelime kökü bulma web uygulaması - Turkish Stemmer (tr_stemmer)

flask keras nlp python stemmer

Last synced: 02 Aug 2024

https://github.com/hpcaitech/CachedEmbedding

A memory efficient DLRM training solution using ColossalAI

colossal-ai deep-learning dlrm embeddings nlp pytorch recommandation-system

Last synced: 01 Aug 2024