Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

https://github.com/shrebox/Personified-Chatbot

A personified chatbot responding to a query based on the answering pattern of Dr. APJ Abdul Kalam using Information Retrieval, Natural Language Processing, and Deep Learning techniques.

apj-abdul-kalam chatbot deep-learning information-retrieval lstm natural-language-processing nlp ranking-algorithm seq2seq-chatbot seq2seq-model summarization word2vec

Last synced: 11 Nov 2024

https://github.com/kampersanda/tongrams-rs

Rust library providing fast language model queries in compressed space

compression elias-fano language-model ngrams nlp trie

Last synced: 11 Nov 2024

https://github.com/liebeck/spacy-iwnlp

German lemmatization with IWNLP as extension for spaCy

nlp spacy spacy-extension spacy-pipeline

Last synced: 14 Oct 2024

https://github.com/quickgrid/AI-Resources

Research Paper Summaries, Setup & Performance Notes, Resource Links on AI, Deep Learning, NLP, Computer Vision for my learning.

ai ai-notes ai-research blender computer-vision deep-learning nlp paper-summaries papers research-paper research-paper-summaries

Last synced: 02 Nov 2024

https://github.com/korpling/pepper

A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used stand-alone as a command line interface, or be integrated as an API into other software products.

annotations converter format java linguistic-formats linguistics nlp pepper

Last synced: 15 Nov 2024

https://github.com/botpress/nlu

This repo contains every ML/NLU related code written by Botpress in the NodeJS environment. This includes the Botpress Standalone NLU Server.

machine nlp nlu nodejs

Last synced: 05 Nov 2024

https://github.com/quickgrid/ai-resources

Research Paper Summaries, Setup & Performance Notes, Resource Links on AI, Deep Learning, NLP, Computer Vision for my learning.

ai ai-notes ai-research blender computer-vision deep-learning nlp paper-summaries papers research-paper research-paper-summaries

Last synced: 07 Aug 2024

https://github.com/mgechev/gently-js

Module which returns the offensive words in a string. A soft reminder to be nicer to each other ❤️.

code-of-conduct nlp wordnet

Last synced: 22 Oct 2024

https://github.com/davidsvy/Neural-Scam-Artist

Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.

dataset deduplication fine-tuning fraud gpt2 huggingface lsh minhash nlp pytorch readability scam transformer web-scraping

Last synced: 05 Aug 2024

https://github.com/i008/nyyelp

predicting yelp review rating using recurrent neural networks

deep-learning nlp python recurrent-neural-networks yelp-dataset

Last synced: 13 Nov 2024

https://github.com/code-kern-ai/sequence-learn

With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier.

machine-learning named-entity-recognition natural-language-processing ner nlp python

Last synced: 10 Nov 2024

https://github.com/hscspring/ALL4AI

AI Related Tools/Projects

ai jupyter linux machine-learning nlp python ssh toolbox

Last synced: 07 Nov 2024

https://github.com/4ai/agn

Official Code for Merging Statistical Feature via Adaptive Gate for Improved Text Classification (AAAI2021)

bert deep-learning nlp text-classification

Last synced: 10 Nov 2024

https://github.com/hscspring/all4ai

AI Related Tools/Projects

ai jupyter linux machine-learning nlp python ssh toolbox

Last synced: 28 Oct 2024

https://github.com/dbklim/russian_subtitles_dataset

Preprocessing of the dataset of 347 subtitles for the TV series (thanks to Taiga Corpus) to build a word2vec model, JamSpell model, neural network training, chat bot training or in any other NLP task.

bot cnn corpus dataset lstm machine-learning ml natural-language-processing nlp nlu rnn russian subtitles text text-analysis text-processing word2vec

Last synced: 11 Nov 2024

https://github.com/generall/oneshotnlp

PyTorch text matching models implementation for One-Shot Named Entity Linking

neural-network nlp

Last synced: 14 Oct 2024

https://github.com/amsqr/NaiveSumm

NaiveSumm is a naive summarization approach based on Luhn1958 work "The Automatic Creation of Literature Abstracts" It uses the frequencies of words in the document in order to calculate and extract the sentences that include the most frequent words.

natural-language-processing nlp python summarization

Last synced: 31 Oct 2024

https://github.com/kaustubhhiware/c0derunr

An attempt at a cleaner UI for online IDE's: http://c0derunr.herokuapp.com

django ide nlp python webapp

Last synced: 22 Oct 2024

https://github.com/jawahar273/practNLPTools-lite

Practical Natural Language Processing Tools for Humans is build on the top of Senna Natural Language Processing (NLP) predictions: part-of-speech (POS) tags, chunking (CHK), name entity recognition (NER), semantic role labeling (SRL) and syntactic parsing (PSG) with skip-gram all in Python and still more features will be added. The website give is for downlarding Senna tool

nlp practnlptools3 senna senna-nlp

Last synced: 07 Aug 2024

https://github.com/hpprc/bert-classification-tutorial-2024

【2024年版】BERTによるテキスト分類

bert huggingface nlp

Last synced: 27 Oct 2024

https://github.com/AmrHendy/programming-language-translator

An easy way to use the released TransCoder by Facebook AI Research to convert code from one programming language to another using unsupervised neural machine translation (NMT) systems that use deep-learning to translate text from one natural language to another and is trained only on monolingual source data.

machine-translation nlp programming-language transcoder transformer unsupervised-deep-learning unsupervised-translation

Last synced: 06 Aug 2024

https://github.com/proycon/gecco

Generic Environment for Context-Aware Correction of Orthography

nlp python spelling-correction

Last synced: 08 Nov 2024

https://github.com/senthilchandrasegaran/textplorer

Visual analytics application for qualitative text analysis

nlp text-visualization visual-analytics

Last synced: 27 Oct 2024

https://github.com/uetchy/homebrew-nlp

🍺 a Homebrew keg that specialized in Natural Language Processing.

homebrew natural-language-processing nlp

Last synced: 18 Oct 2024

https://github.com/KGCP/MEL-TNNT

Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)

metadata-extraction named-entity-recognition natural-language-processing nlp nlp-ner pipeline

Last synced: 03 Aug 2024

https://github.com/pfnet-research/vat_nmt

Implementation of "Effective Adversarial Regularization for Neural Machine Translation", ACL 2019

acl2019 adversarial neural-machine-translation nlp nmt vat

Last synced: 15 Nov 2024

https://github.com/adriangonz/statistical-nlp-17

Repository for group 17 on the Statistical Natural Language Processing module at UCL

matching-networks nlp pytorch

Last synced: 22 Oct 2024

https://github.com/KGCP/MEL-TNNT/

Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)

metadata-extraction named-entity-recognition natural-language-processing nlp nlp-ner pipeline

Last synced: 14 Aug 2024

https://github.com/stanford-oval/ovalchat

OVALChat is a customizable Web app aimed at conducting user studies with chatbots

chatbots crowdsourcing nextjs nlp react tailwindcss

Last synced: 06 Nov 2024

https://github.com/adamspannbauer/lexrankr

Extractive Text Summariztion with lexRankr (an R package implementing the LexRank algorithm)

lexrank lexrank-algorithm nlp r r-package rstat

Last synced: 27 Oct 2024

https://github.com/kalebu/desktop-chatbot-app

A python knowledge-based chatbot application built with Tkinter

chatbot chatbot-application data-science nlp nlp-projects python-tanzania python3 tanzania

Last synced: 09 Nov 2024

https://github.com/TianyuZhuuu/CHIP2018

CHIP2018问句匹配大赛 Rank6解决方案

nlp pytorch sentence-similarity

Last synced: 06 Nov 2024

https://github.com/code-kern-ai/embedders

With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity search between texts, information extraction such as named entity recognition, or basic text classification.

classification machine-learning named-entity-recognition natural-language-processing ner nlp python representation-learning similarity-search

Last synced: 10 Nov 2024

https://github.com/appcoda/naturallanguageprocessing

A Quick Demo for NLP in Swift 4

demo nlp playgrounds swift swift4

Last synced: 15 Nov 2024

https://github.com/pyurbans/urbans

A tool for translating text from source grammar to target grammar (context-free) with corresponding dictionary.

artificial-intelligence data-science machine-translation nlp python

Last synced: 10 Nov 2024

https://github.com/tomaarsen/ttstextnormalization

Convert English text from written expressions into spoken forms

competition nlp normalization spoken-forms text-normalization tts

Last synced: 08 Nov 2024

https://github.com/chinnichaitanya/spellwise

🚀 Extremely fast fuzzy matcher & spelling checker in Python!

caverphone editex levenshtein natural-language-processing nlp spellcheck spelling-correction trie typox

Last synced: 27 Oct 2024

https://github.com/breezedeus/loveshare

breezedeus的各种分享

cnocr cnstd cv deep-learning llm nlp ocr pix2text

Last synced: 15 Nov 2024

https://github.com/deepset-ai/haystack-search-pipeline-streamlit

🚀 Template Haystack Search Application with Streamlit

haystack nlp python streamlit

Last synced: 06 Nov 2024

https://github.com/doccano/spacy-partial-tagger

A simple library for training named entity recognition model from partially annotated data

named-entity-recognition natural-language-processing nlp spacy weak weak-supervision weakly-supervised-learning

Last synced: 10 Oct 2024

https://github.com/yuvalpinter/nytwit

New York Times Word Innovation Types dataset

computational-linguistics corpus dataset news nlp

Last synced: 27 Oct 2024

https://github.com/twardoch/split-markdown4gpt

A Python tool for splitting large Markdown files into smaller sections based on a specified token limit. This is particularly useful for processing large Markdown files with GPT models, as it allows the models to handle the data in manageable chunks.

data-preprocessing gpt gpt-3 gpt-35-turbo gpt-35-turbo-16k gpt-4 markdown markdown-processing mistletoe natural-language-processing nlp openai openai-gpt python split-text summarization text-analysis text-processing text-summarization text-tokenization

Last synced: 27 Oct 2024

https://github.com/li-plus/rouge-metric

A Python wrapper of the official ROUGE-1.5.5.pl script and a re-implementation of full ROUGE metrics.

machine-learning nlp pypi python rouge rouge-metric summarization

Last synced: 06 Nov 2024

https://github.com/banyh/PyStanfordNLP

A Python Wrapper of Stanford Chinese Segmenter

nlp postagging python-wrapper stanford stanford-chinese-segmenter

Last synced: 14 Nov 2024

https://github.com/explosion/spacy-benchmarks

💫 Runtime performance comparison of spaCy against other NLP libraries

benchmarking benchmarks natural-language-processing nlp spacy

Last synced: 25 Sep 2024

https://github.com/winkjs/wink-porter2-stemmer

Javascript Implementation of Porter Stemmer Algorithm V2 by Dr Martin F Porter

natural-language-processing nlp porter-stemmer-algorithm porter-stemmer-v2 stemmer

Last synced: 09 Nov 2024

https://github.com/dpressel/arcs-py

Arc-Eager and Arc-Hybrid Greedy Dependency Parser with Dynamic Oracle in Python (with no Dependencies!)

nlp nlp-dependency-parsing

Last synced: 28 Oct 2024

https://github.com/lkstrp/newspaper-scraper

The all-in-one Python package for seamless newspaper article indexing, scraping, and processing – supports public and premium content!

news newspaper nlp parser scraper

Last synced: 07 Nov 2024

https://github.com/revdotcom/words2num

Convert words to numbers

inverse-text-normalization nlp

Last synced: 11 Nov 2024

https://github.com/yhy1117/x-mixup

Implementation of ICLR 2022 paper "Enhancing Cross-lingual Transfer by Manifold Mixup".

cross-lingual-transfer manifold-mixup nlp

Last synced: 28 Aug 2024

https://github.com/fer-aguirre/pmdm

Political Misogynistic Discourse Monitor team from the 2021 JournalismAI Collab Challenges

nlp social-network-analysis text-classification

Last synced: 05 Nov 2024

https://github.com/rileynwong/rpi-poetry-generator

Poetry theremin: use Raspberry Pi with hardware sensors to generate poetry using NLP techniques, based on physical light and distance conditions

distance-sensor generative-art generative-poetry generative-text hardware interactive interactive-art interactive-text-generation light-sensor natural-language-processing nlp nltk poetry python raspberry-pi rpi sensors sentiment-analysis

Last synced: 14 Oct 2024

https://github.com/percevalw/metanno

Annotator building tool for Jupyter

annotator customizable jupyter modular nlp

Last synced: 08 Nov 2024

https://github.com/ahammadmejbah/ahammadmejbah

Data Science || Machine Learning || Deep Learning || Computer Vision || NLP Enthusiast Talks about #datascience, #deeplearning, #dataanalytics, #machinelearning, and #machinelearningalgorithms

artificial-intelligence computer-vision data-science deep-learning machine-learning nlp python

Last synced: 11 Nov 2024

https://github.com/richardlitt/thesis

My thesis on "Open Source Code and Low Resource Languages" for an MSc in Language Science and Technology at Saarland University

dissertation endangered-languages low-resource-languages lrl nlp nlproc saarland saarland-university thesis

Last synced: 21 Oct 2024

https://github.com/joshdevins/demo-es-lang-ident

Demo: Elasticsearch Language Identification

demo elasticsearch language-identification nlp search

Last synced: 23 Oct 2024

https://github.com/gmontamat/poor-mans-transformers

Implement Transformers (and Deep Learning) from scratch in NumPy

deep-learning from-scratch machine-learning ml-framework neural-network nlp transformers

Last synced: 30 Oct 2024

https://github.com/contextlab/abstract2paper

Auto-generate an entire paper from a prompt or abstract using NLP

auto-text gpt-neo nlp notebook-jupyter text-generation

Last synced: 06 Nov 2024

https://github.com/vishnunkumar/doc_transformers

Document processing using transformers

ai ml nlp ocr

Last synced: 16 Nov 2024

https://github.com/liyucheng09/llm-compressive

Longitudinal Evaluation of LLMs via Data Compression

benchmark evaluation llm llms nlp

Last synced: 30 Oct 2024

https://github.com/artitw/bert_qa

Accelerating the development of question-answering systems based on BERT and TF 2.0

artificial-intelligence bert machine-learning natural-language-processing natural-language-understanding nlp

Last synced: 28 Oct 2024

https://github.com/azu/nlp-pattern-match

Natural Language pattern matching library for JavaScript.

english japanese javascript morphological-analysis nlcst nlp pos

Last synced: 01 Nov 2024

https://github.com/bramvanroy/astred

An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instance useful for comparing a translation with the original text, to find differences and similarities between two different translations, or to see how a machine translation differs from a reference translation.

alignment linguistics nlp parallel-corpus parsing spacy stanza translation

Last synced: 14 Oct 2024

https://github.com/arbox/wlapi

Ruby based API for the project Wortschatz Leipzig.

computational-linguistics natural-language-processing nlp ruby rubynlp

Last synced: 15 Nov 2024

https://github.com/wetneb/pynif

A small Python library for NLP Interchange Format (NIF) for NER(D) systems

entity-linking gerbil named-entity-recognition nif nlp python

Last synced: 28 Oct 2024

https://github.com/proycon/deepfrog

An NLP-suite powered by deep learning

deep-learning deep-neural-networks dutch folia frog nlp transformers

Last synced: 08 Nov 2024

https://github.com/cmccomb/rust-stop-words

Common stop words in a variety of languages

languages natural-language-procressing nlp nltk rust-crate stopwords

Last synced: 12 Oct 2024

https://github.com/alexcg1/easy_text_generator

Generate text from machine-learning models right in your browser

machine-learning nlp python streamlit

Last synced: 27 Oct 2024

https://github.com/hpprc/defsent

DefSent: Sentence Embeddings using Definition Sentences

bert natural-language-processing nlp transformers

Last synced: 27 Oct 2024

https://github.com/wassname/phoneme2grapheme

Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")

cmudict deep-learning deeplearning machine-learning nlp pronunciation spelling

Last synced: 15 Oct 2024

https://github.com/fursovia/geometric_embedding

"Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation

embeddings nlp

Last synced: 03 Aug 2024

https://github.com/tlkh/t2t-tuner

Convenient Text-to-Text Training for Transformers

gpt huggingface language-model nlp pytorch t5 transformers

Last synced: 07 Nov 2024

https://github.com/bloomberg/mixce-acl2023

Implementation of MixCE method described in ACL 2023 paper by Zhang et al.

language-model machine-learning nlp python pytorch transformer

Last synced: 09 Nov 2024

https://github.com/thunlp/babelnet-sememe-prediction

Code and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"

babelnet nlp semantics sememe

Last synced: 10 Nov 2024

https://github.com/mindspore-courses/deepnlp-models-mindspore

About MindSpore implementations of various Deep NLP models in cs-224n(Stanford Univ)

deep-learning mindspore nlp tutorial

Last synced: 09 Nov 2024

https://github.com/anthonysigogne/keyword-mining

API - extract a list of keywords from a text.

docker keyword keyword-extraction nlp python-2 seo

Last synced: 12 Oct 2024

https://github.com/bububa/jiagu

Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类

chinese-nlp chinese-word-segmentation classification clustering cws ner nlp pos segmentation

Last synced: 08 Nov 2024