Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.

https://github.com/revdotcom/words2num

Convert words to numbers

inverse-text-normalization nlp

Last synced: 11 Nov 2024

https://github.com/gmontamat/poor-mans-transformers

Implement Transformers (and Deep Learning) from scratch in NumPy

deep-learning from-scratch machine-learning ml-framework neural-network nlp transformers

Last synced: 30 Oct 2024

https://github.com/percevalw/metanno

Annotator building tool for Jupyter

annotator customizable jupyter modular nlp

Last synced: 08 Nov 2024

https://github.com/ahammadmejbah/ahammadmejbah

Data Science || Machine Learning || Deep Learning || Computer Vision || NLP Enthusiast Talks about #datascience, #deeplearning, #dataanalytics, #machinelearning, and #machinelearningalgorithms

artificial-intelligence computer-vision data-science deep-learning machine-learning nlp python

Last synced: 11 Nov 2024

https://github.com/lkstrp/newspaper-scraper

The all-in-one Python package for seamless newspaper article indexing, scraping, and processing – supports public and premium content!

news newspaper nlp parser scraper

Last synced: 07 Nov 2024

https://github.com/wassname/phoneme2grapheme

Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")

cmudict deep-learning deeplearning machine-learning nlp pronunciation spelling

Last synced: 15 Oct 2024

https://github.com/thunlp/babelnet-sememe-prediction

Code and data of the AAAI-20 paper "Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets"

babelnet nlp semantics sememe

Last synced: 10 Nov 2024

https://github.com/liyucheng09/llm-compressive

Longitudinal Evaluation of LLMs via Data Compression

benchmark evaluation llm llms nlp

Last synced: 30 Oct 2024

https://github.com/bloomberg/mixce-acl2023

Implementation of MixCE method described in ACL 2023 paper by Zhang et al.

language-model machine-learning nlp python pytorch transformer

Last synced: 09 Nov 2024

https://github.com/dluman/rusty

Rust bindings for the spaCy library.

nlp rust

Last synced: 16 Nov 2024

https://github.com/tlkh/t2t-tuner

Convenient Text-to-Text Training for Transformers

gpt huggingface language-model nlp pytorch t5 transformers

Last synced: 07 Nov 2024

https://github.com/vishnunkumar/doc_transformers

Document processing using transformers

ai ml nlp ocr

Last synced: 16 Nov 2024

https://github.com/contextlab/abstract2paper

Auto-generate an entire paper from a prompt or abstract using NLP

auto-text gpt-neo nlp notebook-jupyter text-generation

Last synced: 06 Nov 2024

https://github.com/alexcg1/easy_text_generator

Generate text from machine-learning models right in your browser

machine-learning nlp python streamlit

Last synced: 27 Oct 2024

https://github.com/proycon/deepfrog

An NLP-suite powered by deep learning

deep-learning deep-neural-networks dutch folia frog nlp transformers

Last synced: 08 Nov 2024

https://github.com/bramvanroy/astred

An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For instance useful for comparing a translation with the original text, to find differences and similarities between two different translations, or to see how a machine translation differs from a reference translation.

alignment linguistics nlp parallel-corpus parsing spacy stanza translation

Last synced: 14 Oct 2024

https://github.com/hpprc/defsent

DefSent: Sentence Embeddings using Definition Sentences

bert natural-language-processing nlp transformers

Last synced: 27 Oct 2024

https://github.com/azu/nlp-pattern-match

Natural Language pattern matching library for JavaScript.

english japanese javascript morphological-analysis nlcst nlp pos

Last synced: 01 Nov 2024

https://github.com/artitw/bert_qa

Accelerating the development of question-answering systems based on BERT and TF 2.0

artificial-intelligence bert machine-learning natural-language-processing natural-language-understanding nlp

Last synced: 28 Oct 2024

https://github.com/wetneb/pynif

A small Python library for NLP Interchange Format (NIF) for NER(D) systems

entity-linking gerbil named-entity-recognition nif nlp python

Last synced: 28 Oct 2024

https://github.com/cmccomb/rust-stop-words

Common stop words in a variety of languages

languages natural-language-procressing nlp nltk rust-crate stopwords

Last synced: 12 Oct 2024

https://github.com/tencent-ailab/season

[EMNLP 2022] Salience Allocation as Guidance for Abstractive Summarization

nlp summarization summarization-model

Last synced: 18 Nov 2024

https://github.com/arbox/wlapi

Ruby based API for the project Wortschatz Leipzig.

computational-linguistics natural-language-processing nlp ruby rubynlp

Last synced: 15 Nov 2024

https://github.com/kklemon/flashperceiver

Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention.

attention-mechanism deep-learning flash-attention nlp perceiver transformer

Last synced: 19 Nov 2024

https://github.com/fursovia/geometric_embedding

"Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation

embeddings nlp

Last synced: 17 Nov 2024

https://github.com/gdamdam/sumo

Tool to extracts the text from a web article urls and get frequency words, entities recognition, automatic summary and more

automatic-summarization content-extraction entity-recognition nlp nltk semantic-analysis sentence-extraction

Last synced: 14 Nov 2024

https://github.com/greenelab/preprint-similarity-search

A web app that uses machine learning to recommend the most suitable journals based on the text content of your preprint

journals nlp nlp-machine-learning web-app

Last synced: 13 Nov 2024

https://github.com/fedenunez/tulp

Tulp is a command-line tool that can help you create and process piped content using the power of ChatGPT directly from the terminal.

chatgpt chatgpt-api console llm nlp shell unix-shell

Last synced: 13 Nov 2024

https://github.com/bububa/jiagu

Jiagu深度学习自然语言处理工具 知识图谱关系抽取 中文分词 词性标注 命名实体识别 情感分析 新词发现 关键词 文本摘要 文本聚类

chinese-nlp chinese-word-segmentation classification clustering cws ner nlp pos segmentation

Last synced: 08 Nov 2024

https://github.com/vaibhavs10/ml-with-text

[Tutorial] Demystifying Natural Language Processing with Python

machine-learning natural-language-processing nlp python text-classification

Last synced: 25 Oct 2024

https://github.com/varunon9/sentence-type-classifier

Classify English sentences into assertive, negative, interrogative, imperative and exclamatory based on grammar.

english-grammar nlp nlp-machine-learning sentence-classification

Last synced: 27 Oct 2024

https://github.com/centrefordigitalhumanities/tscan

T-scan: an analysis tool for dutch texts to assess the complexity of the text, based on original work by Rogier Kraf

dutch-language feature-extraction nlp text-difficulty

Last synced: 06 Nov 2024

https://github.com/google-research/pangea

Panoramic Graph Environment Annotation toolkit, for collecting audio and text annotations in panoramic graph environments such as Matterport3D and StreetLearn.

annotation-tool computer-vision crowdsourcing nlp

Last synced: 10 Nov 2024

https://github.com/yashdew/assessor

An open-source Resume Analyzer and Ranking tool for recruiters and candidates.

flask hacktoberfest hacktoberfest2021 nextjs nlp python spacy

Last synced: 27 Oct 2024

https://github.com/michellebonat/fed_funds_ml

Use machine learning (NLP) to demonstrate whether Federal Funds rate changes can be accurately predicted using just the FOMC - the US Federal Reserve Bank - meetings minutes.

ai federal-reserve-bank finance financial-services machine-learning nlp python3

Last synced: 08 Nov 2024

https://github.com/proycon/foliapy

An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.

clariah clarin computational-linguistics folia nlp pynlpl xml

Last synced: 31 Oct 2024

https://github.com/mindspore-courses/deepnlp-models-mindspore

About MindSpore implementations of various Deep NLP models in cs-224n(Stanford Univ)

deep-learning mindspore nlp tutorial

Last synced: 09 Nov 2024

https://github.com/anthonysigogne/keyword-mining

API - extract a list of keywords from a text.

docker keyword keyword-extraction nlp python-2 seo

Last synced: 12 Oct 2024

https://github.com/neokd/datastorehouse

DataStoreHouse is an open-source project that aims to create a collaborative platform for gathering and sharing a wide variety of datasets. It provides a centralised repository where individuals and organisations can contribute, discover, and collaborate on diverse datasets for various domains.

api csv datasets good-first-issue hacktoberfest hacktoberfest2023 json machinelearning nextjs13 nlp open-source opensource opensource-projects python reactjs

Last synced: 27 Oct 2024

https://github.com/bminixhofer/gerpt2

German small and large versions of GPT2.

common-crawl german gpt2 language-model machine-learning nlp

Last synced: 28 Oct 2024

https://github.com/mmxgn/sprl-spacy

Implementation of Spatial Role Labeling using the Spacy NLP framework.

nlp problog spacy spatial-role-labeling sprl

Last synced: 10 Oct 2024

https://github.com/ianramzy/article-summary-deep-learning

📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!

fact-extractor flask named-entity-recognition nlp summarization web-scraping

Last synced: 19 Nov 2024

https://github.com/mfarragher/obsidian-nlp-analytics

Proofs of concept for workflows that augment Obsidian.md knowledge management via NLP analytics & modelling

knowledge-management nlp nlp-machine-learning obsidian-md python

Last synced: 23 Oct 2024

https://github.com/systats/textlearnR

A simple collection of well working NLP models (Keras, H2O, StarSpace) tuned and benchmarked on a variety of datasets.

classification hyperparameter-optimization keras nlp r text-mining

Last synced: 05 Aug 2024

https://github.com/MilaNLProc/bertlang

A web interface to understand language-specific BERT-models

artificial-intelligence bert-model machine-learning nlp nlp-machine-learning

Last synced: 28 Aug 2024

https://github.com/liamdugan/summary-qg

Code for the ACL 2022 Paper "A Feasibility Study of Answer-Agnostic Question Generation for Education"

natural-language-processing nlp question-answer-generation question-answering question-generation

Last synced: 27 Oct 2024

https://github.com/sno2/bertml

Use common pre-trained ML models in Deno!

bert deno machine-learning nlp rust

Last synced: 17 Aug 2024

https://github.com/neurotech-hq/pysimilar

A python library for computing the similarity between two string(text) based on cosine similarity

cosine-similarity natural-language natural-language-processing natural-language-understanding nlp python-tanzania tanzania

Last synced: 08 Nov 2024

https://github.com/bryanlimy/dnn-dependency-parser

TensorFlow implementation of A Fast and Accurate Dependency Parser using Neural Networks

dependency-parser mlp neural-network nlp tensorflow

Last synced: 23 Oct 2024

https://github.com/igorbenav/clientai

A unified client for seamless interaction with multiple AI providers.

ai api api-rest artificial-intelligence language-model llm nlp ollama ollama-client openai openai-api python replicate replicate-api

Last synced: 28 Oct 2024

https://github.com/spacyturk/spacyturk

spaCyTurk - trained models & pipelines for Turkish

floret nlp nlp-library spacy turkish-nlp

Last synced: 12 Oct 2024

https://github.com/yutkin/news-aggregator

Classification and aggregation of russian news articles. University coursework.

classification coursework machine-learning news news-aggregator nlp university

Last synced: 05 Nov 2024

https://github.com/nlpie/biomedicus

BioMedICUS: A biomedical and clinical NLP engine.

biomedical-informatics health-informatics natural-language-processing nlp text-analysis

Last synced: 17 Nov 2024

https://github.com/koichiyasuoka/supar-unidic

Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese with BERT models

dependency-parser japanese-language nlp

Last synced: 16 Nov 2024

https://github.com/spidy20/flask_nlp_chatbot

This is simple chatbot using NLP which is implemented on Flask WebApp.

chatbot chatbot-framework chatterbot flask flask-api flask-application nlp nlp-chatbot nlp-machine-learning nltk

Last synced: 15 Nov 2024

https://github.com/damo-nlp-sg/bgca

Code and Data for "Bidirectional Generative Framework for Cross-domain Aspect-based Sentiment Analysis" (ACL 2023)

aspect-based-sentiment-analysis natural-language-processing nlp

Last synced: 13 Nov 2024

https://github.com/jakartaresearch/maleo

Wrapper library for text cleansing, preprocessing in NLP

indonesian-language machine-learning nlp nlp-library

Last synced: 15 Nov 2024

https://github.com/undertheseanlp/sentiment

Vietnamese Sentiment Analysis

nlp sentiment-analysis vietnamese vietnamese-nlp

Last synced: 11 Nov 2024

https://github.com/daoyuanli2816/transformer-tutorial-cn

一个transformer模型的简单的中文教程

chinese-simplified huggingface nlp transformer tutorial-code

Last synced: 08 Nov 2024

https://github.com/fahdseddik/deeplearning.ai-natural-language-processing-specialization

This is all my notebooks, lab solutions, and assignments for the DeepLearning.AI Natural Language Processing Specialization on Coursera.

attention-model coursera coursera-specialization deeplearning-ai natural-language-processing nlp probabilistic-models sequence-models vector-space-models

Last synced: 07 Nov 2024

https://github.com/amazon-science/pizza-semantic-parsing-dataset

The PIZZA dataset continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, whose semantics cannot be captured by flat slots and intents.

dataset natural-language-processing nlp semantic-parsing

Last synced: 12 Nov 2024

https://github.com/tomhosking/hercules

Hercules: Attributable and Scalable Opinion Summarization (ACL 2023)

nlp opinion-summarization summarization vq-vae

Last synced: 27 Oct 2024

https://github.com/caiyinqiong/study_notes

This is my study notes for my PhD in AI, NLP, IR, and more.

information-retrieval mechine-learing nlp notes paper-list

Last synced: 10 Nov 2024

https://github.com/kavgan/clinical-concepts

Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to mine related concepts by leveraging the volume within large amounts of clinical notes.

clinical-concepts clinical-nlp clinical-notes concept-graph graph-nlp nlp paper terminologies

Last synced: 30 Oct 2024

https://github.com/mekhyw/cookiebot-telegram-group-bot

Conversational AI group bot for Telegram. It can also schedule posts, combat raiders/spammers, generate memes, scrape images, provide drawing ideas, call all members and more!

google-cloud-platform google-cloud-pubsub llm nlp opencv python telegram

Last synced: 12 Nov 2024

https://github.com/saransh-cpp/ocred

Clever, simple, and intuitive wrapper functionalities for OCRing specific textual materials.

hacktoberfest image-processing nlp nltk-python ocr python tesseract-ocr

Last synced: 08 Nov 2024

https://github.com/megagonlabs/ebe-dataset

Evidence-based Explanation Dataset (AACL-IJCNLP 2020)

dataset japanese-language nlp text-classification text-generation

Last synced: 10 Nov 2024

https://github.com/arne-cl/brat-embedded-visualization-examples

minimal examples of brat annotation visualizations

annotation brat javascript nlp visualization

Last synced: 10 Nov 2024

https://github.com/liaad/tieval

An Evaluation Framework for Temporal Information Extraction Systems

evaluation-framework information-extraction nlp temporal-relations

Last synced: 10 Nov 2024

https://github.com/go-air/dupi

A tool to find all duplicates in large sets of text documents.

analysis analytics golang index nlp search

Last synced: 08 Nov 2024

https://github.com/yuanxiaosc/Deep_dynamic_contextualized_word_representation

TensorFlow code and pre-trained models for A Dynamic Word Representation Model Based on Deep Context. It combines the idea of BERT model and ELMo's deep context word representation.

bert elmo nlp transformer

Last synced: 02 Nov 2024

https://github.com/jovotech/snips-nlu-server

An open source natural language understanding (NLU) server

nlp nlu nlu-engine snips-nlu

Last synced: 07 Nov 2024

https://github.com/chrislemke/deep-martin

Text simplification for a better world: Deep-Martin Transformer 🤗

deep-learning huggingface nlp python pytorch text-simplification transformers

Last synced: 16 Nov 2024

https://github.com/jfilter/hyperhyper

🧮 Python package to construct word embeddings for small data using PMI and SVD

embeddings nlp pmi pmi-svd ppmi python python-package word-analogy word-embeddings word-similarity

Last synced: 11 Nov 2024

https://github.com/naturale0/nlp-do-it-yourself

Implement well-known NLP models from scratch with high-level APIs.

machine-learning natural-language-processing nlp pytorch-examples tensorflow-examples

Last synced: 09 Oct 2024