Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers and humans interact through natural language. In 1950, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have advanced language modeling, parsing, and other natural-language tasks.

https://github.com/denismosolov/alice-entities-library

A set of named entities for the Yandex.Dialogs platform. Use it when building Alice skills.

alice-sdk alice-skills nlp yandex-dialogs

Last synced: 16 Nov 2024

https://github.com/jfilter/german-lemmatizer

✂️ Python package (using a Docker image under the hood) to lemmatize German texts.

german lemmatization lemmatizer natural-language-processing nlp python

Last synced: 11 Nov 2024

https://github.com/gesiscss/ptm

Introduction to Natural Language Processing with a special emphasis on the analysis of Job Advertisements

binder data-science information-retrieval labour-market nlp r text-mining topic-modeling

Last synced: 09 Nov 2024

https://github.com/cyclecycle/visualise-spacy-tree

Create dependency tree plots from spaCy Doc objects

nlp python spacy

Last synced: 14 Oct 2024

https://github.com/undertheseanlp/speech_classification

Vietnamese Speech Classification experiments

nlp speech vietnamese vietnamese-nlp

Last synced: 11 Nov 2024

https://github.com/dai-wenxun/pointer-generator-networks

PyTorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

nlp pointer-generator pytorch-implementation seq2seq summarization

Last synced: 14 Oct 2024

https://github.com/vochicong/datalab-nlp

NLP extension to Google Cloud Datalab

docker google-cloud-datalab japanese nlp

Last synced: 21 Oct 2024

https://github.com/vblagoje/reno-auto

Automatically generate comprehensive Reno release notes for your pull requests

github-actions nlp release-automation

Last synced: 09 Nov 2024

https://github.com/centre-for-humanities-computing/conspiracies

A Python package for discovering and examining conspiracies using NLP.

conspiracies conspiracy knowledge-graph nlp spacy

Last synced: 14 Oct 2024

https://github.com/viveksck/simplicity

Code and Data for Simple Models for Word Formation in English Slang

nlp slang

Last synced: 14 Oct 2024

https://github.com/yunsii/fasttext.wasm.js

WebAssembly version of fastText, a library for efficient text classification and representation learning, with support for Node and browser environments.

browser browser-extension fasttext language language-detection language-detector language-identification natural-language natural-language-processing nlp node nodejs wasm web-extension webassembly worker

Last synced: 09 Nov 2024

https://github.com/renovamen/image-captioning

PyTorch re-implementation of some papers on image captioning

adaptive-attention attention captions cv image-captioning nlp pytorch show-and-tell show-attend-and-tell

Last synced: 10 Nov 2024

https://github.com/amey-thakur/sentiment-analyzer

A simple Python program to analyze sentiment using the TextBlob library.

amey ameythakur analysis natural-language-processing nlp sentiment-analysis textblob textblob-sentiment-analysis

Last synced: 09 Nov 2024

https://github.com/oneapi-src/disease-prediction

AI Starter Kit for the implementation of AI-based NLP Disease Prediction system using Intel® Extension for PyTorch* and Intel® Neural Compressor

deep-learning nlp pytorch

Last synced: 05 Nov 2024

https://github.com/guenthermi/postgres-retrofit

Tools to create database-specific text value embeddings from word embedding datasets

in-database-analytics learning machine machine-learning nlp postgresql word-embeddings word2vec

Last synced: 15 Oct 2024

https://github.com/javierarce/silabea

Node package that splits Spanish words into syllables.

language nlp spanish syllable syllable-count syllables

Last synced: 23 Oct 2024

https://github.com/rosette-api-community/rosettepedia

Augment Rosette API entity extraction results with information from Wikipedia.

entities entity-extraction language mediawiki mediawiki-api natural-language-processing nlp python wikidata wikipedia

Last synced: 09 Oct 2024

https://github.com/cyclecycle/role-pattern-nlp

Build and match patterns for semantic role labelling / information extraction with spaCy

nlp python semantic-role-labeling spacy

Last synced: 12 Oct 2024

https://github.com/rajspeaks/machine-learning-approach-to-bengali-pos-tagging-using-bnlp

Machine Learning approach to Bengali Corpus POS (Parts of Speech) Tagging using BNLP (Bengali Natural Language Processing) Toolkit. This is the Minor Project Presentation at Heritage Institute of Technology under the mentorship of Prof. Sandipan Ganguly.

bengali-natural-language-processing bengali-nlp bnlp crf-model deep-learning machine-learning ml natural-language-generation natural-language-processing natural-language-toolkit natural-language-understanding nlp pos-tagger pos-tagging python3 rajdeep-das rajspeaks

Last synced: 23 Oct 2024

https://github.com/ymcui/mrc-model-analysis

Multilingual Multi-Aspect Explainability Analyses on Machine Reading Comprehension Models (iScience)

bert mrc nlp tensorflow

Last synced: 28 Oct 2024

https://github.com/ekramasif/ai-lab-final-project

Improving News Classification Model Using Support Vector Machine and Naive Bayes

artificial-intelligence classification dataset naive-bayes-classifier news-classification nlp svm-classifier text-classification

Last synced: 11 Oct 2024

https://github.com/epwalsh/allennlp-manager

[WORK IN PROGRESS] Your manager for AllenNLP experiments

allennlp machine-learning nlp

Last synced: 24 Oct 2024

https://github.com/jpoehnelt/eleventy-plugin-related

Plugin for related posts in Eleventy.

eleventy eleventy-plugin natural nlp tf-idf

Last synced: 10 Oct 2024

https://github.com/softmarshmallow/robbin

🔠 An open dictionary platform (both students and developers are welcome!)

deno dictionary dictionary-app education flutter gre nestjs nlp prisma python robbin sat students

Last synced: 28 Oct 2024

https://github.com/michaelhly/farglot

A Transformer-based SocialNLP toolkit for Farcaster

farcaster nlp transformers

Last synced: 18 Oct 2024

https://github.com/argilla-io/argilla-server

A Python native FastAPI server for the Argilla backend.

api argilla fastapi llm machine-learning nlp server

Last synced: 12 Nov 2024

https://github.com/salesforce/isea

Official code repository for "iSEA: An Interactive Pipeline of Semantic Error Analysis for NLP Models"

deep-learning hci human-in-the-loop machine-learning nlp visualization

Last synced: 08 Nov 2024

https://github.com/tokenmill/dictionary-annotator

Fast and configurable UIMA dictionary annotator.

annotators csv dictionary dkpro nlp ruta

Last synced: 10 Nov 2024

https://github.com/tokenmill/snowball

Snowball version of the Porter stemmer for the Lithuanian language.

lithuanian-language nlp porter-stemmer snowball stemmer

Last synced: 10 Nov 2024

https://github.com/imwildcat/aitk

Artificial Intelligence Toolkit, a powerful tool that makes your life better.

ai baidu cloud-ai cloud-ml-engine computer-vision google machine-learning machine-translation nlp speech-recognition tencent text-to-speech

Last synced: 18 Oct 2024

https://github.com/zhudotexe/chatgpt-api-demo

Demo of how to use the ChatGPT API to create a chat application right in your terminal.

chatgpt chatgpt-api gpt-3 machine-learning natural-language-processing nlp openai openai-api

Last synced: 27 Oct 2024

https://github.com/youssefsoli/ipfe

InterPlanetary File Explorer [UofTHacks X Protocol Labs - Best Use of Estuary]

cohere estuary filecoin ipfe ipfs nlp

Last synced: 27 Sep 2024

https://github.com/madrugado/gia-corpus

Corpus of exam tests for 9-graders in Russia for NLP/ML purposes

corpus natural-language-processing nlp russian-corpus

Last synced: 09 Nov 2024

https://github.com/praful932/midas

MIDAS@IIITD NLP Task

midas nlp

Last synced: 13 Oct 2024

https://github.com/proycon/nederlab-pipeline

Linguistic enrichment pipeline for historical Dutch, as used in the Nederlab project

dutch historical-dutch historical-linguistics natural-language-processing nederlab nextflow nlp workflow

Last synced: 19 Oct 2024

https://github.com/sammous/allennlp_tagger

Academic Paper/Document classifier based on SCOPUS categories

allennlp classification nlp python pytorch scopus

Last synced: 19 Oct 2024

https://github.com/hailiang-wang/hanlp-client

HanLP client for Node.js

chatbot nlp

Last synced: 11 Oct 2024

https://github.com/ksdkamesh99/stackquestioner

A Django-based web app that predicts which language a given query belongs to: 'R', 'java', 'javascript', 'PHP', or 'python'. The model is trained using LSTMs, with a training accuracy of 97% and a testing accuracy of 80%. The training data consists of queries from a Kaggle dataset originally extracted from StackOverflow.

django embeddings keras-tensorflow lstm nlp stackoverflow

Last synced: 24 Oct 2024

https://github.com/jeff-vincent/spacy-passive-to-active-voice

A rule-based, English language, passive voice converter. Requires spaCy.

nlp python3 spacy-nlp stanford-nlp

Last synced: 19 Nov 2024

https://github.com/nurseiit/46th-soz

✍️ Abai's 46th word (Qara Søz) by "AI". Character-based RNN from the TensorFlow docs.

ai kazakh ml nextjs nlp tensorflow

Last synced: 13 Oct 2024

https://github.com/kardelruveyda/langchain-mystic

langchain-mystic is a LangChain framework based bot that currently specializes in dream interpretation and fortune-telling. While these are its primary features for now, stay tuned to see what other mystical abilities

chatbot chatbots langchain langchain-python mystic nlp

Last synced: 08 Nov 2024

https://github.com/frankier/wikiparse

Scrapes some Finnish word definitions from English Wiktionary.

computational-linguistics dictionary finnish natural-language-processing nlp wikimarkup wikitionary

Last synced: 08 Nov 2024

https://github.com/maxim5/cs224n-2019-winter

All lecture notes, slides and assignments from CS224n: Natural Language Processing with Deep Learning class by Stanford

cs224n deep-learning machine-learning nlp stanford-nlp

Last synced: 05 Nov 2024

https://github.com/yangboz/chatbotsmessager

🤖 iMessage to Emotional Artificial Intelligence Chat Bots

appstore chatbot hash nlp objective-c screenshot

Last synced: 27 Oct 2024

https://github.com/comcast/syntaviz

A visualization interface for analyzing a (very large) corpus of natural-language queries.

clustering nlp visualization voice-recognition

Last synced: 14 Nov 2024

https://github.com/sap-samples/acl2023-micse

Source code for ACL 2023 paper "miCSE: Mutual Information Contrastive Learning for Low-shot Sentence Embeddings".

contrastive embedding few-shot-learning learning low-shot-learning nlp sample self-supervised-learning sentence sssl

Last synced: 15 Nov 2024

https://github.com/stefanieschneider/unstruwwel

Detect and Parse Historic Dates in R

dates nlp parser r

Last synced: 20 Nov 2024

https://github.com/jpruden92/dialogflow-nlp-to-nlpjs

Transform your Dialogflow NLP model to an NLP.js model

chat chatbot dialogflow nlp nlpjs nlu text voice

Last synced: 17 Nov 2024

https://github.com/pharo-ai/Polyglot

A library for Natural Language Processing

natural-language-processing nlp pharo

Last synced: 17 Nov 2024

https://github.com/chandru-21/end-to-end-movie-recommendation-system-with-deployment-using-docker-and-kubernetes

A content-based recommendation system uses attributes of the content to recommend similar content. It doesn't have a cold-start problem because it works through attributes or tags of the content, such as actors, genres or directors, so that new movies can be recommended right away.

contentbasedfiltering docker endtoend endtoendpipeline kubernetes machine-learning movie-recommendation movierecommendationsystem nlp python recommendation-system recommender-system streamlit streamlit-webapp

Last synced: 15 Nov 2024

https://github.com/jgolafshan/wallstreetsocial

Analyzes Reddit posts and comments, using an NLP model to recognize stock symbols and options positions

data-analysis flexible machine-learning mentioned-stocks named-entity-recognition natural-language-processing nlp reddit social-media-analysis sql wallstreetbets

Last synced: 20 Nov 2024

https://github.com/euler16/charrnn

Character Level Language Modelling using PyTorch

char-rnn deep-learning lstm nlp

Last synced: 17 Nov 2024

https://github.com/geekalexis/two-stage-sum

Two-stage text summarization with BERT and BART

nlp summarization text-summarization

Last synced: 13 Nov 2024

https://github.com/aditeyabaral/nlpc

Natural Language Toolkit built using the C Programming Language

c machine-learning nlp nlp-machine-learning nltk

Last synced: 16 Nov 2024

https://github.com/estamos/word2vec-thesis

🎓 Diploma Thesis | A Word2vec comparative study of CBOW and Skipgram

cbow continuous-bag-of-words gensim gensim-word2vec machine-learning nlp skipgram skipgram-algorithm word-embeddings word2vec

Last synced: 15 Nov 2024

https://github.com/hassanalgoz/text-generation

Generate and predict text using recurrent neural networks (Keras + TensorFlow + Gensim).

gensim-word2vec gru keras-tensorflow lstm machine-learning nlp rnn text-processing word2vec

Last synced: 12 Nov 2024

https://github.com/aflah02/cleansetext

This is a simple library to help you clean your textual data

cleaning-data nlp preprocessing pypi text

Last synced: 20 Nov 2024

https://github.com/webpolis/musai

Machine learning-powered music generation. Full-featured tokenizer, customization options, and high-quality output files. Integration with music production tools.

deep-learning generative-art large-language-models llm machine-learning midi music music-generation nlp recurrent-neural-networks rnn text-generation tokenizer vae variational-autoencoder

Last synced: 15 Nov 2024

https://github.com/aflah02/wordnet-parser

A Custom Parser for WordNet

nlp parser wordnet

Last synced: 20 Nov 2024

https://github.com/aflah02/nlp-albumentations-data-augmentation

This repository contains helper functions which can help you generate additional data points depending on your NLP task.

data-science nlp

Last synced: 20 Nov 2024

https://github.com/ajdavidl/nlp-packages

List of packages developed with focus on natural language processing.

natural-language-processing nlp

Last synced: 20 Nov 2024

https://github.com/nateraw/pytorch-lightning-azureml

Narrow the gap between research and production 😎

azure azureml nlp pytorch-lightning transformers

Last synced: 17 Nov 2024

https://github.com/eliask93/transformer-models-for-domain-specific-machine-translation

Example application for the task of fine-tuning pretrained machine translation models on highly domain-specific, self-extracted translated sentences

bitext-mining machine-translation marian-nmt nllb nllb200 nlp sentence-extraction sentence-transformers t5

Last synced: 13 Nov 2024

https://github.com/imsanjoykb/automated-spam-mail-detection-and-flask-deployment

A simple NLP project in which the model predicts whether an incoming email is spam or not spam (ham). It is a prototype of the automatic classification seen in Gmail, where mail is sorted into the spam folder or the inbox.

flask machine-learning naive-bayes-classifier nlp python scikit-learn

Last synced: 17 Nov 2024

https://github.com/benja1972/topicphrase

Simple project for extracting key phrases from a single document, based on Sentence Transformers

bert-embeddings clusters embeddings key-phrase-extraction nlp noun-phrases-candidates sentence-transformers topics

Last synced: 18 Nov 2024

https://github.com/nlpie/mtap

MTAP: A framework for distributed text analysis using gRPC and microservices-based architecture.

framework grpc java microservices mtap natural-language-processing nlp pipelines python text-analysis

Last synced: 17 Nov 2024

https://github.com/thakur-nandan/topic-modeling

This repository contains an intuitive example of topic modeling using regular LDA, and shows how GuidedLDA is better than regular LDA

gensim guidedlda latent latent-dirichlet-allocation lda nlp nlp-machine-learning nltk python seededlda text topic-modeling

Last synced: 15 Oct 2024

https://github.com/tuanacelik/what-would-mother-say

💁‍♀️ A tweet-creation agent that fetches a user's last k tweets and generates a tweet about the requested topic

agent haystack llm nlp opensource

Last synced: 28 Oct 2024

https://github.com/ills-montreal/emir

When is an Embedding Model More Promising than Another?, NeurIPS'24

embedders information-theory molecule nlp representation-learning

Last synced: 06 Nov 2024

https://github.com/iampukar/language-processing

Solutions to NLP coursework from National Research University Higher School of Economics, through Coursera

coursera hse-aml natural-language-processing nlp

Last synced: 07 Nov 2024

https://github.com/ppasupat/factored-span-parsing

Code for the EMNLP 2019 paper "Span-based Hierarchical Semantic Parsing for Task-Oriented Dialog"

nlp semantic-parsing

Last synced: 08 Nov 2024

https://github.com/m-elbably/symspell-ex

Distributed spelling correction & fuzzy search based on symmetric delete spelling correction algorithm (SymSpell)

algorithm nlp spelling-correction

Last synced: 06 Nov 2024

https://github.com/finn-no/keras-conv-sentence-classifier

Keras Implementation of "Convolutional Neural Networks for Sentence Classification"

convolutional-neural-network keras nlp

Last synced: 14 Nov 2024

https://github.com/xiaohk/stat333_project_2

Madison restaurant Yelp rating prediction based on review text

convolutional-neural-networks logistic-regression machine-learning naive-bayes-classifier nlp

Last synced: 07 Nov 2024

https://github.com/avestura/persiannews

📰 My final project for NLP course

csharp fsharp guilan-university nlp persian persian-nlp

Last synced: 14 Nov 2024

https://github.com/ssciwr/ammico

AI-based Media and Misinformation Content Analysis Tool: Analyze text and images

classification computer-vision nlp text-extraction translation

Last synced: 09 Nov 2024