Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Natural language processing

Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article proposing a measure of machine intelligence, now called the Turing test. More recent techniques, such as deep learning, have produced state-of-the-art results in language modeling, parsing, and many other natural-language tasks.

https://github.com/retraigo/appraisal

Machine Learning utilities for TypeScript

encoding machine-learning nlp tf-idf typescript

Last synced: 04 Aug 2024

https://github.com/davikawasaki/utfpr-ce-undergrad-final-project

UTFPR Computer Engineering Undergrad Final Project - Computing Exam Questions Classification Using Natural-Language Processing

adaptive-teaching computing-classification machine-learning natural-language-processing nlp nltk python sklearn

Last synced: 30 Jul 2024

https://github.com/rosette-api-community/text-embeddings-sample

A short Python example showing how to compute similarity between word embeddings returned by the Rosette API's new /text-embedding endpoint.

machine-learning natural-language-processing nlp python text-embedding text-extraction text-similarity word-similarity

Last synced: 03 Aug 2024

https://github.com/evelinacs/semantic_parsing_with_IRTGs

Experiments in developing an IRTG that simultaneously encodes transformations between phrase structure trees, dependency graphs, and semantic graphs.

computational-linguistics dependency-graph grammar grammar-rules graph-transformation irtg natural-language-processing nlp penn-treebank phrase-structure-tree python python3 rule-based semantic-parsing surface-realization universal-dependencies

Last synced: 07 Aug 2024

https://github.com/Ermlab/polish-word-embeddings-review

Evaluation of Polish word embeddings prepared by various research groups, using the word analogy task.

computational-linguistics deep-learning fasttext machine-learning nlp polish-language word2vec wordembeddings

Last synced: 31 Jul 2024

https://github.com/krrish-v/mark_importer

Assigns a category to each imported bookmark using an AI model, making bookmarks easier to manage.

bookmarks bookmarks-manager nlp

Last synced: 01 Aug 2024

https://github.com/kasnerz/lightnlg

A minimalistic codebase for training NLG models from HuggingFace Transformers using PyTorch Lightning.

bart generation gpt-2 language-model nlg nlp pytorch pytorch-lightning

Last synced: 04 Aug 2024

https://github.com/notAI-tech/Anuvaad

State-of-the-art open-source translation for Indic languages.

hindi india indic-languages kannada malayalam marathi mt5 multilingual nlp tamil telugu transformer transformers translation

Last synced: 03 Aug 2024

https://github.com/nicolasdz/UNGDC

The UN General Debate Corpus (UNGDC) is a dataset of all speeches given at the high-level UN forum usually held in September of each year.

diplomacy international-relations nlp united-nations

Last synced: 01 Aug 2024

https://github.com/kaydotdev/in-a-word-bot

A Telegram bot for text summary generation

aiogram bot nlp python summary telegram-bot tldr

Last synced: 29 Jul 2024

https://github.com/nisheethjaiswal/Data-Annotator-for-SpaCy

🚀 SpAnnor: an easy-to-use annotation tool for Named Entity Recognition. The annotator lets users quickly assign custom labels to one or more entities in the text. Easy to set up for preparing spaCy training data 🔥.

data-annotation data-annotation-tools data-labeling data-preparation named-entity-recognition nlp spacy-nlp text-labeling

Last synced: 17 Aug 2024

https://github.com/AdvitDeepak/messages-wrapped

NLP analysis and dashboard of user's messages (Instagram, iMessage, Android)

dash messages nlp nltk plotly

Last synced: 07 Aug 2024

https://github.com/cortega26/PDF-Text-Analizer

This repository houses a script that can download PDFs from a specified URL, convert them to text, and perform text analysis. This analysis includes identifying the language, eliminating stopwords, and counting word and phrase frequency. It's worth noting that the script is capable of analyzing texts in multiple languages.

nlp ocr pdf pdf-converter text-analysis text-mining text-summarization

Last synced: 07 Aug 2024

https://github.com/gtoffoli/commons-textanalysis

Text-analysis support for Django clients, talking through HTTP API to an extended spaCy deployment.

django nlp python spacy text-analysis

Last synced: 07 Aug 2024

https://github.com/1994nikunj/nlp-toolkit-desktop-app

The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.

data-analysis n-grams network-visualization nlp python text-cleaning topic-modeling wordcloud-generator

Last synced: 07 Aug 2024

https://github.com/pharo-ai/ngram

N-gram functionality for Pharo

language-modeling natural-language-processing ngrams nlp pharo

Last synced: 03 Aug 2024

https://github.com/neomatrix369/nlp-java-jvm-example

A repo with examples of NLP libraries, packages, and frameworks written in Java and other JVM languages

bash clojure docker graal graalvm java jvm kotlin natural-language-processing natural-language-understanding nlp scala shell

Last synced: 03 Aug 2024

https://github.com/euskadi31/go-ngram

An n-gram is a contiguous sequence of n items from a given sequence of text or speech.

go golang golang-library machine-learning ngram ngram-analysis ngrams nlp

Last synced: 02 Aug 2024
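
The n-gram definition in the entry above can be illustrated with a minimal Python sketch. It is not taken from go-ngram or any other listed project; the function name `ngrams` and the sample sentence are assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def ngrams(items: Sequence[T], n: int) -> List[Tuple[T, ...]]:
    """Return every contiguous run of n items, in order (hypothetical helper)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    # A sequence of length L has L - n + 1 windows of size n (none if L < n).
    return [tuple(items[i:i + n]) for i in range(len(items) - n + 1)]


tokens = "the quick brown fox".split()  # illustrative input, split on whitespace
print(ngrams(tokens, 2))
# [('the', 'quick'), ('quick', 'brown'), ('brown', 'fox')]
```

The same function works on characters if a string is passed instead of a token list, since both are sequences.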

https://github.com/SoftCreatR/php-mistral-ai-sdk

A powerful and easy-to-use PHP SDK for the Mistral AI API, allowing seamless integration of advanced AI-powered features into your PHP projects.

ai api api-client api-wrapper artificial-intelligence gpt gpt-3 gpt-4 hacktoberfest llama machine-learning mistral natural-language-processing nlp perplexity php replit sdk

Last synced: 09 Aug 2024

https://github.com/rosette-api-community/rosette-for-excel

Microsoft Excel add-in that implements many endpoints through ribbon functions and formula support

entity-extraction excel machine-learning natural-language-processing nlp nlp-apis rosette text-analytics

Last synced: 03 Aug 2024

https://github.com/bhattbhavesh91/texthero-demo

Tutorial demonstrating Texthero, a library for text preprocessing, representation, and visualization, from zero to hero.

nlp nlp-pipeline text-clustering text-mining text-preprocessing text-representation text-visualization texthero texthero-tutorial word-embeddings

Last synced: 01 Aug 2024

https://github.com/jonaschn/topbox

Python 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with Labeled Latent Dirichlet Allocation (L-LDA, LLDA, sLDA).

labeled-lda nlp stanford-nlp stanford-tmt topic-modeling

Last synced: 02 Aug 2024

https://github.com/mohadese-yousefi/spell-correction

Simple autocorrection of misspelled words based on distance.

nlp spelling-correction

Last synced: 04 Aug 2024
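
As a rough illustration of distance-based correction as described in the entry above, the sketch below picks the closest word from a vocabulary. The entry does not say which distance the project uses; Levenshtein edit distance is assumed here, and the function names and word list are hypothetical.

```python
from typing import Iterable


def levenshtein(a: str, b: str) -> int:
    """Count the single-character insertions, deletions, or substitutions turning a into b."""
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # substitute, or keep a match
            ))
        previous = current
    return previous[-1]


def correct(word: str, vocabulary: Iterable[str]) -> str:
    """Return the vocabulary word closest to `word`; ties go to the earliest entry."""
    return min(vocabulary, key=lambda candidate: levenshtein(word, candidate))


VOCABULARY = ["language", "processing", "natural", "model"]  # hypothetical word list
print(correct("langauge", VOCABULARY))  # language
```

Scanning the whole vocabulary for every word is fine for a small list; a larger dictionary would usually call for candidate pruning or an index, which this sketch leaves out.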

https://github.com/srstevenson/keyword-extractor

Extract keywords from plain text documents

nlp spacy tf-idf

Last synced: 04 Aug 2024

https://github.com/AVoss84/pdf_extract

Text classification based on PDF inputs

classification fastapi nlp python streamlit

Last synced: 01 Aug 2024

https://github.com/toshimelonhead/Springboard-Berkshire

Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)

gensim nlp spacy springboard textacy topic-modeling

Last synced: 13 Aug 2024

https://github.com/tomhalloin/Springboard-Berkshire

Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)

gensim nlp spacy springboard textacy topic-modeling

Last synced: 03 Aug 2024

https://github.com/ashishu007/NLP-Tasks

Various NLP tasks using Huggingface and Flask

flask natural-language-processing nlp pytorch question-answering

Last synced: 01 Aug 2024

https://github.com/seanghay/khmerphonemizer

A free, standalone, and open-source Khmer grapheme-to-phoneme converter.

cambodia khmer khmer-language nlp phonemes

Last synced: 01 Aug 2024

https://github.com/Mel-iza/The-Natural-Language-Processing-Workshop

Exercises and notes from the book The Natural Language Processing Workshop

nlp text-analysis text-classification text-processing

Last synced: 01 Aug 2024

https://nevmenandr.github.io/bashwiki-report/

Report on a computational analysis of the Bashkir Wikipedia, undertaken in 2013

bashkir minority-language natural-language-processing nlp

Last synced: 03 Aug 2024

https://github.com/Alvayang923/Elon_NLP

An NLP-based exploratory analysis of Elon Musk's communication on Twitter

elon-musk nlp twitter

Last synced: 29 Jul 2024

https://github.com/simonebel/nlp-datasets-benchmark

NLP Datasets Benchmark

dataset nlp

Last synced: 01 Aug 2024

https://github.com/DavidLee95/whatsapp_chatbot

A customized AI chatbot integrated into WhatsApp to improve the customer experience

ai artificial-intelligence chatbot gpt natural-language-processing nlp openai python whatsapp whatsapp-bot

Last synced: 29 Jul 2024

https://github.com/fauziihsan/nlp-fauzi-2019

Crawling Twitter data using the Twitter API in PHP

crawling learning machine nlp php twitter

Last synced: 29 Jul 2024

https://github.com/love-irish/spellchecker

A Ruby spellchecker library that works well with Irish

irish nlp spellchecker

Last synced: 30 Jul 2024

https://github.com/nadinejackson1/newyou.ai

newyou.ai is an AI-powered chatbot built on Voiceflow, leveraging ChatGPT-4 for natural language understanding and response generation. It helps users set, track, and achieve their personal goals through seamless integration with WhatsApp.

ai artificial-intelligence chatgpt-4 conversational-ai goal-setting natural-language-processing nlp personal-development voiceflow whatsapp

Last synced: 29 Jul 2024

https://github.com/AndMastro/EmergencyAwareness

Repository for the WIR project on emergency awareness on social media.

clustering information-retrieval machine-learning nlp twitter

Last synced: 29 Jul 2024