Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Natural language processing
Natural language processing (NLP) is a field of computer science that studies how computers process and interact with human language. In 1950, Alan Turing published an article proposing a measure of machine intelligence, now called the Turing test. More modern techniques, such as deep learning, have advanced language modeling, parsing, and other natural-language tasks.
- GitHub: https://github.com/topics/nlp
- Wikipedia: https://en.wikipedia.org/wiki/Natural_language_processing
- Created by: Alan Turing
- Aliases: natural-language-processing, nlp-machine-learning, nlp-resources
- Last updated: 2024-07-29 13:51:14 UTC
https://github.com/rosette-api/ruby
Rosette API Client Library for Ruby
deduplication entity-extraction language-identification machine-learning morphology named-entity-recognition natural-language-processing nlp ruby sentiment-analysis text-analytics text-embedding tokenization
Last synced: 03 Aug 2024
https://github.com/retraigo/appraisal
Machine Learning utilities for TypeScript
encoding machine-learning nlp tf-idf typescript
Last synced: 04 Aug 2024
https://github.com/davikawasaki/utfpr-ce-undergrad-final-project
UTFPR Computer Engineering Undergrad Final Project - Computing Exam Questions Classification Using Natural-Language Processing
adaptive-teaching computing-classification machine-learning natural-language-processing nlp nltk python sklearn
Last synced: 30 Jul 2024
https://github.com/rosette-api-community/text-embeddings-sample
A short Python example showing how to compute similarity between word embeddings returned by the Rosette API's new /text-embedding endpoint.
machine-learning natural-language-processing nlp python text-embedding text-extraction text-similarity word-similarity
Last synced: 03 Aug 2024
https://github.com/evelinacs/semantic_parsing_with_IRTGs
Experiments in developing an IRTG that simultaneously encodes transformations between phrase structure trees, dependency graphs, and semantic graphs.
computational-linguistics dependency-graph grammar grammar-rules graph-transformation irtg natural-language-processing nlp penn-treebank phrase-structure-tree python python3 rule-based semantic-parsing surface-realization universal-dependencies
Last synced: 07 Aug 2024
https://github.com/Ermlab/polish-word-embeddings-review
Evaluation of Polish word embeddings prepared by various research groups, using a word analogy task.
computational-linguistics deep-learning fasttext machine-learning nlp polish-language word2vec wordembeddings
Last synced: 31 Jul 2024
https://github.com/krrish-v/mark_importer
Assigns a category to every imported bookmark using an AI model, making the bookmarks easier to manage.
bookmarks bookmarks-manager nlp
Last synced: 01 Aug 2024
https://github.com/kasnerz/lightnlg
A minimalistic codebase for training NLG models from HuggingFace Transformers using PyTorch Lightning.
bart generation gpt-2 language-model nlg nlp pytorch pytorch-lightning
Last synced: 04 Aug 2024
https://github.com/notAI-tech/Anuvaad
State-of-the-art open-source translation for Indic languages.
hindi india indic-languages kannada malayalam marathi mt5 multilingual nlp tamil telugu transformer transformers translation
Last synced: 03 Aug 2024
https://github.com/nicolasdz/UNGDC
The UN General Debate Corpus (UNGDC) is a dataset of all speeches given at the high-level UN forum usually held in September of each year.
diplomacy international-relations nlp united-nations
Last synced: 01 Aug 2024
https://github.com/kaydotdev/in-a-word-bot
A Telegram bot for text summary generation
aiogram bot nlp python summary telegram-bot tldr
Last synced: 29 Jul 2024
https://github.com/nisheethjaiswal/Data-Annotator-for-SpaCy
🚀 SpAnnor: an easy-to-use annotation tool for Named Entity Recognition. The annotator lets users quickly assign custom labels to one or more entities in the text. Easy to set up for preparing spaCy training data 🔥.
data-annotation data-annotation-tools data-labeling data-preparation named-entity-recognition nlp spacy-nlp text-labeling
Last synced: 17 Aug 2024
https://github.com/cortega26/PDF-Text-Analizer
This repository houses a script that can download PDFs from a specified URL, convert them to text, and perform text analysis. This analysis includes identifying the language, eliminating stopwords, and counting word and phrase frequency. It's worth noting that the script is capable of analyzing texts in multiple languages.
nlp ocr pdf pdf-converter text-analysis text-mining text-summarization
Last synced: 07 Aug 2024
https://github.com/gtoffoli/commons-textanalysis
Text-analysis support for Django clients, talking through HTTP API to an extended spaCy deployment.
django nlp python spacy text-analysis
Last synced: 07 Aug 2024
https://github.com/1994nikunj/nlp-toolkit-desktop-app
The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.
data-analysis n-grams network-visualization nlp python text-cleaning topic-modeling wordcloud-generator
Last synced: 07 Aug 2024
https://github.com/pharo-ai/ngram
N-gram functionality for Pharo
language-modeling natural-language-processing ngrams nlp pharo
Last synced: 03 Aug 2024
https://github.com/neomatrix369/nlp-java-jvm-example
A repo with examples of NLP libraries, packages, and frameworks written in Java and other JVM languages
bash clojure docker graal graalvm java jvm kotlin natural-language-processing natural-language-understanding nlp scala shell
Last synced: 03 Aug 2024
https://github.com/euskadi31/go-ngram
An n-gram is a contiguous sequence of n items from a given sequence of text or speech.
go golang golang-library machine-learning ngram ngram-analysis ngrams nlp
Last synced: 02 Aug 2024
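The n-gram definition in the entry above can be illustrated with a minimal Python sketch. This is not the API of go-ngram or any other listed project; the function name `ngrams` and the sample sentence are assumptions chosen only for illustration, and tokens are obtained by simple whitespace splitting.

```python
from typing import List, Sequence, Tuple


def ngrams(items: Sequence[str], n: int) -> List[Tuple[str, ...]]:
    """Return every contiguous run of n items, in their original order."""
    if n <= 0:
        raise ValueError("n must be positive")
    # A sequence of length L has L - n + 1 windows of size n (none if L < n).
    return [tuple(items[i:i + n]) for i in range(len(items) - n + 1)]


# Hypothetical input: whitespace tokenization of a short sentence.
tokens = "the quick brown fox".split()
bigrams = ngrams(tokens, 2)
print(bigrams)
# [('the', 'quick'), ('quick', 'brown'), ('brown', 'fox')]
```

The same function works on characters instead of words if a string is passed in place of the token list.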
https://github.com/SoftCreatR/php-mistral-ai-sdk
A powerful and easy-to-use PHP SDK for the Mistral AI API, allowing seamless integration of advanced AI-powered features into your PHP projects.
ai api api-client api-wrapper artificial-intelligence gpt gpt-3 gpt-4 hacktoberfest llama machine-learning mistral natural-language-processing nlp perplexity php replit sdk
Last synced: 09 Aug 2024
https://github.com/rosette-api-community/rosette-for-excel
Microsoft Excel add-in that implements many endpoints through ribbon functions and formula support
entity-extraction excel machine-learning natural-language-processing nlp nlp-apis rosette text-analytics
Last synced: 03 Aug 2024
https://github.com/sinaahmadi/ScriptNormalization
Script Normalization for Unconventional Writing of Perso-Arabic scripts (ACL2023)
acl2023 arabic azeri gilaki gorani kashmiri kurdish kurdish-language-processing kurmanji less-resource-languages mazanderani nlp persian preprocessing script-normalization sindhi sorani turkish urdu
Last synced: 03 Aug 2024
https://github.com/bhattbhavesh91/texthero-demo
Tutorial demonstrating Texthero, a library for text preprocessing, representation, and visualization, from zero to hero.
nlp nlp-pipeline text-clustering text-mining text-preprocessing text-representation text-visualization texthero texthero-tutorial word-embeddings
Last synced: 01 Aug 2024
https://github.com/jonaschn/topbox
Python 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with Labeled Latent Dirichlet Allocation (L-LDA, LLDA, sLDA).
labeled-lda nlp stanford-nlp stanford-tmt topic-modeling
Last synced: 02 Aug 2024
https://github.com/mohadese-yousefi/spell-correction
Simple autocorrection of misspelled words based on distance.
Last synced: 04 Aug 2024
https://github.com/srstevenson/keyword-extractor
Extract keywords from plain text documents
Last synced: 04 Aug 2024
https://github.com/AVoss84/pdf_extract
Text classification based on PDF inputs
classification fastapi nlp python streamlit
Last synced: 01 Aug 2024
https://github.com/toshimelonhead/Springboard-Berkshire
Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)
gensim nlp spacy springboard textacy topic-modeling
Last synced: 13 Aug 2024
https://github.com/tomhalloin/Springboard-Berkshire
Topic model analysis of Berkshire Hathaway annual letters (Completed Capstone Project #2)
gensim nlp spacy springboard textacy topic-modeling
Last synced: 03 Aug 2024
https://github.com/ashishu007/NLP-Tasks
Various NLP tasks using Huggingface and Flask
flask natural-language-processing nlp pytorch question-answering
Last synced: 01 Aug 2024
https://github.com/seanghay/khmerphonemizer
A free, standalone, and open-source Khmer grapheme-to-phoneme converter.
cambodia khmer khmer-language nlp phonemes
Last synced: 01 Aug 2024
https://github.com/Mel-iza/The-Natural-Language-Processing-Workshop
Exercises and notes from the book The Natural Language Processing Workshop
nlp text-analysis text-classification text-processing
Last synced: 01 Aug 2024
https://nevmenandr.github.io/bashwiki-report/
Report on a computational analysis of the Bashkir Wikipedia, undertaken in 2013
bashkir minority-language natural-language-processing nlp
Last synced: 03 Aug 2024
https://github.com/Alvayang923/Elon_NLP
An NLP-based exploratory analysis of Elon Musk's tweets and communication on Twitter
Last synced: 29 Jul 2024
https://github.com/DavidLee95/whatsapp_chatbot
A customized AI chatbot integrated into WhatsApp to improve the customer experience
ai artificial-intelligence chatbot gpt natural-language-processing nlp openai python whatsapp whatsapp-bot
Last synced: 29 Jul 2024
https://github.com/github/CodeSearchNet
Datasets, tools, and benchmarks for representation learning of code.
bert cnn data data-science datasets deep-learning machine-learning machine-learning-on-source-code ml natural-language-processing neural-networks nlp nlp-machine-learning open-data programming-language-theory python representation-learning rnn self-attention tensorflow
Last synced: 30 Jul 2024
https://github.com/love-irish/spellchecker
A Ruby spellchecker library that works well with Irish
Last synced: 30 Jul 2024
https://github.com/nadinejackson1/newyou.ai
newyou.ai is an AI-powered chatbot built on Voiceflow, leveraging ChatGPT-4 for natural language understanding and response generation. It helps users set, track, and achieve their personal goals through seamless integration with WhatsApp.
ai artificial-intelligence chatgpt-4 conversational-ai goal-setting natural-language-processing nlp personal-development voiceflow whatsapp
Last synced: 29 Jul 2024
https://github.com/AndMastro/EmergencyAwareness
Repository regarding WIR project about emergency awareness on social media.
clustering information-retrieval machine-learning nlp twitter
Last synced: 29 Jul 2024