Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Natural language processing
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
- GitHub: https://github.com/topics/nlp
- Wikipedia: https://en.wikipedia.org/wiki/Natural_language_processing
- Created by: Alan Turing
- Aliases: natural-language-processing, nlp-machine-learning, nlp-resources,
- Last updated: 2024-11-09 00:20:12 UTC
- JSON Representation
https://github.com/phantominsights/summarizer
A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.
nlp praw python3 reddit-bot spacy web-scraper wordcloud
Last synced: 31 Oct 2024
https://github.com/tirthajyoti/web-database-analytics
Web scrapping and related analytics using Python tools
analytics beautifulsoup4 data-science data-wrangling database json json-parser natural-language-processing nlp python regular-expression sql sqlite3 web-scraping xml-parser
Last synced: 31 Oct 2024
https://github.com/stanford-oval/genie-server
The home server version of Almond
hacktoberfest nlp raspberrypi voice
Last synced: 11 Oct 2024
https://github.com/affjljoo3581/gpt2
PyTorch Implementation of OpenAI GPT-2
gpt2 language-model natural-language-generation natural-language-processing nlp pytorch transformer
Last synced: 09 Nov 2024
https://github.com/tirthajyoti/Web-Database-Analytics
Web scrapping and related analytics using Python tools
analytics beautifulsoup4 data-science data-wrangling database json json-parser natural-language-processing nlp python regular-expression sql sqlite3 web-scraping xml-parser
Last synced: 09 Nov 2024
https://github.com/ikegami-yukino/neologdn
Japanese text normalizer for mecab-neologd
japanese-language mecab-ipadic-neologd nlp preprocessing text-normalization
Last synced: 12 Oct 2024
https://github.com/amirshnll/Persian-Swear-Words
Persian Swear Dataset - you can use in your production to filter unwanted content. دیتاست کلمات نامناسب و بد فارسی برای فیلتر کردن متن ها
dataset datasets farsi farsiswear farsiswearword nlp nlp-dataset persian persiandataset persianswearword swear sweardataset swearword
Last synced: 04 Aug 2024
https://github.com/quadrismegistus/prosodic
Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.
finnish-language-analysis linguistics metrical-parser nlp poetry rhythm
Last synced: 30 Oct 2024
https://github.com/neuml/txtchat
💭 Retrieval augmented generation (RAG) and language model powered search applications
large-language-models llm machine-learning nlp python rag retrieval-augmented-generation search txtai
Last synced: 28 Oct 2024
https://github.com/hsankesara/deepresearch
This repository is the collection of research papers in Deep learning, computer vision and NLP.
computer-vision deep-learning keras machine-learning nlp nueral-networks python3 research-paper
Last synced: 27 Oct 2024
https://github.com/jenojp/negspacy
spaCy pipeline object for negating concepts in text
negation negation-phrases negex nlp python spacy spacy-extension spacy-pipeline
Last synced: 14 Oct 2024
https://github.com/AMontgomerie/question_generator
An NLP system for generating reading comprehension questions
bert natural-language-generation natural-language-processing nlg nlp question-generation t5 transformers
Last synced: 02 Aug 2024
https://github.com/deepset-ai/haystack-tutorials
Here you can find all the Tutorials for Haystack 📓
generative-qa haystack llm nlp semantic-search text-generation tutorials
Last synced: 06 Nov 2024
https://github.com/tensorchord/modelz-llm
OpenAI compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)
llm nlp openai-api transformer
Last synced: 09 Nov 2024
https://github.com/yohasebe/engtagger
English Part-of-Speech Tagger Library; a Ruby port of Lingua::EN::Tagger
english nlp pos-tagging ruby rubynlp
Last synced: 08 Nov 2024
https://github.com/rameshaditya/scoper
Fuzzy and semantic search for captioned YouTube videos.
fuzzy-search machine-learning ml nlp search search-algorithm semantic youtube youtube-api
Last synced: 09 Nov 2024
https://github.com/sakuranew/BERT-AttributeExtraction
USING BERT FOR Attribute Extraction in KnowledgeGraph. fine-tuning and feature extraction. 使用基于bert的微调和特征提取方法来进行知识图谱百度百科人物词条属性抽取。
ai attribute-extraction bert deeplearning feature-extraction fine-tuning knowledge-graph nlp relation-extraction
Last synced: 02 Nov 2024
https://github.com/bjascob/lemminflect
A python module for English lemmatization and inflection.
inflection lemmatization nlp nlp-machine-learning python spacy spacy-extensions
Last synced: 14 Oct 2024
https://github.com/esteininger/vector-search
The definitive guide to using Vector Search to solve your semantic search production workload needs.
lucene nlp search-engine vector-search
Last synced: 07 Nov 2024
https://github.com/pen-ho/medical_knowledge_graph_app-master
医药知识图谱自动问答系统实现,包括构建知识图谱、基于知识图谱的流水线问答以及前端实现。实体识别(基于词典+BERT_CRF)、实体链接(Sentence-BERT做匹配)、意图识别(基于提问词+领域词词典)。
django-application echarts entity-linking kbqa kgqa knowledge-graph mention-detection neo4j ner nlp pytorch-transformers relation-detection relation-extraction
Last synced: 10 Oct 2024
https://github.com/lucasjinreal/weibo_terminator_workflow
Update Version of weibo_terminator, This is Workflow Version aim at Get Job Done!
crawler nlp scraper sentiment-analysis weibo-terminator
Last synced: 06 Nov 2024
https://github.com/adamlui/chatgpt-apps
🤖 Apps that utilize the astounding power of ChatGPT or enhance its UX
ai artificialintelligence brave brave-search chat chatbot chatgpt chatgpt3 chatgpt35-turbo duckduckgo gpt-3 gpt-4 gpt3 greasemonkey javascript machine-learning ml nlp openai userscripts
Last synced: 28 Sep 2024
https://github.com/lucasxlu/LagouJob
Data Analysis & Mining for lagou.com
data-analysis data-mining lagou machine-learning nlp python3 web-crawler
Last synced: 06 Aug 2024
https://github.com/ahmedbesbes/character-based-cnn
Implementation of character based convolutional neural network
character-based-model character-cnn convolutional-neural-network deep-neural-networks natural-language-processing nlp nlp-machine-learning paper-implementations pytorch youtube-video
Last synced: 11 Oct 2024
https://github.com/dataqa/nlp-labelling
Labelling platform for text using weak supervision.
annotation-tool data-labeling data-science learning-with-limited-labeled-data learning-with-noisy-labels natural-language-processing ner nlp nlp-machine-learning pseudo-labeling search-engine text-annotation-tool text-classification text-mining weak-supervision
Last synced: 29 Oct 2024
https://github.com/grumpyp/aixplora
AIxplora is a open-source tool which let's you query all kind of files not limited to any length or format.
audio chat chatbot chatgpt embeddings embeddings-model generativeai llm llms nlp openai ownfiles pdf question-answering search second-brain vectorstore
Last synced: 06 Nov 2024
https://github.com/opensemanticsearch/open-semantic-etl
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
annotation documents elasticsearch enrichment etl extract extract-information extract-text extractor ingest ingestion-pipeline ingests-documents named-entity-recognition nlp ocr pdf python rdf solr solr-dataimporter
Last synced: 26 Oct 2024
https://github.com/aryn-ai/sycamore
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
ai dataprep etl information-retrieval llm ml nlp opensearch search semantic-search
Last synced: 18 Aug 2024
https://github.com/hmunachi/nanodl
A Jax-based library for designing and training transformer models from scratch.
attention attention-mechanism deep-learning distributed-training flax gpt jax llama machine-learning mistral nlp transformer
Last synced: 10 Oct 2024
https://github.com/Psarpei/Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
algorithms computer-science computer-vision computer-vision-algorithms computer-vision-opencv deep-learning image-processing machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning ocr ocr-python ocr-recognition table-detection table-detection-using-deep-learning table-structure-recognition
Last synced: 06 Nov 2024
https://github.com/linonetwo/segmentit
任何 JS 环境可用的中文分词包,fork from leizongmin/node-segment
chinese chinese-nlp nlp segmentation
Last synced: 02 Aug 2024
https://github.com/gabeur/mmt
Multi-Modal Transformer for Video Retrieval
fusion language multimodal nlp video vision
Last synced: 03 Aug 2024
https://github.com/amirbar/rnn.wgan
Code for training and evaluation of the model from "Language Generation with Recurrent Generative Adversarial Networks without Pre-training"
gan gans nlp text-generation text-generator wgan
Last synced: 31 Oct 2024
https://github.com/abelriboulot/onnxt5
Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
inference nlp nlp-machine-learning onnx onnxruntime sentiment-analysis summarization text-classification text-generation transformer transformers translation
Last synced: 07 Nov 2024
https://github.com/tomasonjo/langchain2neo4j
Integrating Neo4j database into langchain ecosystem
chatbot chatgpt gpt-3 gpt-4 langchain langchain-python neo4j nlp
Last synced: 26 Sep 2024
https://github.com/gmihaila/ml_things
This is where I put things I find useful that speed up my work with Machine Learning. Ever looked in your old projects to reuse those cool functions you created before? Well, this repo is designed to be a Python Library of functions I created in my previous project that can be reused. I also share some Notebooks Tutorials and Python Code Snippets.
google-colab machine-learning nlp nlp-machine-learning notebooks python-snippets pytorch snippets transformer
Last synced: 11 Oct 2024
https://github.com/30lm32/ml-projects
ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python
ab-testing deep-learning docker gensim geolocation imbalanced-data kdtree keras lstm-neural-networks machine-learning mlflow nlp random-forest spam-classification svm tensorboard tensorflow text-classification timeseries-analysis word2vec
Last synced: 03 Aug 2024
https://github.com/dongjunlee/text-cnn-tensorflow
Convolutional Neural Networks for Sentence Classification(TextCNN) implements by TensorFlow
classification deep-learning hb-experiment nlp sentiment-analysis tensorflow tensorflow-models text-cnn
Last synced: 10 Oct 2024
https://github.com/oxford-cs-deepnlp-2017/practical-1
Oxford Deep NLP 2017 course - Practical 1: word2vec
deep-learning natural-language-processing nlp oxford word2vec
Last synced: 07 Aug 2024
https://github.com/akanyaani/gpt-2-tensorflow2.0
OpenAI GPT2 pre-training and sequence prediction implementation in Tensorflow 2.0
gpt gpt-2 gpt2 implementation nlp openai pre-training pretraining tensorflow tensorflow2 text-generation transformer
Last synced: 02 Aug 2024
https://github.com/quanteda/spacyr
R wrapper to spaCy NLP
extract-entities nlp r spacy speech-tagging
Last synced: 14 Oct 2024
https://github.com/natasha/razdel
Rule-based token, sentence segmentation for Russian language
nlp python russian sentence-boundary-detection sentence-segmentation tokenization
Last synced: 10 Nov 2024
https://github.com/gandersen101/spaczz
Fuzzy matching and more functionality for spaCy.
ai artificial-intelligence data-science fuzzy-matching natural-language-processing nlp nlp-library python rapidfuzz regex spacy spacy-extension spacy-extensions
Last synced: 14 Oct 2024
https://github.com/cbaziotis/neat-vision
Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Processing (NLP) tasks. (framework-agnostic)
attention attention-mechanism attention-mechanisms attention-scores attention-visualization deep-learning deep-learning-library deep-learning-visualization natural-language-processing nlp self-attention self-attentive-rnn text-visualization visualization vuejs
Last synced: 06 Nov 2024
https://github.com/irlab-sdu/fuzi.mingcha
夫子•明察司法大模型是由山东大学、浪潮云、中国政法大学联合研发,以 ChatGLM 为大模型底座,基于海量中文无监督司法语料与有监督司法微调数据训练的中文司法大模型。该模型支持法条检索、案例分析、三段论推理判决以及司法对话等功能,旨在为用户提供全方位、高精准的法律咨询与解答服务。
chatglm-6b judicial large-language-models legal legal-ai legalai llms nlp pretrained-models
Last synced: 02 Nov 2024
https://github.com/princeton-nlp/WebShop
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
decision-making language language-grounding ml nlp rl rl-environment shopping sim-to-real web-based
Last synced: 09 Nov 2024
https://github.com/amanchadha/coursera-natural-language-processing-specialization
Programming assignments from all courses in the Coursera Natural Language Processing Specialization offered by deeplearning.ai.
artificial-intelligence assignments course coursera coursera-assignment coursera-specialization deep-learning deeplearning deeplearning-ai machine-learning natural-language natural-language-processing natural-language-understanding nlp nlp-machine-learning sentiment-analysis specialization word-embeddings word-vectors
Last synced: 05 Nov 2024
https://github.com/likejazz/siamese-lstm
Siamese LSTM for evaluating semantic similarity between sentences of the Quora Question Pairs Dataset.
Last synced: 29 Oct 2024
https://github.com/kyubyong/nlp_made_easy
Explains nlp building blocks in a simple manner.
Last synced: 10 Nov 2024
https://github.com/webanno/webanno
🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the end of the line. -- 🚀 To migrate, export your annotation projects from WebAnno, then import them into INCEpTION and just work on.
annotation annotation-editor annotation-tool java nlp web-application
Last synced: 29 Oct 2024
https://github.com/PlanTL-GOB-ES/lm-spanish
Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).
benchmarks corpora embeddings language-model nlp transformers
Last synced: 05 Aug 2024
https://github.com/zilliztech/akcio
Akcio is a demonstration project for Retrieval Augmented Generation (RAG). It leverages the power of LLM to generate responses and uses vector databases to fetch relevant documents to enhance the quality and relevance of the output.
artificial-intelligence chatbot chatgpt dolly embeddings ernie-bot fastapi gradio langchain llm milvus minimax nlp openai retrieval-augmented-generation retrieval-chatbot semantic-search towhee
Last synced: 09 Aug 2024
https://github.com/neomatrix369/nlp_profiler
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
google-colab grammar-checks hacktoberfest jupyter kaggle-kernels natural-language-processing nlp nlp-keywords-extraction nlp-library nlp-machine-learning nlp-parsing nlp-profiler profiler profiling profiling-datasets text-mining
Last synced: 22 Oct 2024
https://webanno.github.io/webanno/
🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the end of the line. -- 🚀 To migrate, export your annotation projects from WebAnno, then import them into INCEpTION and just work on.
annotation annotation-editor annotation-tool java nlp web-application
Last synced: 28 Oct 2024
https://github.com/gyunggyung/AGI-Papers
Papers and Book to look at when starting AGI 📚
all-to-all dialogue distillation efficient llm multimodal multiple-tasks nlg nlp sentence-embeddings sentence-similarity stable-diffusion text-to-video tts
Last synced: 20 Oct 2024
https://github.com/mead-ml/mead-baseline
Deep-Learning Model Exploration and Development for NLP
baseline bert classification convolutional-neural-networks deep-learning deep-learning-architectures experimentation hacktoberfest keras language-model machine-learning nlp nlp-tasks pytorch recurrent-neural-networks seq2seq tensorflow transformers visdom
Last synced: 11 Oct 2024
https://github.com/notAI-tech/fastPunct
Punctuation restoration and spell correction experiments.
attention auto-punctuation deep-learning nlp punctuation punctuation-correction punctuation-marks punctuation-restoration spellchecker spelling-correction text text-correction
Last synced: 09 Aug 2024
https://github.com/mpuig/spacy-lookup
Named Entity Recognition based on dictionaries
named-entity-recognition natural-language-processing ner nlp spacy spacy-extension spacy-pipeline
Last synced: 31 Oct 2024
https://github.com/davidberenstein1957/concise-concepts
This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.
few-shot-classifcation gensim hacktoberfest machine-learning natural-language-processing ner nlp spacy
Last synced: 01 Nov 2024
https://github.com/backprop-ai/backprop
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
bert fine-tuning image-classification language-model multilingual-models natural-language-processing nlp question-answering text-classification transfer-learning transformers
Last synced: 02 Aug 2024
https://github.com/explosion/spacy-services
💫 REST microservices for various spaCy-related tasks
falcon natural-language-processing nlp rest-api rest-microservice spacy
Last synced: 25 Sep 2024
https://github.com/samueldobbie/markup
A web-based document annotation tool, powered by GPT-4 :rocket:
active-learning annotation-tool data-labeling data-science gpt-4 machine-learning named-entity-recognition natural-language-processing ner nlp sequence-to-sequence text-annotation text-annotation-tool
Last synced: 27 Oct 2024
https://github.com/kanyun-inc/fairseq-gec
Source code for paper: Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Last synced: 09 Nov 2024
https://github.com/indiejoseph/cnn-text-classification-tf-chinese
CNN for Chinese Text Classification in Tensorflow
chinese cnn convolutional-neural-networks deep-learning nlp tensorflow text-classification
Last synced: 02 Aug 2024
https://github.com/lucasmccabe/emailgpt
a quick and easy interface to generate emails with ChatGPT
chatgpt gpt nlp openai productivity streamlit
Last synced: 13 Oct 2024
https://github.com/vngrs-ai/vnlp
State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.
deasciifier deep-learning dependency-parsing fasttext morphological-analysis morphological-disambiguation named-entity-recognition nlp normalization number-to-words part-of-speech-tagging sentence-splitting sentence-tokenizer sentiment-analysis spelling-correction stemming stopword-removal turkish-nlp word-embeddings word2vec
Last synced: 10 Oct 2024
https://github.com/IBM/transition-amr-parser
SoTA Abstract Meaning Representation (AMR) parsing with word-node alignments in Pytorch. Includes checkpoints and other tools such as statistical significance Smatch.
abstract-meaning-representation amr amr-graphs amr-parser amr-parsing machine-learning nlp semantic-parsing
Last synced: 02 Aug 2024
https://github.com/devmount/germanwordembeddings
Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.
deep-learning deep-neural-networks evaluation gensim german-language model natural-language-processing neural-network nlp training word-embeddings word2vec
Last synced: 01 Nov 2024
https://github.com/hxu296/nlp-resume-parser
NLP-powered, GPT-3 enabled Resume Parser from PDF to JSON.
gpt-3 nlp nlp-parsing open-ai parser resume resume-parer
Last synced: 09 Nov 2024
https://github.com/digiteinfotech/kairon
Conversational AI Platform to build effective Proactive Digital Assistants using Visual LLM Chaining
bot bot-framework botkit bots chatbot chatbot-framework chatbots conversational-agents conversational-ai conversational-bots gpt-3-5-turbo llm machine-learning machine-learning-library natural-language-understanding nlp nlu rasa rasa-nlu spacy
Last synced: 14 Oct 2024
https://github.com/as-ideas/headliner
🏖 Easy training and deployment of seq2seq models.
neural-network nlp python seq2seq tensorflow
Last synced: 07 Nov 2024
https://github.com/lukechilds/humanscript
A truly natural scripting language
ai artificial-intelligence gpt gpt-4 inferpreter interpreter language llama llama2 llm machine-learning nlp openai openai-api scripting-language
Last synced: 10 Oct 2024
https://github.com/lucasmccabe/emailGPT
a quick and easy interface to generate emails with ChatGPT
chatgpt gpt nlp openai productivity streamlit
Last synced: 07 Nov 2024
https://github.com/lgalke/vec4ir
Word Embeddings for Information Retrieval
data-science embedding-models embeddings evaluation information-retrieval natural-language-processing nlp retrieval-model similarity-scoring word-embeddings
Last synced: 02 Aug 2024
https://github.com/vrasneur/pyfasttext
Yet another Python binding for fastText
fasttext machine-learning nlp numpy python python-bindings word-vectors
Last synced: 07 Nov 2024
https://github.com/BLLIP/bllip-parser
BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser) See http://pypi.python.org/pypi/bllipparser/ for Python module.
ai artificial-intelligence computational-linguistics machine-learning natural-language-processing nlp nlp-library parsing
Last synced: 30 Oct 2024
https://github.com/swabhs/open-sesame
A frame-semantic parsing system based on a softmax-margin SegRNN.
crf deep-learning dynet frame-semantic-parsing natural-language-processing nlp python27
Last synced: 12 Oct 2024
https://github.com/maxim5/cs224n-2017-winter
All lecture notes, slides and assignments from CS224n: Natural Language Processing with Deep Learning class by Stanford
cs224n deep-learning machine-learning nlp stanford-nlp
Last synced: 05 Nov 2024
https://github.com/hppRC/bert-classification-tutorial
【2023年版】BERTによるテキスト分類
bert deep-learning japanese nlp python pytorch transformers
Last synced: 06 Nov 2024
https://github.com/hpprc/bert-classification-tutorial
【2023年版】BERTによるテキスト分類
bert deep-learning japanese nlp python pytorch transformers
Last synced: 01 Nov 2024
https://github.com/daac-tools/vaporetto
🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer
analyzer japanese morphological-analysis nlp rust segmentation tokenization tokenizer
Last synced: 07 Nov 2024
https://github.com/houbb/pinyin
The high performance pinyin tool for java.(java 高性能中文转拼音工具。支持同音字。)
dfa high-performance nlp pinyin pinyin-analysis pinyin-data pinyin-segmentation pinyin4j segment tiny tiny-pinyin tongyinzi
Last synced: 07 Nov 2024
https://github.com/fedml-ai/fednlp
FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Research Version is Accepted to NAACL 2022
federated-learning machine-learning natural-language-processing nlp
Last synced: 08 Nov 2024
https://github.com/kirralabs/indonesian-NLP-resources
data resource untuk NLP bahasa indonesia
corpus corpus-linguistics crawler dataset dependency-parser indonesian indonesian-language named-entity-recognition nlp parallel-corpus pos-tagging sentiment-analysis
Last synced: 08 Nov 2024
https://github.com/vzhong/embeddings
Fast, DB Backed pretrained word embeddings for natural language processing.
deep-learning neural-network nlp
Last synced: 30 Oct 2024
https://github.com/philipperemy/financial-news-dataset
Reuters and Bloomberg
bloomberg dataset nlp nlp-machine-learning reuters trading trading-strategies
Last synced: 22 Oct 2024
https://github.com/FedML-AI/FedNLP
FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Research Version is Accepted to NAACL 2022
federated-learning machine-learning natural-language-processing nlp
Last synced: 02 Aug 2024
https://github.com/natasha/slovnet
Deep Learning based NLP modeling for Russian language
bert deep-learning machine-learning morphology ner nlp python pytorch russian syntax
Last synced: 11 Oct 2024
https://github.com/mindflowai/mindflow
🧠 AI-powered CLI git wrapper, boilerplate code generator, chat history manager, and code search engine to streamline your dev workflow 🌊
chat-gpt cli code-generation command-line-interface dev-tools git git-wrapper information-retrieval large-language-models llm machine-learning modern-dev-tools nlp openai openai-api python search search-engine
Last synced: 29 Oct 2024
https://github.com/maxent-ai/ocrpy
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
aws azure computer-vision cv deep-learning google-vision-api image-processing information-retrieval nlp ocr ocr-python python semantic-search tesseract-ocr transformers
Last synced: 07 Nov 2024
https://github.com/sunyilgdx/NSP-BERT
The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"
bert correference-resolution entity-linking entity-typing natural-language-inference nlp prompt-learning sentence-classification sentiment-analysis tensorflow text-classification zero-shot
Last synced: 03 Aug 2024
https://github.com/openvenues/node-postal
NodeJS bindings to libpostal for fast international address parsing/normalization
address address-parser binding international native nlp
Last synced: 09 Nov 2024
https://github.com/soskek/bert-chainer
Chainer implementation of "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
bert chainer google natural-language-processi natural-language-understanding nlp transformer
Last synced: 02 Nov 2024
https://github.com/jieyuz2/wrench
[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark
benchmark-framework data-centric-ai data-programming dataset deep-learning machine-learning nlp robust-learning sequence-labeling weak-supervision weakly-supervised-learning
Last synced: 30 Oct 2024
https://github.com/mmxgn/spacy-clausie
Implementation of the ClausIE information extraction system for python+spacy
clausie information-extraction nlp problog python-spacy spacy
Last synced: 30 Sep 2024
https://github.com/otuncelli/turkish-stemmer-python
:snake: Turkish Language Stemmer for Python
language natural-language-processing nlp stemming-algorithm turkish-language
Last synced: 02 Aug 2024
https://github.com/IngestAI/embedditor
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
datapreprocessing datascience embedding-vectors embeddings genai laravel llm markup-language ml nlp nltk php vector-database vector-search vectorization veml
Last synced: 31 Oct 2024
https://github.com/JieyuZ2/wrench
[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark
benchmark-framework data-centric-ai data-programming dataset deep-learning machine-learning nlp robust-learning sequence-labeling weak-supervision weakly-supervised-learning
Last synced: 03 Oct 2024
https://github.com/naver/claf
CLaF: Open-Source Clova Language Framework
clova framework language natural-language-processing nlp pytorch
Last synced: 08 Nov 2024