Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Natural language processing
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
- GitHub: https://github.com/topics/nlp
- Wikipedia: https://en.wikipedia.org/wiki/Natural_language_processing
- Created by: Alan Turing
- Aliases: natural-language-processing, nlp-machine-learning, nlp-resources,
- Last updated: 2024-11-15 00:20:20 UTC
- JSON Representation
https://github.com/kanyun-inc/fairseq-gec
Source code for paper: Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data
Last synced: 16 Nov 2024
https://github.com/indiejoseph/cnn-text-classification-tf-chinese
CNN for Chinese Text Classification in Tensorflow
chinese cnn convolutional-neural-networks deep-learning nlp tensorflow text-classification
Last synced: 14 Nov 2024
https://github.com/ajdavidl/Portuguese-NLP
List of resources and tools developed with focus on Portuguese.
nlp portuguese portuguese-language
Last synced: 12 Nov 2024
https://github.com/vngrs-ai/vnlp
State-of-the-art, lightweight NLP tools for Turkish language. Developed by VNGRS.
deasciifier deep-learning dependency-parsing fasttext morphological-analysis morphological-disambiguation named-entity-recognition nlp normalization number-to-words part-of-speech-tagging sentence-splitting sentence-tokenizer sentiment-analysis spelling-correction stemming stopword-removal turkish-nlp word-embeddings word2vec
Last synced: 10 Oct 2024
https://github.com/lucasmccabe/emailgpt
a quick and easy interface to generate emails with ChatGPT
chatgpt gpt nlp openai productivity streamlit
Last synced: 13 Oct 2024
https://github.com/devmount/germanwordembeddings
Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.
deep-learning deep-neural-networks evaluation gensim german-language model natural-language-processing neural-network nlp training word-embeddings word2vec
Last synced: 15 Nov 2024
https://github.com/telekom/create-tsi
Create-tsi is a generative AI RAG toolkit which generates AI Applications with low code.
ai chatbot llm machine-learning nlp openai-api rag transformer
Last synced: 17 Nov 2024
https://github.com/hxu296/nlp-resume-parser
NLP-powered, GPT-3 enabled Resume Parser from PDF to JSON.
gpt-3 nlp nlp-parsing open-ai parser resume resume-parer
Last synced: 09 Nov 2024
https://github.com/seopbo/nlp_classification
Implementing nlp papers relevant to classification with PyTorch, gluonnlp
classification korean-nlp nlp pytorch-implementation pytorch-nlp text-classification
Last synced: 15 Nov 2024
https://github.com/digiteinfotech/kairon
Conversational AI Platform to build effective Proactive Digital Assistants using Visual LLM Chaining
bot bot-framework botkit bots chatbot chatbot-framework chatbots conversational-agents conversational-ai conversational-bots gpt-3-5-turbo llm machine-learning machine-learning-library natural-language-understanding nlp nlu rasa rasa-nlu spacy
Last synced: 14 Oct 2024
https://github.com/as-ideas/headliner
🏖 Easy training and deployment of seq2seq models.
neural-network nlp python seq2seq tensorflow
Last synced: 07 Nov 2024
https://github.com/lukechilds/humanscript
A truly natural scripting language
ai artificial-intelligence gpt gpt-4 inferpreter interpreter language llama llama2 llm machine-learning nlp openai openai-api scripting-language
Last synced: 10 Oct 2024
https://github.com/lucasmccabe/emailGPT
a quick and easy interface to generate emails with ChatGPT
chatgpt gpt nlp openai productivity streamlit
Last synced: 07 Nov 2024
https://github.com/BLLIP/bllip-parser
BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser) See http://pypi.python.org/pypi/bllipparser/ for Python module.
ai artificial-intelligence computational-linguistics machine-learning natural-language-processing nlp nlp-library parsing
Last synced: 30 Oct 2024
https://github.com/vrasneur/pyfasttext
Yet another Python binding for fastText
fasttext machine-learning nlp numpy python python-bindings word-vectors
Last synced: 07 Nov 2024
https://github.com/swabhs/open-sesame
A frame-semantic parsing system based on a softmax-margin SegRNN.
crf deep-learning dynet frame-semantic-parsing natural-language-processing nlp python27
Last synced: 12 Oct 2024
https://github.com/lgalke/vec4ir
Word Embeddings for Information Retrieval
data-science embedding-models embeddings evaluation information-retrieval natural-language-processing nlp retrieval-model similarity-scoring word-embeddings
Last synced: 11 Nov 2024
https://github.com/maxim5/cs224n-2017-winter
All lecture notes, slides and assignments from CS224n: Natural Language Processing with Deep Learning class by Stanford
cs224n deep-learning machine-learning nlp stanford-nlp
Last synced: 05 Nov 2024
https://github.com/hpprc/bert-classification-tutorial
【2023年版】BERTによるテキスト分類
bert deep-learning japanese nlp python pytorch transformers
Last synced: 15 Nov 2024
https://github.com/daac-tools/vaporetto
🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer
analyzer japanese morphological-analysis nlp rust segmentation tokenization tokenizer
Last synced: 07 Nov 2024
https://github.com/hppRC/bert-classification-tutorial
【2023年版】BERTによるテキスト分類
bert deep-learning japanese nlp python pytorch transformers
Last synced: 06 Nov 2024
https://github.com/kirralabs/indonesian-NLP-resources
data resource untuk NLP bahasa indonesia
corpus corpus-linguistics crawler dataset dependency-parser indonesian indonesian-language named-entity-recognition nlp parallel-corpus pos-tagging sentiment-analysis
Last synced: 08 Nov 2024
https://github.com/fedml-ai/fednlp
FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Research Version is Accepted to NAACL 2022
federated-learning machine-learning natural-language-processing nlp
Last synced: 08 Nov 2024
https://github.com/FedML-AI/FedNLP
FedNLP: An Industry and Research Integrated Platform for Federated Learning in Natural Language Processing, Backed by FedML, Inc. The Previous Research Version is Accepted to NAACL 2022
federated-learning machine-learning natural-language-processing nlp
Last synced: 11 Nov 2024
https://github.com/sunyilgdx/NSP-BERT
The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"
bert correference-resolution entity-linking entity-typing natural-language-inference nlp prompt-learning sentence-classification sentiment-analysis tensorflow text-classification zero-shot
Last synced: 16 Nov 2024
https://github.com/philipperemy/financial-news-dataset
Reuters and Bloomberg
bloomberg dataset nlp nlp-machine-learning reuters trading trading-strategies
Last synced: 22 Oct 2024
https://github.com/vzhong/embeddings
Fast, DB Backed pretrained word embeddings for natural language processing.
deep-learning neural-network nlp
Last synced: 13 Nov 2024
https://github.com/openfoodfacts/openfoodfacts-ai
This is a tracking repo for all our AI projects. 🍕 🤖🍼
artificial-intelligence computer-vision deep-learning food machine-learning neural-network nlp nutrition packaging photogrammetry prediction
Last synced: 11 Nov 2024
https://github.com/maxent-ai/ocrpy
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
aws azure computer-vision cv deep-learning google-vision-api image-processing information-retrieval nlp ocr ocr-python python semantic-search tesseract-ocr transformers
Last synced: 14 Nov 2024
https://github.com/natasha/slovnet
Deep Learning based NLP modeling for Russian language
bert deep-learning machine-learning morphology ner nlp python pytorch russian syntax
Last synced: 11 Oct 2024
https://github.com/otuncelli/turkish-stemmer-python
:snake: Turkish Language Stemmer for Python
language natural-language-processing nlp stemming-algorithm turkish-language
Last synced: 12 Nov 2024
https://github.com/mindflowai/mindflow
🧠 AI-powered CLI git wrapper, boilerplate code generator, chat history manager, and code search engine to streamline your dev workflow 🌊
chat-gpt cli code-generation command-line-interface dev-tools git git-wrapper information-retrieval large-language-models llm machine-learning modern-dev-tools nlp openai openai-api python search search-engine
Last synced: 29 Oct 2024
https://github.com/openvenues/node-postal
NodeJS bindings to libpostal for fast international address parsing/normalization
address address-parser binding international native nlp
Last synced: 16 Nov 2024
https://github.com/soskek/bert-chainer
Chainer implementation of "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
bert chainer google natural-language-processi natural-language-understanding nlp transformer
Last synced: 16 Nov 2024
https://github.com/jieyuz2/wrench
[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark
benchmark-framework data-centric-ai data-programming dataset deep-learning machine-learning nlp robust-learning sequence-labeling weak-supervision weakly-supervised-learning
Last synced: 13 Nov 2024
https://github.com/mmxgn/spacy-clausie
Implementation of the ClausIE information extraction system for python+spacy
clausie information-extraction nlp problog python-spacy spacy
Last synced: 30 Sep 2024
https://github.com/IngestAI/embedditor
⚡ GUI for editing LLM vector embeddings. No more blind chunking. Upload content in any file extension, join and split chunks, edit metadata and embedding tokens + remove stop-words and punctuation with one click, add images, and download in .veml to share it with your team.
datapreprocessing datascience embedding-vectors embeddings genai laravel llm markup-language ml nlp nltk php vector-database vector-search vectorization veml
Last synced: 31 Oct 2024
https://github.com/JieyuZ2/wrench
[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark
benchmark-framework data-centric-ai data-programming dataset deep-learning machine-learning nlp robust-learning sequence-labeling weak-supervision weakly-supervised-learning
Last synced: 03 Oct 2024
https://github.com/naver/claf
CLaF: Open-Source Clova Language Framework
clova framework language natural-language-processing nlp pytorch
Last synced: 15 Nov 2024
https://github.com/himkt/konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
janome japanese kytea mecab natural-language-processing nlp sentencepiece sudachi text-processing
Last synced: 13 Nov 2024
https://github.com/cohere-ai/sandbox-topically
Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.
machine-learning nlp python topic-modeling
Last synced: 07 Oct 2024
https://github.com/icoxfog417/graph-convolution-nlp
Graph Convolution Network for NLP
graph-convolutional-networks machine-learning natural-language-processing nlp
Last synced: 05 Nov 2024
https://github.com/bnosac/udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
conll dependency-parser lemmatization natural-language-processing nlp pos-tagging r r-package r-pkg rcpp text-mining tokenizer udpipe
Last synced: 11 Nov 2024
https://github.com/jaidevd/numerizer
A Python module to convert natural language numerics into ints and floats.
information-extraction nlp regular-expressions spacy spacy-extension
Last synced: 14 Oct 2024
https://github.com/brutalcoding/aub.ai
AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.
android dart flutter gemini gemini-nano gen-ai genai indiedev ios ipados linux llamacpp localllama macos mistral-7b native-apps nlp on-device on-device-ai pubdev
Last synced: 10 Oct 2024
https://github.com/ticki/eudex
A blazingly fast phonetic reduction/hashing algorithm.
Last synced: 02 Nov 2024
https://github.com/akoksal/Turkish-Word2Vec
Pre-trained Word2Vec Model for Turkish
Last synced: 12 Nov 2024
https://github.com/vipul-sharma20/sharingan
Tool to extract news articles from newspaper and give the context about the news
context-extraction news-extraction nlp opencv
Last synced: 10 Nov 2024
https://github.com/scicloj/scicloj.ml
A Clojure machine learning library
classification clojure clustering data-pipeline data-science experiment-tracking hyperparameter-optimization machine-learning nlp regression scicloj
Last synced: 15 Nov 2024
https://github.com/Fixy-TR/fixy
Amacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
acikhack2 ai artificial-intelligence bert data-science deep-learning deeplearning keras natural-language-processing neural-network neural-networks nlp python
Last synced: 12 Nov 2024
https://github.com/janlukasschroeder/nlp-cheat-sheet-python
NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
cheat-sheet dependency-parsing introduction lemmatization lexnlp machine-learning named-entity-recognition nlp nltk pos-tagging python sentence-similarity spacy spacy-nlp spans starter-kit tokenization
Last synced: 14 Oct 2024
https://github.com/davidberenstein1957/classy-classification
This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-shot classification with Huggingface.
few-shot-classifcation hacktoberfest machine-learning natural-language-processing nlp nlu sentence-transformers spacy text-classification
Last synced: 14 Oct 2024
https://github.com/erfanzar/EasyDeL
Accelerate, Optimize performance with streamlined training and serving options with JAX.
easydel flax gpt jax machine-learning mojo nlp optax transformers
Last synced: 16 Nov 2024
https://github.com/ayaka14732/llama-2-jax
JAX implementation of the Llama 2 model
jax llama llama2 natural-language-processing nlp
Last synced: 17 Nov 2024
https://github.com/nisaaragharia/advanced_rag
Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 ,Agents.
agent agents ai chatgpt genai langchain llama3 llm machine-learning nlp openai rag retrival-augmented vectordb
Last synced: 10 Oct 2024
https://github.com/cadmiumcr/cadmium
Natural Language Processing (NLP) library for Crystal
crystal crystal-lang crystal-language inflector nlp phonetics readability sentiment-analysis shards stemmer string-distance tf-idf transliterator tries wordnet
Last synced: 15 Nov 2024
https://github.com/adamlui/chatgpt-infinity
∞ Generate endless answers from all-knowing ChatGPT (in any language!)
ai artificial-intelligence chatbot chatgpt chrome-extension experimental gpt-3 greasemonkey machine-learning nlp openai trivia userscripts
Last synced: 12 Oct 2024
https://github.com/coteries/cedille-ai
✒️ Cedille is a large French language model (6B), released under an open-source license
Last synced: 04 Nov 2024
https://github.com/neuml/rag
🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.
large-language-models llm machine-learning nlp python rag retrieval-augmented-generation search txtai
Last synced: 20 Oct 2024
https://github.com/erfanzar/easydel
Accelerate, Optimize performance with streamlined training and serving options with JAX.
easydel flax gpt jax machine-learning mojo nlp optax transformers
Last synced: 14 Nov 2024
https://github.com/labteral/ernie
Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.
albert bert bert-as-service bert-embeddings bert-model bert-models distilbert huggingface huggingface-transformer keras natural-language-processing nlp roberta sentence-classification tensorflow tensorflow2 transformer-architecture transformer-tensorflow2 transformers
Last synced: 07 Aug 2024
https://github.com/tlkh/text-emotion-classification
Archived - not answering issues
deep-neural-networks keras nlp sentiment-classification
Last synced: 15 Nov 2024
https://github.com/sudharsan13296/getting-started-with-google-bert
Build and train state-of-the-art natural language processing models using BERT
albert bart bert bertsum clinical-bert distilbert electra huggingface-transformers nlp pytorch roberta sbert sentence-bert spanbert tinybert transformer videobert
Last synced: 15 Nov 2024
https://github.com/explosion/displacy-ent
:boom: displaCy-ent.js: An open-source named entity visualiser for the modern web
css javascript named-entities natural-language-processing nlp spacy visualization
Last synced: 25 Sep 2024
https://github.com/sea-snell/implicit-language-q-learning
Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"
implicit-q-learning iql language-model nlp offline-rl python pytorch q-learning reinforcement-learning
Last synced: 18 Nov 2024
https://github.com/kavgan/rouge-2.0
ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode text evaluation, CSV output.
evaluation evaluation-toolkit java metrics nlp rouge rouge-l rouge-n rouge-s rouge-su text-summarization unicode-text
Last synced: 30 Oct 2024
https://github.com/dkpro/dkpro-core
Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
dkpro java natural-language-processing nlp uima uima-components
Last synced: 13 Nov 2024
https://github.com/MaartenGr/Concept
Concept Modeling: Topic Modeling on Images and Text
computer-vision image-processing nlp topic-modeling
Last synced: 05 Nov 2024
https://github.com/fanhuaandluomu/parselawdocuments
对收集的法律文档进行一系列分析,包括根据规范自动切分、案件相似度计算、案件聚类、法律条文推荐等(试验目前基于婚姻类案件,可扩展至其它领域)。
Last synced: 12 Nov 2024
https://github.com/iPieter/RobBERT
A Dutch RoBERTa-based language model
bert bert-model language-model nlp nlp-resources roberta transformers
Last synced: 17 Nov 2024
https://github.com/vishwasg217/fin-sight
FinSight - Financial Insights at Your Fingertip: FinSight is a cutting-edge AI assistant tailored for portfolio managers, investors, and finance enthusiasts. It streamlines the process of gaining crucial insights and summaries about a company in a user-friendly manner.
fintech langchain llama-index llms nlp streamlit
Last synced: 10 Oct 2024
https://github.com/stanford-oval/genie-toolkit
The Genie open source kit for voice assistant (formerly known as Almond)
hacktoberfest natural-language nlp semantic-parsers voice-assistant
Last synced: 13 Nov 2024
https://github.com/textvec/textvec
Text vectorization tool to outperform TFIDF for classification tasks
machine-learning natural-language-processing nlp python text-analysis text-classification text-processing tf-idf
Last synced: 29 Oct 2024
https://github.com/maartengr/concept
Concept Modeling: Topic Modeling on Images and Text
computer-vision image-processing nlp topic-modeling
Last synced: 16 Nov 2024
https://github.com/rizerphe/obsidian-companion
Autocomplete your obsidian notes with AI, including ChatGPT, through a copilot-like interface.
ai ai21labs chatgpt groq groq-ai large-language-models llm llm-local nlp obsidian-md obsidian-plugin ollama oobabooga openai
Last synced: 10 Oct 2024
https://github.com/WZBSocialScienceCenter/tmtoolkit
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
evaluation nlp parallel-processing python socialscience text-processing topic-modeling
Last synced: 13 Nov 2024
https://github.com/houbb/word-checker
🇨🇳🇬🇧Chinese and English word spelling corrector.(中文易错别字检测,中文拼写检测纠正。英文单词拼写校验工具)
cc csc english-word java nlp spelling spelling-correction word
Last synced: 07 Nov 2024
https://github.com/milaan9/python_natural_language_processing
This repository consists of a complete guide on natural language processing (NLP) in Python where we'll learn various techniques for implementing NLP including parsing & text processing and understand how to use NLP for text feature engineering.
bag-of-words inversedocumentfrequency ipython-notebook lemmatization named-entity-recognition nlp partofspeech-tagger python4datascience python4everybody sentence-segmentation stemming stopwords termfrequency tf-idf tokenization tutor-milaan9 vocabulary-matching
Last synced: 11 Oct 2024
https://github.com/yanndubs/hash-embeddings
PyTorch implementation of Hash Embeddings (NIPS 2017). Submission to the NIPS Implementation Challenge.
embeddings hashing nips nips-challenge nlp pytorch reproducible-research word-embeddings
Last synced: 27 Oct 2024
https://github.com/yaroslavyaroslav/openai-sublime-text
First class Sublime Text AI assistant with GPT-o1 and ollama support!
chatgpt gpt-4 nlp openai sublime-text
Last synced: 11 Nov 2024
https://github.com/dair-ai/dair-ai.github.io
Home of DAIR.AI
ai data-science education machine-learning nlp
Last synced: 10 Nov 2024
https://github.com/explosion/jupyterlab-prodigy
🧬 A JupyterLab extension for annotating data with Prodigy
active-learning annotation annotation-tool artificial-intelligence computer-vision data-annotation data-science jupyter jupyterlab labeling-tool machine-learning machine-teaching natural-language-processing nlp prodigy spacy
Last synced: 07 Oct 2024
https://github.com/intelligo-mn/neuro
🔮 Neuro.js is machine learning library for building AI assistants and chat-bots.
ai ai-assistants bot chat-bot chat-bots chatbot machine-learning natural-language-processing nlp nodejs
Last synced: 12 Nov 2024
https://github.com/guotong1988/NL2SQL-RULE
Content Enhanced BERT-based Text-to-SQL Generation https://arxiv.org/abs/1910.07179
bert deep-learning knowledge knowledge-representation nl2sql nlp pytorch rule-inject-to-model semantic-parsing text2sql
Last synced: 11 Nov 2024
https://github.com/shreyansh26/annotated-ml-papers
Annotations of the interesting ML papers I read
annotated-paper bert deep-learning gpt gpt-2 machine-learning megatron-lm nlp papers-annotations research-paper transformers xlnet
Last synced: 14 Nov 2024
https://github.com/soumyadip007/microsoft-student-partner-workshop-learning-materials-ai-nlp
This repository contains all codes and materials of the current session. It contains the required code on Natural Language Processing, Artificial intelligence.
ai cloud distributed-networking microsoft nlp peer-to-peer workshop
Last synced: 17 Nov 2024
https://github.com/dair-ai/emotion_dataset
:smile: Dataset for Emotion Recognition Research
dataset machine-learning nlp pytorch
Last synced: 17 Nov 2024
https://github.com/ines/spacy-js
🎀 JavaScript API for spaCy with Python REST API
javascript natural-language-processing nlp python rest-api spacy
Last synced: 13 Nov 2024
https://github.com/pszemraj/vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files into clean text
audio audio-processing keyword keyword-extraction nlp python sentence sentence-boundary-detection speech speech-recognition speech-to-text spelling-correction transcription transformer video video-processing video-summarisation video-summarization wav2vec2 whisper
Last synced: 14 Nov 2024
https://github.com/sajjjadayobi/PersianQA
Persian (Farsi) Question Answering Dataset (+ Models)
dataset farsi natural-language-processing nlp persian-language persian-nlp question-answering reading-comprehension squad
Last synced: 04 Aug 2024
https://github.com/franck-dernoncourt/pubmed-rct
PubMed 200k RCT dataset: a large dataset for sequential sentence classification.
corpus machine-learning medical nlp randomized-controlled-trials sentence-classification
Last synced: 14 Oct 2024
https://github.com/d5555/tageditor
🏖TagEditor - Annotation tool for spaCy
annotation annotation-tool coreference-resolution data-science labeling-tool machine-learning named-entities named-entity-recognition natural-language-processing neural-networks neuralcoref nlp spacy spacy-visualizer tagging-tool text-annotation text-tagging training-data
Last synced: 14 Oct 2024
https://github.com/ropensci/tokenizers
Fast, Consistent Tokenization of Natural Language Text
nlp peer-reviewed r r-package rstats text-mining tokenizer
Last synced: 05 Aug 2024
https://github.com/ShawnyXiao/2017-CCF-BDCI-AIJudge
2017-CCF-BDCI-让AI当法官(初赛):7th/415 (Top 1.68%)
2017 bdci ccf data-mining multiclass-classification nlp
Last synced: 01 Nov 2024
https://github.com/obss/jury
Comprehensive NLP Evaluation System
datasets evaluate evaluation huggingface machine-learning metrics natural-language-processing nlp nlp-evaluation python pytorch transformers
Last synced: 12 Nov 2024