Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with natural-language-processing

A curated list of projects in awesome lists tagged with natural-language-processing .

https://github.com/NiuTrans/ABigSurvey

A collection of 1000+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML).

deep-learning machine-learning natural-language-processing neural-networks paper-list surveys

Last synced: 30 Jul 2024

https://github.com/delip/PyTorchNLPBook

Code and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://amzn.to/3JUgR2L

deep-learning deep-neural-networks natural-language-processing neural-machine-translation neural-networks nlp pytorch pytorch-nlp pytorch-tutorial

Last synced: 03 Sep 2024

https://github.com/THUDM/P-tuning-v2

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

natural-language-processing p-tuning parameter-efficient-learning pretrained-language-model prompt-tuning

Last synced: 30 Jul 2024

https://github.com/philipperemy/tensorflow-1.4-billion-password-analysis

Deep Learning model to analyze a large corpus of clear text passwords.

deep-learning natural-language-processing tensorflow

Last synced: 30 Sep 2024

https://github.com/jiesutd/NCRFpp

NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.

artificial-intelligence char-cnn char-rnn chunking cnn crf lstm lstm-crf named-entity-recognition natural-language-processing nbest ner neural-networks part-of-speech-tagger pytorch sequence-labeling

Last synced: 01 Aug 2024

https://github.com/graph4ai/graph4nlp

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html) for various learning resources!

deep-learning graph-neural-networks machine-learning natural-language-processing nlp pytorch

Last synced: 30 Sep 2024

https://github.com/dipanjans/text-analytics-with-python

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

clustering gensim natural-language natural-language-processing nltk pattern python scikit-learn semantic sentiment sentiment-analysis spacy stanford-nlp text-analytics text-classification text-summarization

Last synced: 26 Sep 2024

https://github.com/lightning-universe/lightning-bolts

Toolbox of models, callbacks, and datasets for AI/ML researchers.

ai gan image-processing machine-learning natural-language-processing pytorch supervised-learning

Last synced: 02 Oct 2024

https://github.com/Lightning-Universe/lightning-bolts

Toolbox of models, callbacks, and datasets for AI/ML researchers.

ai gan image-processing machine-learning natural-language-processing pytorch supervised-learning

Last synced: 01 Aug 2024

https://github.com/els-rd/transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

deep-learning deployment inference machine-learning natural-language-processing server

Last synced: 26 Sep 2024

https://github.com/ymcui/Chinese-XLNet

Pre-Trained Chinese XLNet(中文XLNet预训练模型)

natural-language-processing nlp pytorch tensorflow xlnet

Last synced: 31 Jul 2024

https://github.com/dipanjanS/text-analytics-with-python

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

clustering gensim natural-language natural-language-processing nltk pattern python scikit-learn semantic sentiment sentiment-analysis spacy stanford-nlp text-analytics text-classification text-summarization

Last synced: 02 Aug 2024

https://github.com/ELS-RD/transformer-deploy

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

deep-learning deployment inference machine-learning natural-language-processing server

Last synced: 01 Aug 2024

https://github.com/salesforce/wikisql

A large annotated semantic parsing corpus for developing natural language interfaces.

database dataset machine-learning natural-language natural-language-interface natural-language-processing

Last synced: 30 Sep 2024

https://github.com/explosion/spacy-models

💫 Models for the spaCy Natural Language Processing (NLP) library

machine-learning machine-learning-models models natural-language-processing nlp spacy spacy-models statistical-models

Last synced: 30 Sep 2024

https://github.com/google-research/language

Shared repository for open-sourced projects from the Google AI Language team.

machine-learning natural-language-processing research

Last synced: 30 Sep 2024

https://github.com/salesforce/WikiSQL

A large annotated semantic parsing corpus for developing natural language interfaces.

database dataset machine-learning natural-language natural-language-interface natural-language-processing

Last synced: 03 Aug 2024

https://github.com/lyuchenyang/macaw-llm

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

deep-learning language-model machine-learning multi-modal-learning natural-language-processing neural-networks

Last synced: 30 Sep 2024

https://github.com/datamade/usaddress

:us: a python library for parsing unstructured United States address strings into address components

address address-parser conditional-random-fields crf machine-learning natural-language-processing nlp parserator python python-library

Last synced: 30 Sep 2024

https://github.com/bfelbo/deepmoji

State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.

ai deep-learning keras machine-learning natural-language-processing neural-networks nlp python sentiment-analysis tensorflow text-classification

Last synced: 26 Sep 2024

https://github.com/bfelbo/DeepMoji

State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.

ai deep-learning keras machine-learning natural-language-processing neural-networks nlp python sentiment-analysis tensorflow text-classification

Last synced: 02 Aug 2024

https://github.com/thunlp/taadpapers

Must-read Papers on Textual Adversarial Attack and Defense

adversarial-attacks adversarial-defense adversarial-learning natural-language-processing nlp paper-list

Last synced: 30 Sep 2024

https://github.com/juand-r/entity-recognition-datasets

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

annotations corpora datasets entity-extraction entity-recognition named-entity-recognition natural-language-processing ner nlp nlp-resources

Last synced: 30 Sep 2024

https://github.com/dair-ai/Transformers-Recipe

🧠 A study guide to learn about Transformers

ai deep-learning machine-learning natural-language-processing nlp

Last synced: 01 Aug 2024

https://github.com/Hironsan/anago

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

deep-learning keras machine-learning named-entity-recognition natural-language-processing sequence-labeling

Last synced: 01 Aug 2024

https://github.com/hironsan/anago

Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.

deep-learning keras machine-learning named-entity-recognition natural-language-processing sequence-labeling

Last synced: 26 Sep 2024

https://github.com/thunlp/TAADpapers

Must-read Papers on Textual Adversarial Attack and Defense

adversarial-attacks adversarial-defense adversarial-learning natural-language-processing nlp paper-list

Last synced: 31 Jul 2024

https://github.com/OpenNMT/OpenNMT-tf

Neural machine translation and sequence learning using TensorFlow

deep-learning machine-translation natural-language-processing neural-machine-translation opennmt python tensorflow

Last synced: 01 Aug 2024

https://github.com/opennmt/opennmt-tf

Neural machine translation and sequence learning using TensorFlow

deep-learning machine-translation natural-language-processing neural-machine-translation opennmt python tensorflow

Last synced: 01 Oct 2024

https://github.com/lyuchenyang/Macaw-LLM

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

deep-learning language-model machine-learning multi-modal-learning natural-language-processing neural-networks

Last synced: 01 Aug 2024

https://github.com/Tiiiger/bert_score

BERT score for text generation

machine-learning natural-language-processing

Last synced: 01 Aug 2024

https://github.com/yoshitomo-matsubara/torchdistill

A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.

amazon-sagemaker-lab cifar10 cifar100 coco colab-notebook glue google-colab image-classification imagenet knowledge-distillation natural-language-processing nlp object-detection pascal-voc pytorch semantic-segmentation transformer

Last synced: 01 Oct 2024

https://github.com/fukuball/jieba-php

"結巴"中文分詞:做最好的 PHP 中文分詞、中文斷詞組件。 / "Jieba" (Chinese for "to stutter") Chinese text segmentation: built to be the best PHP Chinese word segmentation module.

chinese-text-segmentation machine-learning natural-language-processing nlp

Last synced: 26 Sep 2024

https://github.com/practical-nlp/practical-nlp-code

Official Repository for Code associated with 'Practical Natural Language Processing' book by O'Reilly Media

natural-language-processing natural-language-understanding oreilly-books

Last synced: 30 Sep 2024

https://github.com/explosion/projects

🪐 End-to-end NLP workflows from prototype to production

annotations datasets natural-language-processing nlp prodigy spacy

Last synced: 30 Sep 2024

https://github.com/hyperonym/basaran

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.

generative gpt huggingface language-model llama llm model natural-language-processing nlp openai-api python text-generation transformers

Last synced: 27 Sep 2024

https://github.com/cdpierse/transformers-interpret

Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.

captum computer-vision deep-learning explainable-ai interpretability machine-learning model-explainability natural-language-processing neural-network nlp transformers transformers-model

Last synced: 01 Oct 2024

https://github.com/kakaobrain/pororo

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

automatic-speech-recognition deep-learning natural-language-processing neural-models speech-synthesis

Last synced: 03 Aug 2024

https://github.com/natescarlet/holiday-cn

📅🇨🇳中国法定节假日数据 自动每日抓取国务院公告

china crawling data holiday natural-language-processing

Last synced: 30 Sep 2024

https://github.com/farama-foundation/chatarena

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

ai artificial-intelligence chatgpt gpt-4 large-language-models multi-agent multi-agent-reinforcement-learning multi-agent-simulation natural-language-processing python

Last synced: 26 Sep 2024

https://github.com/Farama-Foundation/chatarena

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

ai artificial-intelligence chatgpt gpt-4 large-language-models multi-agent multi-agent-reinforcement-learning multi-agent-simulation natural-language-processing python

Last synced: 01 Aug 2024

https://github.com/huggingface/hmtl

🌊HMTL: Hierarchical Multi-Task Learning - A State-of-the-Art neural network model for several NLP tasks based on PyTorch and AllenNLP

multi-task-learning natural-language-processing nlp pytorch

Last synced: 26 Sep 2024

https://github.com/juliasilge/tidytext

Text mining using tidy tools :sparkles::page_facing_up::sparkles:

natural-language-processing r text-mining tidy-data tidyverse

Last synced: 01 Oct 2024

https://github.com/bheinzerling/bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

embeddings multilingual natural-language-processing nlp subword-embeddings

Last synced: 30 Sep 2024

https://github.com/hse-aml/natural-language-processing

Resources for "Natural Language Processing" Coursera course.

natural-language-processing

Last synced: 30 Sep 2024

https://github.com/NateScarlet/holiday-cn

📅🇨🇳中国法定节假日数据 自动每日抓取国务院公告

china crawling data holiday natural-language-processing

Last synced: 31 Jul 2024

https://github.com/google/budou

Budou is an automatic organizer tool for beautiful line breaking in CJK (Chinese, Japanese, and Korean).

cjk natural-language-processing python web-development

Last synced: 30 Sep 2024

https://github.com/kavgan/nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

gensim machine-learning natural-language-processing nlp text-classification text-mining tf-idf word2vec

Last synced: 30 Sep 2024

https://github.com/uber-research/pplm

Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.

deep-learning language-modeling machine-learning natural-language-generation natural-language-processing nlp

Last synced: 30 Sep 2024

https://github.com/alibabaresearch/damo-convai

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

conversational-ai deep-learning dialog natural-language-processing

Last synced: 18 Aug 2024

https://github.com/uber-research/PPLM

Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models.

deep-learning language-modeling machine-learning natural-language-generation natural-language-processing nlp

Last synced: 04 Aug 2024

https://github.com/pemistahl/lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

language-classification language-detection language-identification language-recognition natural-language-processing nlp python-library

Last synced: 26 Sep 2024

https://github.com/shujian2015/freeml

A List of Data Science/Machine Learning Resources (Mostly Free)

data-science deep-learning machine-learning natural-language-processing

Last synced: 30 Sep 2024

https://github.com/Shujian2015/FreeML

A List of Data Science/Machine Learning Resources (Mostly Free)

data-science deep-learning machine-learning natural-language-processing

Last synced: 02 Aug 2024

https://github.com/AlibabaResearch/DAMO-ConvAI

DAMO-ConvAI: The official repository which contains the codebase for Alibaba DAMO Conversational AI.

conversational-ai deep-learning dialog natural-language-processing

Last synced: 09 Aug 2024

https://github.com/rucaibox/textbox

TextBox 2.0 is a text generation library with pre-trained language models

deep-learning natural-language-generation natural-language-processing pretrained-models python pytorch seq2seq text-generation

Last synced: 30 Sep 2024

https://github.com/RUCAIBox/TextBox

TextBox 2.0 is a text generation library with pre-trained language models

deep-learning natural-language-generation natural-language-processing pretrained-models python pytorch seq2seq text-generation

Last synced: 03 Aug 2024