Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with nlp-library

A curated list of projects in awesome lists tagged with nlp-library .

https://github.com/fastnlp/fastnlp

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

chinese-nlp deep-learning natural-language-processing nlp-library nlp-parsing text-classification text-processing

Last synced: 20 Dec 2024

https://github.com/fastnlp/fastNLP

fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

chinese-nlp deep-learning natural-language-processing nlp-library nlp-parsing text-classification text-processing

Last synced: 06 Nov 2024

https://github.com/fudannlp/fnlp

中文自然语言处理工具包 Toolkit for Chinese natural language processing

fnlp fudannlp java nlp-library

Last synced: 20 Dec 2024

https://github.com/FudanNLP/fnlp

中文自然语言处理工具包 Toolkit for Chinese natural language processing

fnlp fudannlp java nlp-library

Last synced: 27 Oct 2024

https://github.com/deepset-ai/farm

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch

Last synced: 19 Dec 2024

https://github.com/deepset-ai/FARM

:house_with_garden: Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.

bert deep-learning germanbert language-models ner nlp nlp-framework nlp-library pretrained-models pytorch question-answering roberta transfer-learning xlnet-pytorch

Last synced: 04 Nov 2024

https://github.com/chrismattmann/tika-python

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.

buffer covid-19 detection extraction memex mime nlp nlp-library nlp-machine-learning parse parser-interface python recognition text-extraction text-recognition tika-python tika-server tika-server-jar translation-interface usc

Last synced: 17 Dec 2024

https://github.com/MilaNLProc/contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021.

bert embeddings multilingual-models multilingual-topic-models neural-topic-models nlp nlp-library nlp-machine-learning text-as-data topic-coherence topic-modeling transformer

Last synced: 04 Nov 2024

https://github.com/thunlp/opendelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

deep-learning nlp nlp-library parameter-efficient-learning pretrained-language-model

Last synced: 15 Dec 2024

https://github.com/thunlp/OpenDelta

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

deep-learning nlp nlp-library parameter-efficient-learning pretrained-language-model

Last synced: 16 Nov 2024

https://github.com/mocobeta/janome

Japanese morphological analysis engine written in pure Python

japanese-language nlp-library python

Last synced: 20 Dec 2024

https://github.com/ikawaha/kagome

Self-contained Japanese Morphological Analyzer written in pure Go

hacktoberfest japanese japanese-language korean morphological-analysis nlp-library pos-tagging segmentation tokenizer

Last synced: 26 Oct 2024

https://github.com/ailln/cn2an

📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)

arabic-numbers arabic-numerals asr chinese-numerals cn2an nlp-library nlp-tool pypi python speech-recognition

Last synced: 16 Dec 2024

https://github.com/cbaziotis/ekphrasis

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).

nlp nlp-library semeval spell-corrector spelling-correction text-processing text-segmentation tokenization tokenizer word-normalization word-segmentation

Last synced: 06 Nov 2024

https://github.com/wyounas/homer

Homer, a text analyser in Python, can help make your text more clear, simple and useful for your readers.

nlp nlp-library python python-library python-script python3 text-analysis

Last synced: 06 Nov 2024

https://github.com/medspacy/medspacy

Library for clinical NLP with spaCy.

clinical-nlp medspacy nlp nlp-library pipeline spacy

Last synced: 19 Dec 2024

https://github.com/fhamborg/Giveme5W1H

Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?

5w 5w1h answering event-detection event-extraction fivew fivewoneh news news-articles nlp nlp-library question question-answering text-analysis

Last synced: 28 Oct 2024

https://github.com/fhamborg/giveme5w1h

Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?

5w 5w1h answering event-detection event-extraction fivew fivewoneh news news-articles nlp nlp-library question question-answering text-analysis

Last synced: 15 Nov 2024

https://github.com/proycon/pynlpl

PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).

computational-linguistics evaluation-metrics folia language-modelling library linguistics machine-learning natural-language-processing nlp nlp-library python search-algorithms text-processing

Last synced: 15 Dec 2024

https://github.com/WorksApplications/SudachiPy

Python version of Sudachi, a Japanese tokenizer.

morphological-analysis nlp-library pos-tagging segmentation

Last synced: 06 Nov 2024

https://github.com/outcastofmusic/quick-nlp

Pytorch NLP library based on FastAI

fastai nlp-library pytorch seq2seq

Last synced: 14 Nov 2024

https://github.com/neomatrix369/nlp_profiler

A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.

google-colab grammar-checks hacktoberfest jupyter kaggle-kernels natural-language-processing nlp nlp-keywords-extraction nlp-library nlp-machine-learning nlp-parsing nlp-profiler profiler profiling profiling-datasets text-mining

Last synced: 21 Dec 2024

https://github.com/BLLIP/bllip-parser

BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser) See http://pypi.python.org/pypi/bllipparser/ for Python module.

ai artificial-intelligence computational-linguistics machine-learning natural-language-processing nlp nlp-library parsing

Last synced: 30 Oct 2024

https://github.com/dccuchile/wefe

WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!

bias-detection bias-reduction fairness-ai fairness-ml library nlp nlp-library python3 word-embedding-evaluation word-embedding-fairness word-embeddings

Last synced: 22 Nov 2024

https://github.com/chewxy/lingo

package lingo provides the data structures and algorithms required for natural language processing

conll-u go golang inflection language-model natural-language-processing nlp nlp-dependency-parsing nlp-library nlp-machine-learning nlp-parsing part-of-speech part-of-speech-tagger

Last synced: 19 Dec 2024

https://github.com/emres/turkish-deasciifier

Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs

deasciifier diacritics diacritics-reconstruction diacritics-restoration nlp nlp-library python turkish turkish-nlp

Last synced: 12 Nov 2024

https://github.com/nullnull/simstring

A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.

nlp nlp-library python

Last synced: 22 Nov 2024

https://github.com/doches/rwordnet

A pure Ruby interface to the WordNet database

nlp-library ruby wordnet wordnet-tags

Last synced: 16 Dec 2024

https://github.com/legacyai/tf-transformers

State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).

bert gpt2 keras language-model natural-language-processing nlp nlp-library tensorflow tensorflow2 text-classification text-generation transformer

Last synced: 07 Nov 2024

https://github.com/ars-linguistica/mlconjug3

A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.

conjugation conjugator devops linguistics machine-learning nlp nlp-library nlp-machine-learning python3 test-driven-development

Last synced: 20 Dec 2024

https://github.com/wayfair-incubator/extra-model

Code to run the ExtRA algorithm for unsupervised topic/aspect extraction on English texts.

aspect-based-sentiment-analysis aspect-extraction machine-learning-algorithms nlp nlp-keywords-extraction nlp-library python python-library python3

Last synced: 07 Nov 2024

https://github.com/syzer/sentiment-analyser

ML that can extract german and english sentiment

english german nlp nlp-library node-js nodejs sentiment-analyser sentiment-analysis

Last synced: 28 Oct 2024

https://github.com/ispras/atr4s

Toolkit with state-of-the-art Automatic Terms Recognition methods in Scala

nlp-keywords-extraction nlp-library scala terminology-extraction

Last synced: 24 Nov 2024

https://github.com/proycon/python-ucto

This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).

computational-linguistics folia nlp nlp-library python text-processing tokenizer

Last synced: 18 Dec 2024

https://github.com/chengchingwen/BytePairEncoding.jl

Julia implementation of Byte Pair Encoding for NLP

nlp nlp-library nlp-machine-learning word-segmentation

Last synced: 28 Oct 2024

https://github.com/chengchingwen/bytepairencoding.jl

Julia implementation of Byte Pair Encoding for NLP

nlp nlp-library nlp-machine-learning word-segmentation

Last synced: 03 Dec 2024

https://github.com/spacyturk/spacyturk

spaCyTurk - trained models & pipelines for Turkish

floret nlp nlp-library spacy turkish-nlp

Last synced: 12 Oct 2024

https://github.com/jakartaresearch/maleo

Wrapper library for text cleansing, preprocessing in NLP

indonesian-language machine-learning nlp nlp-library

Last synced: 15 Nov 2024

https://github.com/jpmanson/llm_templates

Instruction/chat prompts creation library for text generation LLMs. It supports local and Hugging Face models.

chatbot cohere gemma huggingface jinja2 library llama2 llama3 llm mistral nlp nlp-library phi3 template

Last synced: 03 Dec 2024

https://github.com/cdpierse/breame

Lightweight utility tools for the detection of multiple spellings, meanings, and language-specific terminology in British and American English

nlp nlp-library python search-engine spelling utility-library

Last synced: 15 Oct 2024

https://github.com/medspacy/sectionizer

A rule-based Python module for spitting documents into sections.

clinical-nlp medspacy nlp nlp-library pipeline spacy

Last synced: 11 Nov 2024

https://github.com/pythainlp/multiel

Multilingual Entity Linking model by BELA model

entity-linking nlp-library

Last synced: 15 Nov 2024

https://github.com/jweinst1/corplet

A binary-corpus system for word tagging

corpus-linguistics database linguistics nlp nlp-library

Last synced: 08 Nov 2024

https://github.com/f1uctus/ttc

✍ 🗣 A Text-To-Conversation natural language processing toolkit [WIP].

conversation nlp nlp-apis nlp-library spacy spacy-extension spacy-nlp spacy-pipeline speaker-identification

Last synced: 15 Nov 2024

https://github.com/made2591/cognitive-system-postagger

A pos-tagging library with Viterbi, CYK and SVO -> XSV translator made as part of my final exam for the Cognitive System course in Department of Computer Science.

cky cognitive-services cognitive-systems computer-science corpora cyk department lemmatizer nlp nlp-library nlp-parsing nlp-stemming nltk nltk-grammar nlu postagger postagging sentence stemmer viterbi

Last synced: 13 Nov 2024

https://github.com/ahammadmejbah/nlp-interview-questions

Natural Language Processing (NLP) is a field of artificial intelligence that focuses on enabling computers to understand, interpret, and generate human language. It involves tasks like text analysis, sentiment analysis, machine translation, and chatbots to facilitate human-computer communication and automate language-related tasks.

interview nlp-keywords-extraction nlp-library nlp-machine-learning python3

Last synced: 11 Nov 2024

https://github.com/ged/ruby-framenet

Git mirror of a Ruby library for FrameNet

english-grammar framework nlg nlp nlp-library ruby

Last synced: 18 Dec 2024

https://github.com/debjyotisaha/nlp-coursework

NLP Coursework repository, commits uploaded by the contributors mentioned in the repo

chatbot json neural-network nlp nlp-library python3 pytorch

Last synced: 17 Dec 2024

https://github.com/nishant2018/sentiment-analysis-amazon

Sentiment analysis, also known as opinion mining, is the process of determining the sentiment or emotion expressed in a piece of text.

machine-learning machine-learning-algorithms nlp nlp-library nlp-machine-learning sentiment-analysis

Last synced: 11 Nov 2024

https://github.com/nishant2018/knowledge-graph-nlp-tutorial--bert-spacy-nltk-

Natural Language Processing (NLP) is a field of artificial intelligence that focuses on the interaction between computers and humans through natural language.

nlp nlp-library nlp-machine-learning

Last synced: 11 Nov 2024

https://github.com/medspacy/nlp_preprocessor

SpaCy component for modifying the string of a doc before tokenizing.

clinical-nlp medspacy nlp nlp-library pipeline spacy

Last synced: 11 Nov 2024

https://github.com/izhx/nmnlp

自用NLP实验脚手架

nlp nlp-library research

Last synced: 12 Nov 2024

https://github.com/nishant2018/nlp-basic-

Natural Language Processing (NLP) is a field of artificial intelligence that focuses on the interaction between computers and humans through natural language.

nlp nlp-library nlp-machine-learning tf-idf word2vec

Last synced: 11 Nov 2024

https://github.com/nishant2018/disneyland-reviews-nlp-sentiment-analysis-

Sentiment analysis, also known as opinion mining, is a process that involves analyzing text to determine the sentiment expressed, such as positive, negative, or neutral.

nlp nlp-library nlp-machine-learning sentiment-analysis

Last synced: 11 Nov 2024

https://github.com/nishant2018/emotion-detection-nlp-ml-acc-95-

Emotion detection involves identifying and classifying emotions expressed in textual data. It combines techniques from Natural Language Processing (NLP) and Machine Learning (ML) to analyze and interpret human emotions, which can be applied in various domains like customer service, social media analysis, and mental health monitoring.

emotion-detection machine-learning nlp nlp-library nlp-machine-learning

Last synced: 11 Nov 2024

https://github.com/vigneshss-07/complete-atoz-nlprojects

This Repo contains NLP based projects like sentiment analysis, space, nltk libraries.

nlp nlp-library nlp-machine-learning python3

Last synced: 15 Nov 2024

https://github.com/nishant2018/nlp-student-writing-eda

Natural Language Processing (NLP) is a branch of artificial intelligence that focuses on the interaction between computers and humans using natural language.

nlp nlp-deep-learning nlp-library nlp-machine-learning

Last synced: 11 Nov 2024