An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with fasttext

A curated list of projects in awesome lists tagged with fasttext.

https://github.com/duoergun0729/nlp

By Dou Ge (兜哥): an open-source introductory book on NLP

ai fasttext nlp security word2vec

Last synced: 13 Apr 2025

https://github.com/kyubyong/wordvectors

Pre-trained word vectors of 30+ languages

fasttext language vector word2vec

Last synced: 08 Apr 2025

https://github.com/Kyubyong/wordvectors

Pre-trained word vectors of 30+ languages

fasttext language vector word2vec

Last synced: 20 Apr 2025

https://github.com/yongzhuo/keras-textclassification

Chinese text classification with Keras: long-text classification, short-sentence classification, multi-label classification, and sentence-pair similarity; base classes for building character, word, and sentence embedding layers (embeddings) and network layers (graph); FastText, TextCNN, CharCNN, TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN

albert bert capsule charcnn crnn dcnn dpcnn embeddings fasttext han keras keras-textclassification leam nlp rcnn text-classification textcnn transformer vdcnn xlnet

Last synced: 15 May 2025

https://github.com/chenyuntc/PyTorchText

1st Place Solution for Zhihu Machine Learning Challenge (Zhihu Kanshan Cup). Implementation of various text-classification models.

fasttext lstm nlp pytorch textcnn textrcnn textrnn

Last synced: 28 Mar 2025

https://github.com/chenyuntc/pytorchtext

1st Place Solution for Zhihu Machine Learning Challenge (Zhihu Kanshan Cup). Implementation of various text-classification models.

fasttext lstm nlp pytorch textcnn textrcnn textrnn

Last synced: 12 Apr 2025

https://github.com/jimichan/mynlp

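A production-grade, high-performance, modular, extensible Chinese NLP toolkit (Chinese word segmentation, averaged perceptron, fastText, pinyin, new-word discovery, segmentation error correction, BM25, person-name recognition, named entities, custom dictionaries).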

fasttext nlp pinyin segment starspace

Last synced: 04 May 2025

https://github.com/ncbi-nlp/BioSentVec

BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences

bionlp fasttext mimic-iii natural-language-processing pubmed sent2vec sentence-embeddings sentence-similarity word-embeddings

Last synced: 16 Nov 2025

https://github.com/ThoughtRiver/lmdb-embeddings

Fast word vectors with little memory usage in Python

embeddings fasttext gensim glove lmdb magnitude memory speed text vectors word word2vec

Last synced: 19 Jul 2025

https://github.com/apcode/tensorflow_fasttext

Simple embedding based text classifier inspired by fastText, implemented in tensorflow

fasttext language-identification tensorflow text-classifier

Last synced: 27 Mar 2025

https://github.com/explosion/floret

🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy

fasttext fasttext-embeddings spacy subword-embeddings word-embeddings word-vectors

Last synced: 10 Jan 2026

https://github.com/brightmart/ai_law

All kinds of baseline models for long text classification (text categorization)

accusation ai attention crime fasttext hierarchical-attention-network law relevant-articles text-categorization text-classification textcnn

Last synced: 10 Oct 2025

https://github.com/dalinvip/cw2vec

cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information

cw2vec embeddings fasttext stroke-information word2vec

Last synced: 07 Apr 2025

https://github.com/vrasneur/pyfasttext

Yet another Python binding for fastText

fasttext machine-learning nlp numpy python python-bindings word-vectors

Last synced: 27 Jan 2026

https://github.com/giacbrd/ShallowLearn

An experiment in re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText), with some additional exclusive features and a nice API. Written in Python and fully compatible with Scikit-learn.

fasttext gensim machine-learning neural-network online-learning scikit-learn shallow-learning supervised-learning text-classification text-mining word-embeddings word2vec

Last synced: 19 Jul 2025

https://github.com/giacbrd/shallowlearn

An experiment in re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText), with some additional exclusive features and a nice API. Written in Python and fully compatible with Scikit-learn.

fasttext gensim machine-learning neural-network online-learning scikit-learn shallow-learning supervised-learning text-classification text-mining word-embeddings word2vec

Last synced: 08 Oct 2025

https://github.com/liyibo/text-classification-demos

Neural models for Text Classification in Tensorflow, such as cnn, dpcnn, fasttext, bert ...

bert cnn fasttext tensorflow text-classification

Last synced: 02 Apr 2025

https://github.com/LlmKira/fast-langdetect

⚡️ 80x faster Fasttext language detection out of the box | Split text by language

detect-languages fasttext i18n language-identification languagedetector svc tts

Last synced: 15 May 2025

https://github.com/avidale/compress-fasttext

Tools for shrinking fastText models (in gensim format)

fasttext fasttext-embeddings nlp python word-embeddings

Last synced: 16 Jan 2026

https://github.com/llmkira/fast-langdetect

⚡️ 80x faster Fasttext language detection out of the box | Split text by language

detect-languages fasttext i18n language-identification languagedetector svc tts

Last synced: 05 Apr 2025

https://github.com/prrao87/fine-grained-sentiment

A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.

fasttext flair nlp python pytorch sentiment-analysis text-classification transformers

Last synced: 09 Apr 2025

https://github.com/indix/whatthelang

Lightning Fast Language Prediction 🚀

fasttext language-detection languages nlp python

Last synced: 13 Dec 2025

https://github.com/renovamen/text-classification

PyTorch implementation of some text classification models (HAN, fastText, BiLSTM-Attention, TextCNN, Transformer)

bilstm-attention cnn document-classification fasttext han hierarchical-attention-networks lstm nlp text-classification textcnn transformer

Last synced: 24 Apr 2025

https://github.com/vyraun/Half-Size

Code for "Effective Dimensionality Reduction for Word Embeddings".

fasttext glove nips-2017 pca wordembedding

Last synced: 10 May 2025

https://github.com/eellak/nlpbuddy

A text analysis application for performing common NLP tasks through a web dashboard interface and an API

fasttext gensim natural-language-processing spacy text-analysis text-classification

Last synced: 12 Apr 2025

https://github.com/tharindudr/simple-sentence-similarity

Exploring the simple sentence similarity measurements using word embeddings

elmo fasttext glove ipynb python sentence-embeddings sentence-similarity wmd word-embeddings word2vec

Last synced: 30 Jun 2025

https://github.com/thomasahle/fastchess

Predicts the best chess move with 27.5% accuracy by a single matrix multiplication

ai chess chess-ai chess-engine fasttext machine-learning machinelearning

Last synced: 13 Jul 2025

https://github.com/IlyaGusev/tgcontest

Telegram Data Clustering contest solution by Mindful Squirrel

classification clustering cpp data-science document-similarity fasttext machine-learning nlp

Last synced: 03 Apr 2025

https://github.com/olegtarasov/fasttext.netwrapper

.NET Standard wrapper for the fastText library. Now works on Windows, Linux and macOS!

csharp fasttext machine-learning net nlp

Last synced: 28 Oct 2025

https://github.com/messense/fasttext-rs

fastText Rust binding

fasttext nlp

Last synced: 19 Oct 2025

https://github.com/messense/fasttext-serving

fastText model serving service

fasttext model-server model-serving nlp

Last synced: 04 Apr 2025

https://github.com/thomasthiebaud/spacy-fastlang

Language detection using Spacy and Fasttext

fasttext fasttext-python language-detection spacy spacy-extensions

Last synced: 01 Apr 2026

https://github.com/macanv/mqnlp

Implementations of natural language processing experiments, such as text classification, named entity recognition, POS tagging, segmentation, keyword extraction, and automatic summarization.

fasttext lstm ner pos-tagging segment sequence-labeling textclassification textcnn textrnn

Last synced: 29 Oct 2025

https://github.com/iamaziz/language-detection-fasttext

Building a language detection classifier using fastText

fasttext language-detection text-classification word-embeddings

Last synced: 23 Mar 2025

https://github.com/vunb/node-fasttext

Nodejs binding for fasttext representation and classification.

classifier facebook-fasttext fasttext node-fasttext text-classification vntk

Last synced: 24 Jul 2025

https://github.com/mlampros/fasttext

R package for 'Efficient Learning of Word Representations and Sentence Classification'

cpp11 fasttext r

Last synced: 18 Oct 2025

https://github.com/currentslab/fastlangid

fastlangid, the only language identification package that supports Cantonese (zh-yue), Simplified Chinese (zh-hans) and Traditional Chinese (zh-hant)

cantonese-language fasttext identification language-identification language-identifier simplified-chinese traditional-chinese

Last synced: 17 Mar 2026

https://github.com/mbanon/fastspell

Targeted language identifier, based on FastText and Hunspell.

fasttext hunspell language-identification nlp

Last synced: 18 Jan 2026

https://github.com/revanced/revanced-bots

🤖 NLP-backed bots assisting ReVanced

ai bot client discord fasttext ocr revanced server telegram

Last synced: 01 Mar 2026

https://github.com/ekzhu/go-fasttext

Facebook fastText database in SQLite with Go API

facebook fasttext golang sqlite wordembedding

Last synced: 26 Aug 2025

https://github.com/dataxujing/nlp-paper

🎨 Natural language processing (NLP) tutorial 🎨 https://dataxujing.github.io/NLP-paper/

albert attention-mechanism bert crf elmo fasttext glove gpt lad2vec lda lsa pagerank plsa seq2seq seq2seq-attention textcnn textrank transformer word2vec xlnet

Last synced: 23 Feb 2026

https://github.com/joedoyle23/fasttextjs

JavaScript implementation of the FastText prediction algorithm

fasttext

Last synced: 30 Apr 2025

https://github.com/peaceiris/actions-suggest-related-links

A GitHub Action to suggest related or similar issues, documents, and links. Based on the power of NLP and fastText.

actions fasttext github-actions issue-management nlp

Last synced: 20 Aug 2025

https://github.com/trykatchup/password-similarity-nlp

Password Similarity Detection Using Deep Neural Networks. This project was the case study of my bachelor's thesis.

deep-neural-networks fasttext machine-learning nlp-machine-learning password password-similarity privacy privacy-protection word-embeddings

Last synced: 08 Oct 2025

https://github.com/guenthermi/table-embeddings

Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data

embeddings fasttext ml neural-network schema schema-data tables unsupervised-learning web-table word-embeddings

Last synced: 14 Apr 2025

https://github.com/lonepatient/cw2vec-pytorch

cw2vec implementation in pytorch

chinese cw2vec fasttext gensim pytorch strokes

Last synced: 11 Feb 2026

https://github.com/mpuig/textclassification

A brief overview of how to use fastText to train powerful text classifiers in a python notebook.

fasttext nlp notebook python text-classification

Last synced: 15 Jun 2025

https://github.com/helboukkouri/embedding-visualization

This is a project for visualizing word embeddings based on the work of Andrei Kashcha (@anvaka).

fasttext glove graphs nlp visualization word-embeddings word2vec

Last synced: 30 Aug 2025

https://github.com/autonomio/signs

A suite of tools for text preparation, vectorization and processing for deep learning with Keras.

embeddings fasttext gensim glove keras spacy word2vec

Last synced: 12 Apr 2025

https://github.com/maartengr/vlac

Vectors of Locally Aggregated Concepts

fasttext kmeans machine-learning nlp word-embeddings word2vec

Last synced: 27 Feb 2026

https://github.com/rse/fasttext-lid

Language Identification with Facebook FastText for Node.js

fasttext identification language lid model prediction

Last synced: 19 Apr 2025

https://github.com/agiletechvn/opencv-starterkit

OpenCV, nltk with Tensorflow all together

bert-model fasttext keras nltk opencv4 tensorflow

Last synced: 11 Oct 2025

https://github.com/riccorl/sense-embedding

BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText

embeddings fasttext gensim sense sense-embedding sense-embeddings word-embedding word2vec

Last synced: 30 Oct 2025

https://github.com/gluschenko/panlingo

Collection of language detection libraries for .NET: FastText, CLD2, CLD3, MediaPipe, Lingua, Whatlang

cld2 cld3 dotnet dotnet-core fasttext interop language-detection language-identification lingua machine-learning mediapipe neural-networks nlp whatlang wrapper

Last synced: 25 Jun 2025

https://github.com/yunsii/fasttext.wasm.js

WebAssembly version of fastText with Node and browser support: a library for efficient text classification and representation learning.

browser browser-extension fasttext language language-detection language-detector language-identification natural-language natural-language-processing nlp node nodejs wasm web-extension webassembly worker

Last synced: 28 Apr 2025

https://github.com/taufik-rama/fasttext-go-wrapper

A simple Golang wrapper for fastText text classification library

fasttext golang-wrapper

Last synced: 04 Feb 2026

https://github.com/mabdh/go-fasttext

🗚🐀 serving fastText model with golang

fasttext golang nlp textclassification

Last synced: 04 Aug 2025

https://github.com/pabvald/semantic-similarity

Comparison of methods based on pre-trained Word2Vec, GloVe and FastText vectors to measure the semantic similarity between sentence pairs

bachelor-thesis embeddings evaluation fasttext gensim-library glove semantic-similarity spacy word-embeddings word2vec

Last synced: 11 Oct 2025

https://github.com/altescy/xallennlp

Expanded AllenNLP

allennlp fasttext mlflow nlp optuna python

Last synced: 07 Oct 2025

https://github.com/jefrydco/similar-words-fasttext-tsne

Similar words visualization using gensim fasttext and sklearn tSNE

fasttext visualization word2vec

Last synced: 23 Oct 2025

https://github.com/talmago/simple-but-tough-to-beat-examples

Bunch of examples of a "Simple but tough to beat baseline for sentence embeddings" in classification tasks

fake-news-classification fasttext fasttext-python imdb-dataset machine-learning nlp sentence-embeddings sentence2vec w2v word-embeddings word2vec

Last synced: 23 Apr 2025

https://github.com/mwydmuch/datasets4fasttext

Multiclass and multilabel datasets in fastText format

datasets fasttext multi-class

Last synced: 01 Feb 2026

https://github.com/ermlab/polish-word-embeddings-review

Evaluation of Polish word embeddings prepared by various research groups, carried out with a word analogy task

computational-linguistics deep-learning fasttext machine-learning nlp polish-language word2vec wordembeddings

Last synced: 07 Apr 2026

https://github.com/messense/cfasttext

A fastText C wrapper

c-wrapper fasttext

Last synced: 15 Apr 2025

https://github.com/Ermlab/polish-word-embeddings-review

Evaluation of Polish word embeddings prepared by various research groups, carried out with a word analogy task

computational-linguistics deep-learning fasttext machine-learning nlp polish-language word2vec wordembeddings

Last synced: 15 Mar 2025

https://github.com/dominicburkart/fast_text

Rust wrapper for Facebook's FastText package.

fasttext machine-learning ml nlp rust word-embeddings

Last synced: 27 Jun 2025

https://github.com/miladnouriezade/ktrain-biobert_ner

This repository contains data and a BioBERT-based NER model (monologg/biobert_v1.1_pubmed, from the community-uploaded Hugging Face models) for detecting entities such as chemicals and diseases.

biobert biomedical bionlp disease fasttext huggingface ktrain name named-entity-recognition ner nlp python spacy

Last synced: 28 Apr 2026

https://github.com/tuananh/fasttext-native

fastText native bindings for Node.js

fasttext nodejs

Last synced: 10 Apr 2025

https://github.com/ZhengZixiang/OpenTC

Exploring various text classification models based on PyTorch.

fasttext pytorch text-classification textcnn textrcnn textrnn

Last synced: 06 May 2025

https://github.com/raypereda/shuffle

a tool for shuffling lines of text

fasttext shuffle text text-classification text-processing

Last synced: 11 Apr 2025

https://github.com/bestmahdi2/uni__webcrawlerproject

A university project in which a web crawler is designed for the Instagram website and fasttext is used to predict the positive or negative content of a post's comments.

beautifulsoup4 fasttext gui matplotlib pandas prediction-model python selenium tkinter web-scraping

Last synced: 11 Feb 2026

https://github.com/jieguangzhou/textclassification

Text classification based on TensorFlow

cnn deep-learning fasttext python rnn tensorflow textcnn textrnn

Last synced: 24 Oct 2025

https://github.com/bees4ever/seaqube

Semantic Quality Benchmark for Word Embeddings, i.e. Natural Language Models in Python. Acronym `SeaQuBe` or `seaqube`.

augmentation benchmark fasttext gensim nlp spacy spacy-nlp wordembeddings

Last synced: 21 Apr 2025

https://github.com/benzlokzik/spam-detector

Training code and models for spam message detection in Russian

bert chromadb fasttext knn ml python rag scikit-learn spam-detection transformers

Last synced: 25 May 2026

https://github.com/frederickroman/fasttextapi

Unofficial minified fastText API. Use it to run NLP DL models that require word embeddings on the client side.

fasttext fasttext-embeddings machine-learning natural-language-processing nextjs nlp-apis public-api pwa-app rest-api word-embeddings

Last synced: 06 Sep 2025

https://github.com/memgonzales/semantle-word-embeddings

Recreation of Semantle (a word guessing game that gives the semantic similarity to the secret word) using three pretrained word embeddings: (1) word2vec, (2) GloVe, and (3) fastText

dense-vector fasttext gensim glove glove-embeddings natural-language-processing natural-language-understanding nlp semantic-similarity semantics semantle word-embeddings word2vec

Last synced: 13 Apr 2026

https://github.com/aurelienmorgan/french_text_sentiment

Sentiment analysis of French-language texts using TensorFlow/Keras (with XGBoost for hyperparameter optimization)

beautifulsoup dask fasttext french gru hyperparameters-optimization jupyter-notebook keras multiprocessing nlp python rnn scikit-learn sentiment-analysis tensorflow transfer-learning web-scraping xgboost

Last synced: 02 Apr 2026