Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with fasttext

A curated list of projects in awesome lists tagged with fasttext .

https://github.com/duoergun0729/nlp

兜哥出品 <一本开源的NLP入门书籍>

ai fasttext nlp security word2vec

Last synced: 21 Dec 2024

https://github.com/kyubyong/wordvectors

Pre-trained word vectors of 30+ languages

fasttext language vector word2vec

Last synced: 21 Dec 2024

https://github.com/Kyubyong/wordvectors

Pre-trained word vectors of 30+ languages

fasttext language vector word2vec

Last synced: 09 Nov 2024

https://github.com/yongzhuo/keras-textclassification

中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN

albert bert capsule charcnn crnn dcnn dpcnn embeddings fasttext han keras keras-textclassification leam nlp rcnn text-classification textcnn transformer vdcnn xlnet

Last synced: 18 Dec 2024

https://github.com/chenyuntc/PyTorchText

1st Place Solution for Zhihu Machine Learning Challenge . Implementation of various text-classification models.(知乎看山杯第一名解决方案)

fasttext lstm nlp pytorch textcnn textrcnn textrnn

Last synced: 31 Oct 2024

https://github.com/chenyuntc/pytorchtext

1st Place Solution for Zhihu Machine Learning Challenge . Implementation of various text-classification models.(知乎看山杯第一名解决方案)

fasttext lstm nlp pytorch textcnn textrcnn textrnn

Last synced: 17 Dec 2024

https://github.com/mayabot/mynlp

一个生产级、高性能、模块化、可扩展的中文NLP工具包。(中文分词、平均感知机、fastText、拼音、新词发现、分词纠错、BM25、人名识别、命名实体、自定义词典)

fasttext nlp pinyin segment starspace

Last synced: 13 Nov 2024

https://github.com/ThoughtRiver/lmdb-embeddings

Fast word vectors with little memory usage in Python

embeddings fasttext gensim glove lmdb magnitude memory speed text vectors word word2vec

Last synced: 27 Nov 2024

https://github.com/apcode/tensorflow_fasttext

Simple embedding based text classifier inspired by fastText, implemented in tensorflow

fasttext language-identification tensorflow text-classifier

Last synced: 30 Oct 2024

https://github.com/explosion/floret

🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy

fasttext fasttext-embeddings spacy subword-embeddings word-embeddings word-vectors

Last synced: 30 Sep 2024

https://github.com/brightmart/ai_law

all kinds of baseline models for long text classificaiton( text categorization)

accusation ai attention crime fasttext hierarchical-attention-network law relevant-articles text-categorization text-classification textcnn

Last synced: 18 Dec 2024

https://github.com/dalinvip/cw2vec

cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information

cw2vec embeddings fasttext stroke-information word2vec

Last synced: 19 Dec 2024

https://github.com/vrasneur/pyfasttext

Yet another Python binding for fastText

fasttext machine-learning nlp numpy python python-bindings word-vectors

Last synced: 07 Nov 2024

https://github.com/giacbrd/ShallowLearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

fasttext gensim machine-learning neural-network online-learning scikit-learn shallow-learning supervised-learning text-classification text-mining word-embeddings word2vec

Last synced: 27 Nov 2024

https://github.com/giacbrd/shallowlearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

fasttext gensim machine-learning neural-network online-learning scikit-learn shallow-learning supervised-learning text-classification text-mining word-embeddings word2vec

Last synced: 18 Dec 2024

https://github.com/liyibo/text-classification-demos

Neural models for Text Classification in Tensorflow, such as cnn, dpcnn, fasttext, bert ...

bert cnn fasttext tensorflow text-classification

Last synced: 02 Nov 2024

https://github.com/avidale/compress-fasttext

Tools for shrinking fastText models (in gensim format)

fasttext fasttext-embeddings nlp python word-embeddings

Last synced: 13 Nov 2024

https://github.com/prrao87/fine-grained-sentiment

A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.

fasttext flair nlp python pytorch sentiment-analysis text-classification transformers

Last synced: 18 Dec 2024

https://github.com/indix/whatthelang

Lightning Fast Language Prediction 🚀

fasttext language-detection languages nlp python

Last synced: 18 Dec 2024

https://github.com/renovamen/text-classification

PyTorch implementation of some text classification models (HAN, fastText, BiLSTM-Attention, TextCNN, Transformer) | 文本分类

bilstm-attention cnn document-classification fasttext han hierarchical-attention-networks lstm nlp text-classification textcnn transformer

Last synced: 10 Nov 2024

https://github.com/llmkira/fast-langdetect

⚡️ 80x faster language detection with Fasttext | Split text by language for TTS

detect-languages fasttext i18n language-identification languagedetector svc tts

Last synced: 16 Dec 2024

https://github.com/vyraun/Half-Size

Code for "Effective Dimensionality Reduction for Word Embeddings".

fasttext glove nips-2017 pca wordembedding

Last synced: 17 Nov 2024

https://github.com/eellak/nlpbuddy

A text analysis application for performing common NLP tasks through a web dashboard interface and an API

fasttext gensim natural-language-processing spacy text-analysis text-classification

Last synced: 14 Oct 2024

https://github.com/LlmKira/fast-langdetect

⚡️ 80x faster language detection with Fasttext | Split text by language for TTS

detect-languages fasttext i18n language-identification languagedetector svc tts

Last synced: 19 Nov 2024

https://github.com/tharindudr/simple-sentence-similarity

Exploring the simple sentence similarity measurements using word embeddings

elmo fasttext glove ipynb python sentence-embeddings sentence-similarity wmd word-embeddings word2vec

Last synced: 20 Dec 2024

https://github.com/IlyaGusev/tgcontest

Telegram Data Clustering contest solution by Mindful Squirrel

classification clustering cpp data-science document-similarity fasttext machine-learning nlp

Last synced: 04 Nov 2024

https://github.com/thomasahle/fastchess

Predicts the best chess move with 27.5% accuracy by a single matrix multiplication

ai chess chess-ai chess-engine fasttext machine-learning machinelearning

Last synced: 29 Oct 2024

https://github.com/messense/fasttext-serving

fastText model serving service

fasttext model-server model-serving nlp

Last synced: 16 Dec 2024

https://github.com/messense/fasttext-rs

fastText Rust binding

fasttext nlp

Last synced: 18 Dec 2024

https://github.com/macanv/mqnlp

自然语言处理相关实验实现 some experiment of natural language processing, Like text classification, named entity recognition, pos-tags, segment, key words extractor, auto summarize etc.

fasttext lstm ner pos-tagging segment sequence-labeling textclassification textcnn textrnn

Last synced: 07 Nov 2024

https://github.com/thomasthiebaud/spacy-fastlang

Language detection using Spacy and Fasttext

fasttext fasttext-python language-detection spacy spacy-extensions

Last synced: 19 Dec 2024

https://github.com/vunb/node-fasttext

Nodejs binding for fasttext representation and classification.

classifier facebook-fasttext fasttext node-fasttext text-classification vntk

Last synced: 30 Nov 2024

https://github.com/iamaziz/language-detection-fasttext

Building a language detection classifier using fastText

fasttext language-detection text-classification word-embeddings

Last synced: 28 Oct 2024

https://github.com/mlampros/fasttext

R package for 'Efficient Learning of Word Representations and Sentence Classification'

cpp11 fasttext r

Last synced: 07 Nov 2024

https://github.com/ekzhu/go-fasttext

Facebook fastText database in SQLite with Go API

facebook fasttext golang sqlite wordembedding

Last synced: 28 Oct 2024

https://github.com/joedoyle23/fasttextjs

JavaScript implementation of the FastText prediction algorithm

fasttext

Last synced: 22 Oct 2024

https://github.com/peaceiris/actions-suggest-related-links

A GitHub Action to suggest related or similar issues, documents, and links. Based on the power of NLP and fastText.

actions fasttext github-actions issue-management nlp

Last synced: 19 Dec 2024

https://github.com/dataxujing/nlp-paper

:art: :art:NLP 自然语言处理教程 :art::art: https://dataxujing.github.io/NLP-paper/

albert attention-mechanism bert crf elmo fasttext glove gpt lad2vec lda lsa pagerank plsa seq2seq seq2seq-attention textcnn textrank transformer word2vec xlnet

Last synced: 17 Dec 2024

https://github.com/trykatchup/password-similarity-nlp

Password Similarity Detection Using Deep Neural Networks. This project was the case study of my bachelor's thesis.

deep-neural-networks fasttext machine-learning nlp-machine-learning password password-similarity privacy privacy-protection word-embeddings

Last synced: 13 Oct 2024

https://github.com/lonepatient/cw2vec-pytorch

cw2vec implementation in pytorch

chinese cw2vec fasttext gensim pytorch strokes

Last synced: 06 Nov 2024

https://github.com/autonomio/signs

A suite of tools for text preparation, vectorization and processing for deep learning with Keras.

embeddings fasttext gensim glove keras spacy word2vec

Last synced: 14 Oct 2024

https://github.com/guenthermi/table-embeddings

Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data

embeddings fasttext ml neural-network schema schema-data tables unsupervised-learning web-table word-embeddings

Last synced: 15 Oct 2024

https://github.com/helboukkouri/embedding-visualization

This is a project for visualizing word embeddings based on the work of Andrei Kashcha (@anvaka).

fasttext glove graphs nlp visualization word-embeddings word2vec

Last synced: 03 Sep 2024

https://github.com/mpuig/textclassification

A brief overview of how to use fastText to train powerful text classifiers in a python notebook.

fasttext nlp notebook python text-classification

Last synced: 09 Dec 2024

https://github.com/maartengr/vlac

Vectors of Locally Aggregated Concepts

fasttext kmeans machine-learning nlp word-embeddings word2vec

Last synced: 27 Oct 2024

https://github.com/agiletechvn/opencv-starterkit

OpenCV, nltk with Tensorflow all together

bert-model fasttext keras nltk opencv4 tensorflow

Last synced: 17 Dec 2024

https://github.com/riccorl/sense-embedding

BabelNet (and WordNet) sense embedding trained with Word2Vec and FastText

embeddings fasttext gensim sense sense-embedding sense-embeddings word-embedding word2vec

Last synced: 08 Nov 2024

https://github.com/yunsii/fasttext.wasm.js

Node and Browser env supported WebAssembly version of fastText: Library for efficient text classification and representation learning.

browser browser-extension fasttext language language-detection language-detector language-identification natural-language natural-language-processing nlp node nodejs wasm web-extension webassembly worker

Last synced: 09 Nov 2024

https://github.com/altescy/xallennlp

Expanded AllenNLP

allennlp fasttext mlflow nlp optuna python

Last synced: 27 Nov 2024

https://github.com/mabdh/go-fasttext

🗚🐀 serving fastText model with golang

fasttext golang nlp textclassification

Last synced: 20 Nov 2024

https://github.com/talmago/simple-but-tough-to-beat-examples

Bunch of examples of a "Simple but tough to beat baseline for sentence embeddings" in classification tasks

fake-news-classification fasttext fasttext-python imdb-dataset machine-learning nlp sentence-embeddings sentence2vec w2v word-embeddings word2vec

Last synced: 20 Oct 2024

https://github.com/mwydmuch/datasets4fasttext

Multiclass and multilabel datasets in fastText format

datasets fasttext multi-class

Last synced: 19 Nov 2024

https://github.com/dominicburkart/fast_text

Rust wrapper for Facebook's FastText package.

fasttext machine-learning ml nlp rust word-embeddings

Last synced: 11 Oct 2024

https://github.com/messense/cfasttext

A fastText C wrapper

c-wrapper fasttext

Last synced: 16 Oct 2024

https://github.com/Ermlab/polish-word-embeddings-review

Evaluation of polish word embeddings prepared by various research groups. Evaluation is done by words analogy task

computational-linguistics deep-learning fasttext machine-learning nlp polish-language word2vec wordembeddings

Last synced: 26 Oct 2024

https://github.com/bees4ever/seaqube

Semantic Quality Benchmark for Word Embeddings, i.e. Natural Language Models in Python. Acronym `SeaQuBe` or `seaqube`.

augmentation benchmark fasttext gensim nlp spacy spacy-nlp wordembeddings

Last synced: 18 Oct 2024

https://github.com/jefrydco/similar-words-fasttext-tsne

Similar words visualization using gensim fasttext and sklearn tSNE

fasttext visualization word2vec

Last synced: 07 Nov 2024

https://github.com/jieguangzhou/textclassification

基于tensorflow的文本分类 Text classification

cnn deep-learning fasttext python rnn tensorflow textcnn textrnn

Last synced: 06 Dec 2024

https://github.com/bestmahdi2/uni__webcrawlerproject

A university project in which a web crawler is designed for the Instagram website and fasttext is used to predict the positive or negative content of a post's comments.

beautifulsoup4 fasttext gui matplotlib pandas prediction-model python selenium tkinter web-scraping

Last synced: 16 Nov 2024

https://github.com/raypereda/shuffle

a tool for shuffling lines of text

fasttext shuffle text text-classification text-processing

Last synced: 14 Nov 2024

https://github.com/gluschenko/panlingo

Collection of language identification libraries for .NET: FastText, CLD2, CLD3, MediaPipe, Lingua, Whatlang

cld2 cld3 dotnet dotnet-core fasttext interop language-detection language-identification lingua machine-learning mediapipe neural-networks nlp whatlang wrapper

Last synced: 17 Nov 2024

https://github.com/tuananh/fasttext-native

fastText native bindings for ⬡.js

fasttext nodejs

Last synced: 24 Nov 2024

https://github.com/miladnouriezade/ktrain-biobert_ner

This repository contains data and BioBert based NER model monologg/biobert_v1.1_pubmed from community-uploaded Hugging Face models for detecting entities such as chemical and disease.

biobert biomedical bionlp disease fasttext huggingface ktrain name named-entity-recognition ner nlp python spacy

Last synced: 03 Dec 2024

https://github.com/ZhengZixiang/OpenTC

Exploring various text classification models based on PyTorch. 基于PyTorch探索各种文本分类模型

fasttext pytorch text-classification textcnn textrcnn textrnn

Last synced: 13 Nov 2024

https://github.com/jpomykala/masters-thesis

Subject classification of texts in Polish

classification fasttext java machine-learning python spring-boot

Last synced: 15 Nov 2024

https://github.com/george-gca/ai_papers_search_tool

Automatic paper clustering and search tool by fastext from Facebook Research

fasttext fasttext-embeddings fasttext-python nlp python scikit-learn

Last synced: 14 Nov 2024

https://github.com/rid17pawar/semantic-search-model-experiments

Experiments in the field of Semantic Search using BM-25 Algorithm, Mean of Word Vectors, along with state of the art Transformer based models namely USE and SBERT.

bm25 fasttext fasttext-embeddings glove glove-embeddings information-retrieval sbert semantic-search universal-sentence-encoder word2vec word2vec-embeddinngs

Last synced: 17 Nov 2024

https://github.com/sudip-13/nlp

This repo for tutorial NLP dialog flow chat bot back end configured

dialogflow fastapi fasttext mogodb ner regex spacy tf-idf

Last synced: 14 Oct 2024

https://github.com/mhdb96/ml-django-webapp

Machine learning practice project on NLP for Turkish language 🔥 using multiple datasets & deep learning algorithms from TensorFlow.

cnn deep-learning django fasttext lstm machine-learning mlp perceptron python rnn tensorflow

Last synced: 09 Nov 2024

https://github.com/aurelienmorgan/french_text_sentiment

Sentiment Analysis in texts written in French language using Tensorflow/Keras (and using XGBoost for hyperparameters optimization)

beautifulsoup dask fasttext french gru hyperparameters-optimization jupyter-notebook keras multiprocessing nlp python rnn scikit-learn sentiment-analysis tensorflow transfer-learning web-scraping xgboost

Last synced: 15 Dec 2024

https://github.com/ianramzy/yelp-review-classifier

⭐ A classifier built with facebook's fasttext that will input a reviewers text and predict the corresponding star rating.

fasttext nlp-machine-learning python review-sentiments yelp-dataset

Last synced: 19 Nov 2024

https://github.com/penguincabinet/aniota-wiki-fasttext-model

The fattext model is made by Anime Wiki.

anime fasttext nlp word2vec

Last synced: 03 Dec 2024

https://github.com/bhattbhavesh91/language-identification-using-python

A small tutorial on how you can detect language using Python, Fasttext, Google Compact Language Detector and Google Translate

cld3 fasttext google-language google-translate language-identification python python-tutorial tutorial

Last synced: 16 Nov 2024

https://github.com/jolivaresc/fasttext-vecmap

bilingual word embeddings mapping using fastText

autoencoder embeddings fasttext machine-learning machine-translation word2vec

Last synced: 03 Dec 2024

https://github.com/frederickroman/fasttextapi

Unofficial minified fastetext API. Use it to run NLP DL models that require word embeddings on the client-side.

fasttext fasttext-embeddings machine-learning natural-language-processing nextjs nlp-apis public-api pwa-app rest-api word-embeddings

Last synced: 17 Nov 2024

https://github.com/raypereda/normalize

normalize text

fasttext text-processing

Last synced: 14 Nov 2024

https://github.com/d-dawg78/mva_dl

Master MVA - Deep Learning project

bert cnn fasttext genre-identification glove gru lstm lyrics word-embeddings

Last synced: 17 Dec 2024

https://github.com/vhidvz/language-identification

Language identification microservice powered by the FastText language detection model

ai fastapi fasttext language-identification

Last synced: 13 Oct 2024

https://github.com/sngjuk/fasttext_oov_similar_word_printer

Check the fastText's inference performance for OOV.

fasttext fasttext-embeddings fasttext-oov inference-performance

Last synced: 13 Nov 2024