An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with word2vec

A curated list of projects in awesome lists tagged with word2vec .

https://github.com/vi3k6i5/flashtext

Extract Keywords from sentence or Replace keywords in sentences.

data-extraction keyword-extraction nlp search-in-text word2vec

Last synced: 13 May 2025

https://github.com/shibing624/text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

embeddings nlp sentence-embeddings similarity text-similarity text2vec word2vec

Last synced: 12 May 2025

https://github.com/PaddlePaddle/PaddleRec

Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESMM、ESCMM, MAML、xDeepFM、DeepFEFM、NFM、AFM、RALM、DMR、GateNet、NAML、DIFM、Deep Crossing、PNN、BST、AutoInt、FGCNN、FLEN、Fibinet、ListWise、DeepRec、ENSFM,TiSAS,AutoFIS等,包含经典推荐系统数据集criteo 、movielens等

deepfm esmm gru4rec lr mmoe ple tdm widedeep word2vec

Last synced: 29 Apr 2025

https://github.com/paddlepaddle/paddlerec

Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESMM、ESCMM, MAML、xDeepFM、DeepFEFM、NFM、AFM、RALM、DMR、GateNet、NAML、DIFM、Deep Crossing、PNN、BST、AutoInt、FGCNN、FLEN、Fibinet、ListWise、DeepRec、ENSFM,TiSAS,AutoFIS等,包含经典推荐系统数据集criteo 、movielens等

deepfm esmm gru4rec lr mmoe ple tdm widedeep word2vec

Last synced: 13 May 2025

https://github.com/alibaba/alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 14 May 2025

https://github.com/alibaba/Alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 14 Mar 2025

https://github.com/danielfrg/word2vec

Python interface to Google word2vec

doc2vec python word2vec

Last synced: 20 Oct 2025

https://github.com/duoergun0729/nlp

兜哥出品 <一本开源的NLP入门书籍>

ai fasttext nlp security word2vec

Last synced: 13 Apr 2025

https://github.com/kyubyong/wordvectors

Pre-trained word vectors of 30+ languages

fasttext language vector word2vec

Last synced: 08 Apr 2025

https://github.com/Kyubyong/wordvectors

Pre-trained word vectors of 30+ languages

fasttext language vector word2vec

Last synced: 20 Apr 2025

https://github.com/golbin/tensorflow-tutorials

텐서플로우를 기초부터 응용까지 단계별로 연습할 수 있는 소스 코드를 제공합니다

autoencoder chatbot cnn deep-learning dqn gan inception mnist neural-network rnn seq2seq tensorflow tutorial word2vec

Last synced: 15 May 2025

https://github.com/golbin/TensorFlow-Tutorials

텐서플로우를 기초부터 응용까지 단계별로 연습할 수 있는 소스 코드를 제공합니다

autoencoder chatbot cnn deep-learning dqn gan inception mnist neural-network rnn seq2seq tensorflow tutorial word2vec

Last synced: 01 May 2025

https://github.com/kavgan/nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

gensim machine-learning natural-language-processing nlp text-classification text-mining tf-idf word2vec

Last synced: 16 May 2025

https://github.com/skalskip/vlms-zero-to-hero

This series will take you on a journey from the fundamentals of NLP and Computer Vision to the cutting edge of Vision-Language Models.

bert-model clip computer-vision embeddings gpt gpt-2 lora natural-language-processing seq2seq vision-language-model word2vec

Last synced: 06 Oct 2025

https://github.com/dselivanov/text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.

glove latent-dirichlet-allocation natural-language-processing text-mining topic-modeling vectorization word-embeddings word2vec

Last synced: 16 May 2025

https://github.com/zhezhaoa/ngram2vec

Four word embedding models implemented in Python. Supporting arbitrary context features

analogy chinese embedding glove n-gram ngram ngram2vec ppmi svd word word-embedding word2vec

Last synced: 09 Apr 2025

https://github.com/inspirehep/magpie

Deep neural network framework for multi-label text classification

classification deep-learning machine-learning multi-label-classification neural-network nlp prediction word2vec

Last synced: 04 Apr 2025

https://github.com/hankcs/cs224n

CS224n: Natural Language Processing with Deep Learning Assignments Winter, 2017

cs224n deep-learning natural-language-processing rnn tensorflow word2vec

Last synced: 04 Apr 2025

https://github.com/zhang17173/Event-Extraction

基于法律裁判文书的事件抽取及其应用,包括数据的分词、词性标注、命名实体识别、事件要素抽取和判决结果预测等内容

cnn-classification deep-learning event-extraction judgment nlp word2vec

Last synced: 17 Jul 2025

https://github.com/relevanceai/vectorhub

Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 17 Feb 2026

https://github.com/RelevanceAI/vectorhub

Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 27 Apr 2025

https://github.com/gaoisbest/NLP-Projects

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding

dialogue-systems information-extraction information-retrieval knowledge-graph machine-reading-comprehension network-embedding pretrained-language-model sentence2vec sequence-labeling text-classification text-generation word2vec

Last synced: 07 Apr 2025

https://github.com/khanhnamle1994/natural-language-processing

Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning

deep-learning glove machine-learning natural-language-processing word2vec

Last synced: 05 Apr 2025

https://github.com/ynqa/wego

Word Embeddings in Go!

glove go machine-learning nlp word-embeddings word2vec

Last synced: 05 Apr 2025

https://github.com/ThoughtRiver/lmdb-embeddings

Fast word vectors with little memory usage in Python

embeddings fasttext gensim glove lmdb magnitude memory speed text vectors word word2vec

Last synced: 19 Jul 2025

https://github.com/bakrianoo/aravec

AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.

arabic embedded-models gensim nlp text-mining word2vec

Last synced: 20 Jan 2026

https://github.com/pkmital/pycadl

Python package with source code from the course "Creative Applications of Deep Learning w/ TensorFlow"

autoregressive celeba conditional course cyclegan dcgan deep-learning gan glove magenta mooc neural-network nsynth pixelcnn tensorflow tutorial vae vae-gan wavenet word2vec

Last synced: 04 Apr 2025

https://github.com/planeshifter/node-word2vec

Node.js interface to the Google word2vec tool.

nlp word2vec

Last synced: 16 May 2025

https://github.com/Planeshifter/node-word2vec

Node.js interface to the Google word2vec tool.

nlp word2vec

Last synced: 01 Apr 2025

https://github.com/brightmart/nlu_sim

all kinds of baseline models for sentence similarity 句子对语义相似度模型

atec nlu qa question-answering questions-and-answers semantic-similarity sentence-similarity similarity-measurement word2vec

Last synced: 09 Apr 2025

https://github.com/lujiaying/MovieTaster-Open

A practical movie recommend project based on Item2vec.

deep-learning item2vec word2vec

Last synced: 13 Jul 2025

https://github.com/dalinvip/cw2vec

cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information

cw2vec embeddings fasttext stroke-information word2vec

Last synced: 07 Apr 2025

https://github.com/30lm32/ml-projects

ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python

ab-testing deep-learning docker gensim geolocation imbalanced-data kdtree keras lstm-neural-networks machine-learning mlflow nlp random-forest spam-classification svm tensorboard tensorflow text-classification timeseries-analysis word2vec

Last synced: 08 May 2025

https://github.com/thesephist/revery

A personal semantic search engine capable of surfacing relevant bookmarks, journal entries, notes, blogs, contacts, and more, built on an efficient document embedding algorithm and Monocle's personal search index.

browser-extension natural-language-processing search-engine torus-dom word2vec

Last synced: 28 Jun 2025

https://github.com/bloomberg/koan

A word2vec negative sampling implementation with correct CBOW update.

cbow cpp skipgram word-embeddings word2vec

Last synced: 13 Apr 2025

https://github.com/oxford-cs-deepnlp-2017/practical-1

Oxford Deep NLP 2017 course - Practical 1: word2vec

deep-learning natural-language-processing nlp oxford word2vec

Last synced: 07 Apr 2025

https://github.com/tolga-b/debiaswe

Remove problematic gender bias from word embeddings.

debias gender-equality nips-2016 social-justice word-embeddings word2vec

Last synced: 07 Apr 2025

https://github.com/luopeixiang/textclf

TextClf :基于Pytorch/Sklearn的文本分类框架,包括逻辑回归、SVM、TextCNN、TextRNN、TextRCNN、DRNN、DPCNN、Bert等多种模型,通过简单配置即可完成数据处理、模型训练、测试等过程。

bert cnn-text-classification configurable document-classification dpcnn drnn glove logistic-regression lstm-text-classification neuralclassifier pytorch sentiment-analysis sklearn-classify svm textcnn textrnn word2vec

Last synced: 09 Apr 2025

https://github.com/devmount/germanwordembeddings

Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.

deep-learning deep-neural-networks evaluation gensim german-language model natural-language-processing neural-network nlp training word-embeddings word2vec

Last synced: 06 Apr 2025

https://github.com/akoksal/Turkish-Word2Vec

Pre-trained Word2Vec Model for Turkish

gensim nlp turkish word2vec

Last synced: 03 May 2025

https://github.com/akutuzov/webvectors

Web-ify your word2vec: framework to serve distributional semantic models online

distributional-semantics embedding-models flask gensim web-app word2vec

Last synced: 18 Jan 2026

https://github.com/sajari/word2vec

Go library for performing computations in word2vec binary models

embedding go golang word word2vec word2vec-model

Last synced: 08 May 2025

https://github.com/giacbrd/ShallowLearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

fasttext gensim machine-learning neural-network online-learning scikit-learn shallow-learning supervised-learning text-classification text-mining word-embeddings word2vec

Last synced: 19 Jul 2025

https://github.com/giacbrd/shallowlearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

fasttext gensim machine-learning neural-network online-learning scikit-learn shallow-learning supervised-learning text-classification text-mining word-embeddings word2vec

Last synced: 08 Oct 2025

https://github.com/OlgaChernytska/word2vec-pytorch

Implementation of the first paper on word2vec

deep-learning natural-language-processing pytorch word2vec

Last synced: 30 Sep 2025

https://github.com/fanglanting/skip-gram-pytorch

A complete pytorch implementation of skip-gram

embed pytorch spearman word2vec

Last synced: 19 Jul 2025

https://github.com/mkearney/textfeatures

👷‍♂️ A simple package for extracting useful features from character objects 👷‍♀️

feature-extraction machine-learning mkearney-r-package neural-network neural-networks r rstats text-mining word2vec

Last synced: 09 Apr 2025

https://github.com/natasha/navec

Compact high quality word embeddings for Russian language

embeddings glove nlp python quantization russian word2vec

Last synced: 05 Apr 2025

https://github.com/Lancern/asm2vec

An unofficial implementation of asm2vec as a standalone python package

asm2vec binary-analysis machine-learning nlp numpy python python3 unofficial word2vec

Last synced: 10 May 2025

https://github.com/lancern/asm2vec

An unofficial implementation of asm2vec as a standalone python package

asm2vec binary-analysis machine-learning nlp numpy python python3 unofficial word2vec

Last synced: 14 Sep 2025

https://github.com/dalinvip/pytorch_word2vec

Use pytorch to implement word2vec

pytorch word2vec

Last synced: 22 Apr 2025

https://github.com/cadene/skip-thoughts.torch

Porting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7

gru pretrained-models rnn skip-thoughts torch word2vec

Last synced: 15 Jun 2025

https://github.com/guenthermi/postgres-word2vec

utils to use word embedding models like word2vec vectors in a PostgreSQL database

inverted-index knn-search postgresql product-quantization similarity-search word-embeddings word2vec

Last synced: 14 Apr 2025

https://github.com/src-d/ml

sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees

ast machine-learning mloncode word2vec

Last synced: 30 Dec 2025

https://github.com/kefirski/pytorch_NEG_loss

NEG loss implemented in pytorch

python pytorch word2vec

Last synced: 07 May 2025

https://github.com/chatopera/wikidata-corpus

Train Wikidata with word2vec for word embedding tasks

wikidata word-embeddings word2vec

Last synced: 20 Mar 2025

https://github.com/hironsan/ja.text8

Japanese text8 corpus for word embedding.

corpus deep-learning machine-learning natural-language-processing word2vec

Last synced: 11 Aug 2025

https://github.com/lamyiowce/word2viz

Visualization of semantic similarities in word embeddings.

analogies d3js interactive visualization word2vec

Last synced: 19 Nov 2025