Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with word2vec

A curated list of projects in awesome lists tagged with word2vec.

https://github.com/vi3k6i5/flashtext

Extract keywords from sentences or replace keywords in sentences.

data-extraction keyword-extraction nlp search-in-text word2vec

Last synced: 16 Dec 2024

https://github.com/shibing624/text2vec

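text2vec, text to vector. A text vector representation tool that converts text into vector matrices; it implements text representation and text similarity models including Word2Vec, RankBM25, Sentence-BERT, and CoSENT, ready to use out of the box.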

embeddings nlp sentence-embeddings similarity text-similarity text2vec word2vec

Last synced: 17 Dec 2024

https://github.com/paddlepaddle/paddlerec

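Recommendation Algorithm: a large-scale recommendation algorithm library covering classic and latest recommender system algorithms, including LR, Wide&Deep, DSSM, TDM, MIND, Word2Vec, Bert4Rec, DeepWalk, SSR, AITM, DSIN, SIGN, IPREC, GRU4Rec, Youtube_dnn, NCF, GNN, FM, FFM, DeepFM, DCN, DIN, DIEN, DLRM, MMOE, PLE, ESMM, ESCMM, MAML, xDeepFM, DeepFEFM, NFM, AFM, RALM, DMR, GateNet, NAML, DIFM, Deep Crossing, PNN, BST, AutoInt, FGCNN, FLEN, Fibinet, ListWise, DeepRec, ENSFM, TiSAS, AutoFIS, and others, along with classic recommender system datasets such as criteo and movielens.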

deepfm esmm gru4rec lr mmoe ple tdm widedeep word2vec

Last synced: 17 Dec 2024

https://github.com/PaddlePaddle/PaddleRec

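Recommendation Algorithm: a large-scale recommendation algorithm library covering classic and latest recommender system algorithms, including LR, Wide&Deep, DSSM, TDM, MIND, Word2Vec, Bert4Rec, DeepWalk, SSR, AITM, DSIN, SIGN, IPREC, GRU4Rec, Youtube_dnn, NCF, GNN, FM, FFM, DeepFM, DCN, DIN, DIEN, DLRM, MMOE, PLE, ESMM, ESCMM, MAML, xDeepFM, DeepFEFM, NFM, AFM, RALM, DMR, GateNet, NAML, DIFM, Deep Crossing, PNN, BST, AutoInt, FGCNN, FLEN, Fibinet, ListWise, DeepRec, ENSFM, TiSAS, AutoFIS, and others, along with classic recommender system datasets such as criteo and movielens.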

deepfm esmm gru4rec lr mmoe ple tdm widedeep word2vec

Last synced: 11 Nov 2024

https://github.com/alibaba/alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 17 Dec 2024

https://github.com/alibaba/Alink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

apriori classification clustering data-mining feature-engineering flink flink-machine-learning flink-ml fm graph-algorithms graph-embedding kafka machine-learning recommender recommender-system regression statistics word2vec xgboost

Last synced: 26 Oct 2024

https://github.com/danielfrg/word2vec

Python interface to Google word2vec

doc2vec python word2vec

Last synced: 26 Sep 2024

https://github.com/duoergun0729/nlp

By 兜哥 (Dou Ge): an open-source introductory book on NLP

ai fasttext nlp security word2vec

Last synced: 21 Dec 2024

https://github.com/kyubyong/wordvectors

Pre-trained word vectors of 30+ languages

fasttext language vector word2vec

Last synced: 21 Dec 2024

https://github.com/Kyubyong/wordvectors

Pre-trained word vectors of 30+ languages

fasttext language vector word2vec

Last synced: 09 Nov 2024

https://github.com/golbin/TensorFlow-Tutorials

Provides source code for practicing TensorFlow step by step, from the basics to applications.

autoencoder chatbot cnn deep-learning dqn gan inception mnist neural-network rnn seq2seq tensorflow tutorial word2vec

Last synced: 12 Nov 2024

https://github.com/golbin/tensorflow-tutorials

Provides source code for practicing TensorFlow step by step, from the basics to applications.

autoencoder chatbot cnn deep-learning dqn gan inception mnist neural-network rnn seq2seq tensorflow tutorial word2vec

Last synced: 20 Dec 2024

https://github.com/kavgan/nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

gensim machine-learning natural-language-processing nlp text-classification text-mining tf-idf word2vec

Last synced: 21 Dec 2024

https://github.com/dselivanov/text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.

glove latent-dirichlet-allocation natural-language-processing text-mining topic-modeling vectorization word-embeddings word2vec

Last synced: 25 Oct 2024

https://github.com/zhezhaoa/ngram2vec

Four word embedding models implemented in Python, supporting arbitrary context features.

analogy chinese embedding glove n-gram ngram ngram2vec ppmi svd word word-embedding word2vec

Last synced: 06 Nov 2024

https://github.com/inspirehep/magpie

Deep neural network framework for multi-label text classification

classification deep-learning machine-learning multi-label-classification neural-network nlp prediction word2vec

Last synced: 18 Dec 2024

https://github.com/hankcs/cs224n

CS224n: Natural Language Processing with Deep Learning, Assignments, Winter 2017

cs224n deep-learning natural-language-processing rnn tensorflow word2vec

Last synced: 21 Dec 2024

https://github.com/zhang17173/Event-Extraction

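Event extraction from legal judgment documents and its applications, including word segmentation, part-of-speech tagging, named entity recognition, event element extraction, and judgment outcome prediction.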

cnn-classification deep-learning event-extraction judgment nlp word2vec

Last synced: 25 Nov 2024

https://github.com/relevanceai/vectorhub

Vector Hub - Library for easy discovery and consumption of state-of-the-art models to turn data into vectors (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc.).

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 21 Dec 2024

https://github.com/RelevanceAI/vectorhub

Vector Hub - Library for easy discovery and consumption of state-of-the-art models to turn data into vectors (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc.).

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 11 Nov 2024

https://github.com/gaoisbest/NLP-Projects

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding

dialogue-systems information-extraction information-retrieval knowledge-graph machine-reading-comprehension network-embedding pretrained-language-model sentence2vec sequence-labeling text-classification text-generation word2vec

Last synced: 06 Nov 2024

https://github.com/khanhnamle1994/natural-language-processing

Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning

deep-learning glove machine-learning natural-language-processing word2vec

Last synced: 15 Dec 2024

https://github.com/ynqa/wego

Word Embeddings in Go!

glove go machine-learning nlp word-embeddings word2vec

Last synced: 21 Dec 2024

https://github.com/ThoughtRiver/lmdb-embeddings

Fast word vectors with little memory usage in Python

embeddings fasttext gensim glove lmdb magnitude memory speed text vectors word word2vec

Last synced: 27 Nov 2024

https://github.com/pkmital/pycadl

Python package with source code from the course "Creative Applications of Deep Learning w/ TensorFlow"

autoregressive celeba conditional course cyclegan dcgan deep-learning gan glove magenta mooc neural-network nsynth pixelcnn tensorflow tutorial vae vae-gan wavenet word2vec

Last synced: 20 Dec 2024

https://github.com/planeshifter/node-word2vec

Node.js interface to the Google word2vec tool.

nlp word2vec

Last synced: 20 Dec 2024

https://github.com/Planeshifter/node-word2vec

Node.js interface to the Google word2vec tool.

nlp word2vec

Last synced: 02 Nov 2024

https://github.com/brightmart/nlu_sim

All kinds of baseline models for sentence similarity (semantic similarity models for sentence pairs).

atec nlu qa question-answering questions-and-answers semantic-similarity sentence-similarity similarity-measurement word2vec

Last synced: 18 Dec 2024

https://github.com/lujiaying/MovieTaster-Open

A practical movie recommendation project based on Item2vec.

deep-learning item2vec word2vec

Last synced: 22 Nov 2024

https://github.com/dalinvip/cw2vec

cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information

cw2vec embeddings fasttext stroke-information word2vec

Last synced: 19 Dec 2024

https://github.com/30lm32/ml-projects

ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python

ab-testing deep-learning docker gensim geolocation imbalanced-data kdtree keras lstm-neural-networks machine-learning mlflow nlp random-forest spam-classification svm tensorboard tensorflow text-classification timeseries-analysis word2vec

Last synced: 15 Nov 2024

https://github.com/thesephist/revery

A personal semantic search engine capable of surfacing relevant bookmarks, journal entries, notes, blogs, contacts, and more, built on an efficient document embedding algorithm and Monocle's personal search index.

browser-extension natural-language-processing search-engine torus-dom word2vec

Last synced: 18 Nov 2024

https://github.com/bloomberg/koan

A word2vec negative sampling implementation with correct CBOW update.

cbow cpp skipgram word-embeddings word2vec

Last synced: 18 Dec 2024

https://github.com/oxford-cs-deepnlp-2017/practical-1

Oxford Deep NLP 2017 course - Practical 1: word2vec

deep-learning natural-language-processing nlp oxford word2vec

Last synced: 19 Dec 2024

https://github.com/tolga-b/debiaswe

Remove problematic gender bias from word embeddings.

debias gender-equality nips-2016 social-justice word-embeddings word2vec

Last synced: 18 Dec 2024

https://github.com/luopeixiang/textclf

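TextClf: a text classification framework based on Pytorch/Sklearn, including logistic regression, SVM, TextCNN, TextRNN, TextRCNN, DRNN, DPCNN, Bert, and other models; data processing, model training, testing, and other steps can be completed through simple configuration.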

bert cnn-text-classification configurable document-classification dpcnn drnn glove logistic-regression lstm-text-classification neuralclassifier pytorch sentiment-analysis sklearn-classify svm textcnn textrnn word2vec

Last synced: 20 Nov 2024

https://github.com/devmount/germanwordembeddings

Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.

deep-learning deep-neural-networks evaluation gensim german-language model natural-language-processing neural-network nlp training word-embeddings word2vec

Last synced: 16 Dec 2024

https://github.com/akoksal/Turkish-Word2Vec

Pre-trained Word2Vec Model for Turkish

gensim nlp turkish word2vec

Last synced: 12 Nov 2024

https://github.com/giacbrd/ShallowLearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

fasttext gensim machine-learning neural-network online-learning scikit-learn shallow-learning supervised-learning text-classification text-mining word-embeddings word2vec

Last synced: 27 Nov 2024

https://github.com/giacbrd/shallowlearn

An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

fasttext gensim machine-learning neural-network online-learning scikit-learn shallow-learning supervised-learning text-classification text-mining word-embeddings word2vec

Last synced: 18 Dec 2024

https://github.com/akutuzov/webvectors

Web-ify your word2vec: framework to serve distributional semantic models online

distributional-semantics embedding-models flask gensim web-app word2vec

Last synced: 27 Nov 2024

https://github.com/sajari/word2vec

Go library for performing computations in word2vec binary models

embedding go golang word word2vec word2vec-model

Last synced: 19 Dec 2024

https://github.com/OlgaChernytska/word2vec-pytorch

Implementation of the first paper on word2vec

deep-learning natural-language-processing pytorch word2vec

Last synced: 26 Sep 2024

https://github.com/fanglanting/skip-gram-pytorch

A complete pytorch implementation of skip-gram

embed pytorch spearman word2vec

Last synced: 27 Nov 2024

https://github.com/mkearney/textfeatures

👷‍♂️ A simple package for extracting useful features from character objects 👷‍♀️

feature-extraction machine-learning mkearney-r-package neural-network neural-networks r rstats text-mining word2vec

Last synced: 17 Dec 2024

https://github.com/natasha/navec

Compact high quality word embeddings for Russian language

embeddings glove nlp python quantization russian word2vec

Last synced: 15 Dec 2024

https://github.com/Lancern/asm2vec

An unofficial implementation of asm2vec as a standalone python package

asm2vec binary-analysis machine-learning nlp numpy python python3 unofficial word2vec

Last synced: 17 Nov 2024

https://github.com/lancern/asm2vec

An unofficial implementation of asm2vec as a standalone python package

asm2vec binary-analysis machine-learning nlp numpy python python3 unofficial word2vec

Last synced: 20 Dec 2024

https://github.com/cadene/skip-thoughts.torch

Porting of Skip-Thoughts pretrained models from Theano to PyTorch & Torch7

gru pretrained-models rnn skip-thoughts torch word2vec

Last synced: 19 Dec 2024

https://github.com/dalinvip/pytorch_word2vec

Use pytorch to implement word2vec

pytorch word2vec

Last synced: 29 Nov 2024

https://github.com/guenthermi/postgres-word2vec

Utilities for using word embedding models such as word2vec in a PostgreSQL database

inverted-index knn-search postgresql product-quantization similarity-search word-embeddings word2vec

Last synced: 08 Nov 2024

https://github.com/kefirski/pytorch_NEG_loss

NEG loss implemented in pytorch

python pytorch word2vec

Last synced: 14 Nov 2024

https://github.com/hironsan/ja.text8

Japanese text8 corpus for word embedding.

corpus deep-learning machine-learning natural-language-processing word2vec

Last synced: 13 Dec 2024

https://github.com/tharindudr/simple-sentence-similarity

Exploring the simple sentence similarity measurements using word embeddings

elmo fasttext glove ipynb python sentence-embeddings sentence-similarity wmd word-embeddings word2vec

Last synced: 20 Dec 2024

https://github.com/joisino/wordtour

Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022)

embeddings machine-learning natural-language-processing word-embeddings word2vec

Last synced: 27 Nov 2024

https://github.com/iamaziz/ar-embeddings

Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec

arabic arabic-embedding arabic-nlp arabic-sentiment embeddings sentiment-analysis word2vec word2vec-model

Last synced: 15 Dec 2024

https://github.com/guillaume-chevalier/glove-as-a-tensorflow-embedding-layer

Taking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.

cosine-similarity glove glove-embeddings gpu gpu-acceleration gpu-tensorflow neural-network tensorflow tensorflow-layers word-embeddings word2vec

Last synced: 09 Nov 2024

https://github.com/maxoodf/russian_news_corpus

Russian mass media stemmed texts corpus: a corpus of lemmatized (morphologically normalized) texts from Russian mass media

articles corpus machine-learning ml nlp nlp-machine-learning russian text word2vec

Last synced: 07 Dec 2024

https://github.com/benedekrozemberczki/BANE

A sparsity aware implementation of "Binarized Attributed Network Embedding" (ICDM 2018).

bane deepwalk diff2vec dimensionality-reduction embedding factorization fscnmf gemsec graph graph2vec icdm lane line musae node node2vec svd tadw tridnr word2vec

Last synced: 08 Nov 2024

https://github.com/benedekrozemberczki/bane

A sparsity aware implementation of "Binarized Attributed Network Embedding" (ICDM 2018).

bane deepwalk diff2vec dimensionality-reduction embedding factorization fscnmf gemsec graph graph2vec icdm lane line musae node node2vec svd tadw tridnr word2vec

Last synced: 14 Nov 2024

https://github.com/philipperemy/japanese-words-to-vectors

Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.

corpus gensim japanese japanese-language wikipedia word2vec word2vec-algorithm

Last synced: 02 Nov 2024