Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by piskvorky
A curated list of projects in awesome lists by piskvorky .
https://github.com/piskvorky/gensim
Topic Modelling for Humans
data-mining data-science document-similarity fasttext gensim information-retrieval machine-learning natural-language-processing neural-network nlp python topic-modeling word-embeddings word-similarity word2vec
Last synced: 28 Oct 2024
https://github.com/rare-technologies/gensim
Topic Modelling for Humans
data-mining data-science document-similarity fasttext gensim information-retrieval machine-learning natural-language-processing neural-network nlp python topic-modeling word-embeddings word-similarity word2vec
Last synced: 07 Aug 2024
https://github.com/RaRe-Technologies/gensim
Topic Modelling for Humans
data-mining data-science document-similarity fasttext gensim information-retrieval machine-learning natural-language-processing neural-network nlp python topic-modeling word-embeddings word-similarity word2vec
Last synced: 04 Aug 2024
https://github.com/piskvorky/smart_open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
boto bz2 file gzip-stream hacktoberfest hdfs python s3 streaming streaming-data webhdfs
Last synced: 28 Oct 2024
https://github.com/piskvorky/sqlitedict
Persistent dict, backed by sqlite3 and pickle, multithread-safe.
data-store multi-threading python sqlite
Last synced: 14 Oct 2024
https://github.com/piskvorky/gensim-data
Data repository for pretrained NLP models and NLP corpora.
corpora dataset gensim glove-model lda-model lsi-model pretrained-models word2vec-model
Last synced: 01 Nov 2024
https://github.com/RaRe-Technologies/bounter
Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.
Last synced: 08 Nov 2024
https://github.com/piskvorky/bounter
Efficient Counter that uses a limited (bounded) amount of memory regardless of data size.
Last synced: 01 Nov 2024
https://github.com/piskvorky/word_embeddings
Code for the blog post "Making Sense of Word2vec"
Last synced: 27 Oct 2024
https://github.com/piskvorky/topic_modeling_tutorial
Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"
Last synced: 27 Oct 2024
https://github.com/piskvorky/gensim-simserver
[NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]
Last synced: 27 Oct 2024
https://github.com/piskvorky/sim-shootout
Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neighbours-intro
Last synced: 28 Oct 2024
https://github.com/piskvorky/data_science_python
Source code for the "Practical Data Science in Python" tutorial
Last synced: 13 Oct 2024
https://github.com/RaRe-Technologies/sparsesvd
Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition
Last synced: 07 Aug 2024
https://github.com/piskvorky/sparsesvd
Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition
Last synced: 28 Oct 2024
https://github.com/piskvorky/gensim-wheels
Repository to build and test Gensim wheels
Last synced: 13 Oct 2024