Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by kavgan
A curated list of projects in awesome lists by kavgan .
https://github.com/kavgan/nlp-in-practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
gensim machine-learning natural-language-processing nlp text-classification text-mining tf-idf word2vec
Last synced: 30 Oct 2024
https://github.com/kavgan/rouge-2.0
ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode text evaluation, CSV output.
evaluation evaluation-toolkit java metrics nlp rouge rouge-l rouge-n rouge-s rouge-su text-summarization unicode-text
Last synced: 30 Oct 2024
https://github.com/kavgan/phrase-at-scale
Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
collocation-extraction multiword-expressions multiword-extraction natural-language-processing nlp nlp-machine-learning phrase-discovery phrase-extraction pyspark spark
Last synced: 30 Oct 2024
https://github.com/kavgan/opinosis-summarization
This repo contains code and dataset for the Opinosis Summarization Framework
algorithm graph graph-nlp opinosis opinosis-summarizer summarization-framework
Last synced: 30 Oct 2024
https://github.com/kavgan/word_cloud
Python word cloud library for use within Jupyter notebook and Python apps.
cloud-library jupyter-notebook nlp python visualization word-cloud wordcloud
Last synced: 30 Oct 2024
https://github.com/kavgan/opinrank
OpinRank Dataset. Dataset containing user reviews for entities namely cars and hotels. Full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews)
dataset entity-ranking opinions opinrank-dataset user-reviews
Last synced: 30 Oct 2024
https://github.com/kavgan/clinical-concepts
Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to mine related concepts by leveraging the volume within large amounts of clinical notes.
clinical-concepts clinical-nlp clinical-notes concept-graph graph-nlp nlp paper terminologies
Last synced: 30 Oct 2024
https://github.com/kavgan/stop-words
Stop word lists
natural-language-processing nlp stopwords text-mining
Last synced: 30 Oct 2024
https://github.com/kavgan/hashtags_test
Test hashtags
engagement facebook-post google-hashtags instagram pound social-media trending-hashtags twitter twitter-ads
Last synced: 30 Oct 2024
https://github.com/kavgan/micropinion-generation-dataset
Dataset for Micropinion Generation. Dataset is based on user reviews from CNET. The reviews are on products from various categories like tv, cell phones, gps etc.
cnet dataset micropinion-generation-dataset sentence sentences user-reviews
Last synced: 30 Oct 2024
https://github.com/kavgan/javapractice
Practice practice practice. Bubble sort, factorial, powerset, subarray, mergesort, remove duplicates, etc.
data-mining hmm java java-practice sorting
Last synced: 30 Oct 2024
https://github.com/kavgan/neuralnetworks-1
Experiments using neural networks in java
Last synced: 30 Oct 2024
https://github.com/kavgan/rouge-utility
Utility tools to prepare and evaluate ROUGE scores. Perl script to convert perl output of ROUGE to CSV.
evaluate-rouge-scores nlp rouge rouge-utility text-summarization
Last synced: 30 Oct 2024
https://github.com/kavgan/desmond
Importing/exporting functionality for the RedShift data warehouse
Last synced: 30 Oct 2024
https://github.com/kavgan/complaints-and-praises
Linguistic Understanding of Complaints and Praises in User Reviews. Paper talking about going beyond positive and negative sentiment categories. Complaints and Praise have properties that are different from positives and negatives
complaints linguistics nlp paper sentiment-analysis user-reviews
Last synced: 30 Oct 2024