Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by kavgan

A curated list of projects in awesome lists by kavgan .

https://github.com/kavgan/nlp-in-practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

gensim machine-learning natural-language-processing nlp text-classification text-mining tf-idf word2vec

Last synced: 30 Oct 2024

https://github.com/kavgan/rouge-2.0

ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode text evaluation, CSV output.

evaluation evaluation-toolkit java metrics nlp rouge rouge-l rouge-n rouge-s rouge-su text-summarization unicode-text

Last synced: 30 Oct 2024

https://github.com/kavgan/phrase-at-scale

Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English

collocation-extraction multiword-expressions multiword-extraction natural-language-processing nlp nlp-machine-learning phrase-discovery phrase-extraction pyspark spark

Last synced: 30 Oct 2024

https://github.com/kavgan/opinosis-summarization

This repo contains code and dataset for the Opinosis Summarization Framework

algorithm graph graph-nlp opinosis opinosis-summarizer summarization-framework

Last synced: 30 Oct 2024

https://github.com/kavgan/word_cloud

Python word cloud library for use within Jupyter notebook and Python apps.

cloud-library jupyter-notebook nlp python visualization word-cloud wordcloud

Last synced: 30 Oct 2024

https://github.com/kavgan/opinrank

OpinRank Dataset. Dataset containing user reviews for entities namely cars and hotels. Full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews)

dataset entity-ranking opinions opinrank-dataset user-reviews

Last synced: 30 Oct 2024

https://github.com/kavgan/clinical-concepts

Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to mine related concepts by leveraging the volume within large amounts of clinical notes.

clinical-concepts clinical-nlp clinical-notes concept-graph graph-nlp nlp paper terminologies

Last synced: 30 Oct 2024

https://github.com/kavgan/spark-examples

Examples of code in spark

pyspark spark

Last synced: 30 Oct 2024

https://github.com/kavgan/micropinion-generation-dataset

Dataset for Micropinion Generation. Dataset is based on user reviews from CNET. The reviews are on products from various categories like tv, cell phones, gps etc.

cnet dataset micropinion-generation-dataset sentence sentences user-reviews

Last synced: 30 Oct 2024

https://github.com/kavgan/javapractice

Practice practice practice. Bubble sort, factorial, powerset, subarray, mergesort, remove duplicates, etc.

data-mining hmm java java-practice sorting

Last synced: 30 Oct 2024

https://github.com/kavgan/neuralnetworks-1

Experiments using neural networks in java

Last synced: 30 Oct 2024

https://github.com/kavgan/rouge-utility

Utility tools to prepare and evaluate ROUGE scores. Perl script to convert perl output of ROUGE to CSV.

evaluate-rouge-scores nlp rouge rouge-utility text-summarization

Last synced: 30 Oct 2024

https://github.com/kavgan/python-examples

Working examples in python

python python-examples

Last synced: 30 Oct 2024

https://github.com/kavgan/test-repo

Test repo

Last synced: 30 Oct 2024

https://github.com/kavgan/desmond

Importing/exporting functionality for the RedShift data warehouse

Last synced: 30 Oct 2024

https://github.com/kavgan/complaints-and-praises

Linguistic Understanding of Complaints and Praises in User Reviews. Paper talking about going beyond positive and negative sentiment categories. Complaints and Praise have properties that are different from positives and negatives

complaints linguistics nlp paper sentiment-analysis user-reviews

Last synced: 30 Oct 2024

https://github.com/kavgan/images

website images

Last synced: 30 Oct 2024