https://github.com/jldbc/ipython-notebooks
A collection of small side projects and analyses
https://github.com/jldbc/ipython-notebooks
coffee ipython projects
Last synced: 4 days ago
JSON representation
A collection of small side projects and analyses
- Host: GitHub
- URL: https://github.com/jldbc/ipython-notebooks
- Owner: jldbc
- Created: 2016-09-28T02:46:33.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2017-12-02T06:30:05.000Z (over 7 years ago)
- Last Synced: 2025-03-31T13:23:35.004Z (23 days ago)
- Topics: coffee, ipython, projects
- Language: Jupyter Notebook
- Size: 1.79 MB
- Stars: 4
- Watchers: 4
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# A Collection of iPython Notebooks from Side Projects and One-Off Analyses
I often find myself getting carried away with data and models with no real goal of making a larger project out of the matter. This is where those types of projects will live. So far I have:
### Author Analysis
An NLP analysis of the career works of various authors. Here I look into the distribution of part-of-speech and punctuation profiles across authors, cluster them on both linguistic profile and word choice (via TF-IDF scores), and hierarchically cluster all the authors in my data. Data mostly comes from Project Gutenberg.### Coffee
Every cup of coffee I consumed between July 13th and December 21st, 2016 was logged as an entry in a spreadsheet I keep. This is the analysis of that data.### Election Forecasts
Some minimal analysis on polling and electoral college data. Still need to add Trump vs. Hillary to this.### Generative LSTM-RNN
Procedural text generation with recurrent neural networks using long short-term memory cells. Makes use of the Keras library.### Groupme Analytics
A notebook for data mining your Groupme. Requires you to get an API key. Find out who talks the most, writes the longest messages, gets the most likes, swears the most, etc. I highly recommend doing this.### Malicious URL Classification
Comparing the performance of three binary classifiers (logistic regression, random forest, and SVM) on the UCSD Malicious URL dataset (~1M rows, 3M features).### NBA Draft
An analysis of the career value of NBA draft picks, draft-day trades, and of tanking on the league standings in order to improve a team's lottery odds.