awesome-data-science
Data science and programming resources for daily work
https://github.com/m-muecke/awesome-data-science
-
R
-
Links
-
Books
- Advanced R - Hadley Wickham
- R Packages: Organize, Test, Document, and Share Your Code - Hadley Wickham and Jennifer Bryan
- An Introduction to R - Alex Douglas, Deon Roos, Francesca Mancini, Ana Couto, David Lusseau
- R Markdown: The Definitive Guide - Yihui Xie, J. J. Allaire, Garrett Grolemund
- bookdown: Authoring Books and Technical Documents with R Markdown - Yihui Xie
- Advanced R Solutions - Malte Grosser, Henning Bumann, Hadley Wickham
- Efficient R Programming - C. Gillespie and R. Lovelace
- Cookbook for R - Winston Chang
- Hands-On Programming with R - Garrett Grolemund
- Functional Programming - Sara Altman, Bill Behrman, Hadley Wickham
- Mastering Software Development in R - Roger D. Peng
- R Markdown Cookbook - Yihui Xie, Christophe Dervieux, Emily Riederer
- R for Everyone - Jared Lander
- R in Production - Hadley Wickham
- Tidy design principles - Hadley Wickham
-
Blogs
-
Courses
-
-
Linux and Shell/Bash
-
R
- GitHub Curated List: List of Shell command-line frameworks
- The Missing Semester of Your CS Education: MIT lecture for shell, Vim, Git, etc. - MIT
- Bash Scripting Cheat Sheet
- GitHub Curated List: Bash Resources
- GitHub Curated List: Software for Linux
- GitHub Curated List: Linux Ecosystem Overview
- Data Science at the Command Line - Jeroen Janssens
- Advanced Bash Scripting Lecture - bwHPC
- Advanced Bash Scripting Guide - Mendel Cooper
- Linux Basics: E-Learning Module on Linux and Bash Fundamentals - bwHPC
-
-
Documentation and Style Guide
-
Web Development
-
Python
- Django - High-level Python web framework
- FastAPI - Modern, fast (high-performance), web framework for building APIs
- Streamlit - A faster way to build and share data apps
- Flask Web Development - Miguel Grinberg
- Django for Everybody Specialization - Coursera
- Django for Everybody (DJ4E) - Charles R. Severance
- Python Django Tutorial Series - Corey Schafer
- FastHTML - Modern web applications in pure Python
-
R
- Mastering Shiny - Hadley Wickham
- plumber - A web API generator for R
- blogdown: Creating Websites with R Markdown - Yihui Xie, Amber Thomas, Alison Presmanes Hill
- blogdown - Create websites with R Markdown
- Engineering Production-Grade Shiny Apps - Colin Fay, Sébastien Rochette, Vincent Guyader, Cervan Girard
- Outstanding User Interfaces with Shiny - David Granjon
- Shiny - Build interactive web applications
-
-
Python
-
Books
-
Courses
-
Links
-
-
Data Science
-
R
- Data Science Specialization - Coursera
- Statistical Inference via Data Science: A ModernDive into R and the Tidyverse - Chester Ismay and Albert Y. Kim
- Text Mining with R - Julia Silge and David Robinson
- Introduction to Statistical Learning - Gareth James, Daniela Witten, Trevor Hastie, Rob Tibshirani
- R Graphics Cookbook - Winston Chang
- GitHub Curated List: Data Science R
- Flexible and Robust Machine Learning Using mlr3 in R - Lars Kotthoff, Raphael Sonabend, Michel Lang, Bernd Bischl
- Mastering Spark with R - Javier Luraschi, Kevin Kuo, Edgar Ruiz
- R Programming for Data Science - Roger D. Peng
- Report Writing for Data Science in R - Roger D. Peng
- Introduction to Machine Learning (I2ML) - LMU Munich
- Supervised Machine Learning for Text Analysis in R - Emil Hvitfeldt and Julia Silge
-
Python
- Applied Data Science with Python Specialization - Coursera
- Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems - Aurélien Géron
- Introduction to Machine Learning with Python - Andreas C. Müller and Sarah Guido
- Think Stats 2e - Allen B. Downey
- Web Scraping with Python - Ryan Mitchell
- Foundations of Machine Learning - Bloomberg ML EDU
- GitHub Curated List: Data Science Python
- Python Data Science Handbook: Essential Tools for Working with Data - Jake VanderPlas
- Python for Data Analysis - Wes McKinney
-
Courses
-
Books
- Elements of Statistical Learning - Trevor Hastie, Robert Tibshirani and Jerome Friedman
- Statistical Learning with Sparsity: The Lasso and Generalization - Trevor Hastie, Robert Tibshirani, Martin Wainwright
- Interpretable Machine Learning - Christoph Molnar
- Advanced Statistical Computing - Roger D. Peng
- Convex Optimization - Stephen Boyd and Lieven Vandenberghe
- Linear Algebra Review - J. Zico Kolter
- Linear Algebra for Data Science with examples in R - Shaina Race Bennett
- Statistical Inference for Data Science - Brian Caffo
-
-
Time Series
-
R
- Forecasting: Principles and Practice - Rob J Hyndman and George Athanasopoulos
- An Introduction to Analysis of Financial Data with R - Ruey S. Tsay
- Multivariate Time Series Analysis: With R and Financial Applications - Ruey S. Tsay
- Nonlinear Time Series: Theory, Methods and Applications with R Examples - David S. Stoffer, Randal Douc, Éric Moulines
- Time Series Analysis and Its Applications With R Examples - Robert H. Shumway and David S. Stoffer
- Time Series: A Data Analysis Approach Using R - Robert H. Shumway and David S. Stoffer
- CRAN Task View: Time Series Analysis
-
Books
- Analysis of Financial Time Series - Ruey S. Tsay
- Forecasting for Economics and Business - Gloria González-Rivera
- Introduction to Time Series and Forecasting - Peter J. Brockwell and Richard A. Davis
- Nonlinear Time Series Analysis - Ruey S. Tsay and Rong Chen
- Time Series Analysis: Forecasting and Control - George E. P. Box, et al.
-
-
NLP
-
Python
- Natural Language Processing Nanodegree - Udacity
- Natural Language Processing with Python - Steven Bird, Ewan Klein, Edward Loper
- CS224n: Natural Language Processing with Deep Learning - Stanford University
- Applied Text Analysis with Python - Benjamin Bengfort, Rebecca Bilbro, Tony Ojeda
- A Code-First Intro to Natural Language Processing - fast.ai
- CS224U: Natural Language Understanding - Stanford University
-
-
Data Sets
-
R
- Yelp Academic Data Sets
- Kaggle Data
- Google Data Set Search
- University of Illinois at Chicago - Opinion Mining, Sentiment Analysis, and Opinion Spam Detection
- Quandl - Financial, Economic, and Alternative Datasets
- World Bank Open Data
- OECD Data Portal
- European Central Bank’s Data Warehouse
- Fama-French Data Library
- OpinRank Data – Reviews From TripAdvisor & Edmunds
- SNAP – Amazon Reviews
- U.S. Census Bureau Data
- University of Texas - Multifaceted Text Classification Datasets
- eurostat - Statistical Database of the European Commission
- UCI – Machine Learning Repository
-
-
Deep Learning
-
Finance
-
Python
- Artificial Intelligence for Trading Nanodegree - Udacity
- Financial Engineering and Risk Management Specialization
- Advances in Financial Machine Learning - Marcos López de Prado
- Machine Learning for Algorithmic Trading - Stefan Jansen
- Machine Learning in Finance: From Theory to Practice - Igor Halperin, Matthew F. Dixon, Paul Bilokon
- Python for Finance - Yves Hilpisch
- Investment Management with Python and Machine Learning Specialization - Coursera
-
R
- Applying Data Analytics in Finance - Coursera
- CRAN Task View: Empirical Finance
- Basic R for Finance - Diethelm Würtz, Tobias Setz, Yohan Chalabi, Longhow Lam, Andrew Ellis
- Financial Data and Models Using R - Clifford Ang
- Financial Optimisation with R - Enrico Schumann
- Financial Risk Modelling and Portfolio Optimization with R - Bernhard Pfaff
- Introduction to Computational Finance and Financial Econometrics with R - Eric Zivot
- Machine Learning for Factor Investing - Guillaume Coqueret and Tony Guida
- Portfolio Management with R - Enrico Schumann
- Portfolio Optimization with R/Rmetrics - Diethelm Würtz, Tobias Setz, Yohan Chalabi, William Chen, Andrew Ellis
- Topics in Empirical Finance with R and Rmetrics - Patrick Hénaff
- ECON 424/CFRM 462: Computational Finance and Financial Econometrics - University of Washington
- FRE7241 Algorithmic Portfolio Management - NYU
- Tidy Finance with R - Christoph Scheuch, Stefan Voigt, Patrick Weiss
-
Links
-
Books
- Bayesian Stability Concepts for Investment Managers - Diethelm Würtz, Tobias Setz
- Numerical Methods and Optimization in Finance - Manfred Gilli, Dietmar Maringer and Enrico Schumann
- Statistics and Data Analysis for Financial Engineering (with R examples) - David Ruppert and David S. Matteson
-
-
Computer Science
-
SQL
-
Quantitative Economics
-
Python
- QuantEcon - A high performance, open source Python code library for economics
- Advanced Quantitative Economics with Python - Thomas J. Sargent and John Stachurski
- Introduction to Economic Modeling and Data Science - Chase Coleman, Spencer Lyon, Jesse Perla, et al.
- Python Programming for Economics and Finance - Thomas J. Sargent and John Stachurski
- Quantitative Economics with Python - Thomas J. Sargent and John Stachurski
-
-
Docker
-
Econometrics
Programming Languages