fintech-awesome-libraries
compilation of libraries that have been helpful for data analysis
https://github.com/Davelexic/fintech-awesome-libraries
Last synced: 4 days ago
JSON representation
-
Machine Learning
-
Automatic Plotting
- awesome-machine-learning
- Shogun - Machine learning toolbox.
- gym - A toolkit for developing and comparing reinforcement learning algorithms.
- H2O - Open Source Fast Scalable Machine Learning Platform.
- mlpack - A scalable C++ machine learning library (Python bindings).
- xLearn - High Performance, Easy-to-use, and Scalable Machine Learning Package.
- MLxtend - Extension and helper modules for Python's data analysis and machine learning libraries.
- xgboost - A scalable, portable, and distributed gradient boosting library.
- cuML - RAPIDS Machine Learning Library.
- MindsDB - MindsDB is an open source AI layer for existing databases that allows you to effortlessly develop, train and deploy state-of-the-art machine learning models using standard queries.
- causalml - Uplift modeling and causal inference with machine learning algorithms.
- Karate Club - An unsupervised machine learning library for graph structured data.
- sklearn-expertsys - Highly interpretable classifiers for scikit learn, producing easily understood decision rules instead of black box models.
- seqlearn - Sequence classification toolkit for Python.
- pystruct - Simple structured learning framework for Python.
- Sparkit-learn - PySpark + scikit-learn = Sparkit-learn.
- RuleFit - Implementation of the rulefit.
- pyGAM - Generalized Additive Models in Python.
- Little Ball of Fur - A library for sampling graph structured data.
- Reproducible Experiment Platform (REP) - Machine Learning toolbox for Humans.
- hyperlearn - 50%+ Faster, 50%+ less RAM usage, GPU support re-written Sklearn, Statsmodels.
- scikit-learn - The most popular Python library for Machine Learning.
- NuPIC - Numenta Platform for Intelligent Computing.
- metric-learn - Metric learning algorithms in Python.
-
Automated Machine Learning
- auto-sklearn - An automated machine learning toolkit and a drop-in replacement for a scikit-learn estimator.
- MLBox - A powerful Automated Machine Learning python library.
-
-
Theory
-
Automated Machine Learning
- Papers We Love
- Talks
- Algorithms
- Education - Learning and practicing.
- Algorithm Visualizations
- Artificial Intelligence
- Search Engine Optimization
- Competitive Programming
- Math
- Recursion Schemes - Traversing nested data structures.
-
-
Books
-
Cheats sheets
- Github Cheat sheet - A collection of cool hidden and not so hidden features of Git and GitHub.
- Git tips - Collection of tips for Git.
- Data Manipulation - Data mutation.
- Data Science Cheatsheet
-
Database Drivers
-
Automatic Plotting
- awesome-postgres
- psycopg2 - The most popular PostgreSQL adapter for Python.
- sqlite3 - (Python standard library) SQlite interface compliant with DB-API 2.0
- awesome-sqlite
- PyMySQL - A pure Python MySQL driver compatible to mysql-python.
- queries - A wrapper of the psycopg2 library for interacting with PostgreSQL.
- SuperSQLite - A supercharged SQLite library built on top of [apsw](https://github.com/rogerbinns/apsw).
- awesome-mysql
- mysqlclient - MySQL connector with Python 3 support ([mysql-python](https://sourceforge.net/projects/mysql-python/) fork).
-
-
Learn
-
Automated Machine Learning
- CLI Workshoppers - Interactive tutorials.
- Learn to Program
- Speaking
- Tech Videos
- Computer History
- Educational Games - Learn while playing.
- Product Management - Learn how to be a better product manager.
- Roadmaps - Gives you a clear route to improve your knowledge and skills.
- YouTubers - Watch video tutorials from YouTubers that teach you about technology.
- Dive into Machine Learning
-
-
Business
- Open Companies
- Places to Post Your Startup
- OKR Methodology - Goal setting & communication best practices.
- Indie - Independent developer businesses.
- Tools of the Trade - Tools used by companies on Hacker News.
- Clean Tech - Fighting climate change with technology.
- Wardley Maps - Provides high situational awareness to help improve strategic planning and decision making.
- Social Enterprise - Building an organization primarily focused on social impact that is at least partially self-funded.
- Engineering Team Management - How to transition from software development to engineering management.
- Developer-First Products - Products that target developers as the user.
- Leading and Managing - Leading people and being a manager in a technology company/environment.
-
Work
-
Automated Machine Learning
- Slack - Team collaboration.
- Communities
- Remote Jobs
- Productivity
- Niche Job Boards
- Programming Interviews
- Code Review - Reviewing code.
- Creative Technology - Businesses & groups that specialize in combining computing, design, art, and user experience.
-
-
Decentralized Systems
-
Automatic Plotting
- Bitcoin - Bitcoin services and tools for software developers.
- Ripple - Open source distributed settlement network.
- Non-Financial Blockchain - Non-financial blockchain applications.
- Ethereum - Distributed computing platform for smart contract development.
- Blockchain AI - Blockchain projects for artificial intelligence and machine learning.
- EOSIO - A decentralized operating system supporting industrial-scale apps.
- Waves - Open source blockchain platform and development toolset for Web 3.0 apps and decentralized solutions.
- Mastodon - Open source decentralized microblogging network.
- Corda - Open source blockchain platform designed for business.
- Substrate - Framework for writing scalable, upgradeable blockchains in Rust.
-
-
Cryptography
- cryptography - A package designed to expose cryptographic primitives and recipes to Python developers.
- passlib - Secure password storage/hashing library, very high level.
- pynacl - Python binding to the Networking and Cryptography (NaCl) library.
- paramiko - The leading native Python SSHv2 protocol library.
-
Data Analysis
- Pandas - A library providing high-performance, easy-to-use data structures and data analysis tools.
- Pandas Profiling - generates profile report off of pandas database.
- Blaze - NumPy and Pandas interface to Big Data.
- Open Mining - Business Intelligence (BI) in Pandas interface.
- Orange - Data mining, data visualization, analysis and machine learning through visual programming or scripts.
- Optimus - Agile Data Science Workflows made easy with PySpark.
- AWS Data Wrangler - Pandas on AWS.
-
Date and Time
-
Automatic Plotting
- Arrow - A Python library that offers a sensible and human-friendly approach to creating, manipulating, formatting and converting dates, times and timestamps.
- pytz - World timezone definitions, modern and historical.
- dateutil - Powerful extensions to the standard datetime module
- when.py - Providing user-friendly functions to help perform common date and time actions.
- luminol - Anomaly Detection and Correlation library.
- Prophet - Automatic Forecasting Procedure.
- PyFlux - Open source time series library for Python.
- moment - A Python library for dealing with dates/times.
- Chronyk - A Python 3 library for parsing human-written times and dates.
- bayesloop - Probabilistic programming framework that facilitates objective model selection for time-varying parameter models.
- tick - Module for statistical learning, with a particular emphasis on time-dependent modelling.
- PyTime - An easy-to-use Python module which aims to operate date/time/datetime by string.
- delorean - A library for clearing up the inconvenient truths that arise dealing with datetimes.
- tslearn - Machine learning toolkit dedicated to time-series data.
- maya - makes it very easy to parse a string and for changing timezones.
- Pendulum - Python datetimes made easy.
-
-
Websites
-
Automated Machine Learning
- Full Stack Python
- Python Cheatsheet
- The Hitchhiker’s Guide to Python
- Awesome Python @LibHunt
- Python ZEEF
- Pythonic News
- O'reilly
- Towards Data Science
- Reddit-Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- Towards Data Science
- What the f*ck Python!
- Ultimate Python study guide
- Towards Data Science
- kdnuggets
- Real Python
-
-
Newsletters
-
Automated Machine Learning
-
-
Data Visualization
-
Interactive plots
- Altair - Declarative statistical visualization library for Python. Can easily do many data transformation within the code to create graph
- Bokeh - Interactive Web Plotting for Python.
- animatplot - A python package for animating plots build on matplotlib.
- bqplot - Plotting library for IPython/Jupyter notebooks
- pyecharts - a charting and visualization library, to Python's interactive visual drawing library.
-
Map
-
General Purposes
- Matplotlib - Plotting with Python.
- seaborn - Statistical data visualization using matplotlib.
- prettyplotlib - Painlessly create beautiful matplotlib plots.
- python-ternary - Ternary plotting library for python with matplotlib.
- missingno - Missing data visualization module for Python.
- chartify - Python library that makes it easy for data scientists to create charts.
- physt - Improved histograms.
-
Automatic Plotting
-
-
Deployment
-
Tensor Flow
-
Automated Machine Learning
- TensorFlow - Computation using data flow graphs for scalable machine learning by Google.
- NeuPy - NeuPy is a Python library for Artificial Neural Networks and Deep Learning
- Elephas - Distributed Deep learning with Keras & Spark.
- qkeras - A quantization deep learning library.
- Mesh TensorFlow - Model Parallelism Made Easier.
- TFLearn - Deep learning library featuring a higher-level API for TensorFlow.
- Polyaxon - A platform that helps you build, manage and monitor deep learning models.
- tfdeploy - Deploy tensorflow graphs for fast evaluation and export to tensorflow-less environments running numpy.
- TensorFlow Fold - Deep learning with dynamic computation graphs in TensorFlow.
- tensorlm - Wrapper library for text generation / language models at char and word level with RNN.
- keras-contrib - Keras community contributions.
- Hyperas - Keras + Hyperopt: A very simple wrapper for convenient hyperparameter.
- Hera - Train/evaluate a Keras model, get metrics streamed to a dashboard in your browser.
- Spektral - Deep learning on graphs.
- Sonnet - TensorFlow-based neural network library.
- tensorpack - A Neural Net Training Interface on TensorFlow.
- tensorflow-upstream - TensorFlow ROCm port.
- Ludwig - A toolbox, that allows to train and test deep learning models without the need to write code.
- TensorLight - A high-level framework for TensorFlow.
-
-
Statistics
-
Automated Machine Learning
- statsmodels - Statistical modeling and econometrics in Python.
- Alphalens - Performance analysis of predictive (alpha) stock factors.
- scikit-posthocs - Pairwise Multiple Comparisons Post-hoc Tests.
- stockstats - Supply a wrapper ``StockDataFrame`` based on the ``pandas.DataFrame`` with inline stock statistics/indicators support.
- weightedcalcs - A pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.
- Pandas Profiling - Create HTML profiling reports from pandas DataFrame objects.
-
-
Evaluation
-
Automatic Plotting
- sklearn-evaluation - Model evaluation made easy: plots, tables and markdown reports.
- Metrics - Machine learning evaluation metric.
- recmetrics - Library of useful metrics and plots for evaluating recommender systems.
- AI Fairness 360 - Fairness metrics for datasets and ML models, explanations and algorithms to mitigate bias in datasets and models.
-
Programming Languages
Categories
Sub Categories
Keywords
awesome
36
awesome-list
32
python
30
machine-learning
23
list
16
deep-learning
15
data-science
14
tensorflow
9
statistics
7
keras
7
plotting
6
data-visualization
5
pandas
5
jupyter
5
visualization
5
artificial-intelligence
4
books
4
data-analysis
4
neural-network
4
automl
4
blockchain
3
xgboost
3
automated-machine-learning
3
optimization
3
distributed
3
scikit-learn
3
unsupervised-learning
3
neural-networks
3
hyperparameter-optimization
3
education
3
javascript
2
startups
2
interview-questions
2
slack
2
python3
2
product-management
2
scikit
2
database
2
git
2
programming-language
2
graph-embedding
2
network-embedding
2
node-embedding
2
network-science
2
networkx
2
distributed-systems
2
distributed-computing
2
spark
2
ml
2
deep-neural-networks
2