Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/dyarleniber/hands-on-machine-learning

This repository contains code examples, exercises, and projects related to the concepts covered in the book "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition" by Aurélien Géron.
https://github.com/dyarleniber/hands-on-machine-learning

ai artificial-intelligence keras machine-learning matpolotlib numpy pandas scikit-learn tensorflow

Last synced: about 2 months ago
JSON representation

This repository contains code examples, exercises, and projects related to the concepts covered in the book "Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition" by Aurélien Géron.

Awesome Lists containing this project

README

        

# Hands-On Machine Learning

This repository was created to document my progress and learnings from the book ["Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 3rd Edition" by Aurélien Géron](https://learning.oreilly.com/library/view/hands-on-machine-learning/9781098125967/).

## Overview

This repository contains code examples, exercises, and projects related to the concepts covered in the book.

## Table of Contents

- [Getting Started](#getting-started)
- [Projects](#projects)

## Getting Started

You can find the official tutorials page here: https://colab.research.google.com/github/ageron/handson-ml3/blob/main/index.ipynb

And the official code examples here: https://github.com/ageron/handson-ml3

To ensure a consistent environment for working locally with Python, you can use `pyenv` for managing Python versions and `Poetry` for dependency management.
In the `tutorials` directory you can find some basic instruction on how to install and use `pyenv` and `Poetry`.

> Alternatively you can use Google Colab to write and execute python code through the browser.

It's also recommended to have a prior knowledge of the following Python libraries: `NumPy`, `Pandas`, and `Matplotlib`. In the `tutorials` directory, you'll also find some basic examples of usage for each of these libraries, as well as examples of required basic math (such as linear algebra) and other related materials.

## Projects

In the `projects` directory, you'll find the source code of the project developed in each chapter.

## Real Data

There are thousands of open datasets to choose from, ranging across all sorts of domains. Here are a few places you can look to get data:

Popular open data sources:

- [OpenML.org](https://openml.org/)
- [Kaggle](https://www.kaggle.com/datasets)
- [Hugging Face Datasets](https://huggingface.co/datasets)
- [Paper with Code](https://paperswithcode.com/datasets)
- [UC Irvine Machine Learning Repository](https://archive.ics.uci.edu/)
- [Amazon's AWS datasets](https://registry.opendata.aws/)
- [TensorFlow Datasets](https://www.tensorflow.org/datasets)
- [Google's Dataset Search](https://datasetsearch.research.google.com/)
- [Microsoft's Open Data](https://msropendata.com/)
- [Data.gov](https://www.data.gov/)
- [EU Open Data Portal](https://data.europa.eu/euodp/en/home)

Meta portals:

- [Data Portals](https://dataportals.org/)
- [Awesome Public Datasets](https://github.com/awesomedata/awesome-public-datasets)
- [Open Data Monitor](https://opendatamonitor.eu/)
- [Wikipedia's list of datasets](https://en.wikipedia.org/wiki/List_of_datasets_for_machine-learning_research)
- [Quora's list of datasets](https://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public)
- [Reddit's r/datasets](https://www.reddit.com/r/datasets/)

## Machine Learning Project Checklist

This checklist can guide you through your machine learning projects. There are eight main steps:

1. Frame the problem and look at the big picture.
2. Get the data.
3. Explore the data to gain insights.
4. Prepare the data to better expose the underlying data patterns to machine learning algorithms.
5. Explore many different models and shortlist the best ones.
6. Fine-tune your models and combine them into a great solution.
7. Present your solution.
8. Launch, monitor, and maintain your system.