https://github.com/ThoughtWorksInc/cd4ml-workshop
Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshops
https://github.com/ThoughtWorksInc/cd4ml-workshop
Last synced: about 1 month ago
JSON representation
Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshops
- Host: GitHub
- URL: https://github.com/ThoughtWorksInc/cd4ml-workshop
- Owner: ThoughtWorksInc
- License: mit
- Archived: true
- Created: 2018-08-20T13:36:43.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2024-08-13T12:10:01.000Z (10 months ago)
- Last Synced: 2024-08-13T15:06:49.695Z (10 months ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 5.52 MB
- Stars: 316
- Watchers: 12
- Forks: 365
- Open Issues: 8
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
# Continuous Intelligence and CD4ML Workshop
*NOTE: We are archiving this repository, as it's not been maintained and updated recently.
We will keep it read-only for anyone interested in forking and evolving it independently*This workshop contains the sample application and machine learning code used for
the Continuous Delivery for Machine Learning (CD4ML) and Continuous Intelligence
workshop. This material has been developed and is continuously evolved by
[ThoughtWorks](www.thoughtworks.com/open-source) and has been presented in
conferences such as: Yottabyte 2018, World AI Summit 2018, Strata London 2019,
and others.## Pre-Requisites
In order to run this workshop, you will need:
* A valid Github account
* A working Docker setup (if running on Windows, make sure to use Linux containers)## Workshop Instructions
The workshop is divided into several steps, which build on top of each other.
Instructions for each exercise can be found under the
[`instructions`](./instructions) folder.*WARNING: the exercises build on top of each other, so you will not be able to
skip steps ahead without executing them.**WARNING 2: the workshop requires infrastructure that we only provision when
needed, therefore you won't be able to execute the exercises on your own that
require that shared infrastructure. We are working on a setup that allows
running the workshop locally, but that is work in progress.*## The Machine Learning Problem
We built a simplified solution to a Kaggle problem posted by Corporación Favorita,
a large Ecuadorian-based grocery retailer interested in improving their
[Sales Forecasting](https://www.kaggle.com/c/favorita-grocery-sales-forecasting/overview)
using data. For the purposes of this workshop, we have combined and simplified
their data sets, as our goal is not to find the best predictions, but to
demonstrate how to implement CD4ML.## Collaborators
The material, ideas, and content developed for this workshop were contributions
from (in alphabetical order):* [Arif Wider](https://github.com/arifwider)
* [Arun Manivannan](https://github.com/arunma)
* [Christoph Windheuser](https://github.com/ciwin)
* [Danilo Sato](https://github.com/dtsato)
* [Danni Yu](https://github.com/danniyu)
* [David Tan](https://github.com/davified)
* [Emily Grasmeder](https://github.com/emilyagras)
* [Emily Gorcenski](https://github.com/Gorcenski)
* [Jin Yang](https://github.com/yytina)
* [Jonathan Heng](https://github.com/jonheng)