{"id":28794435,"url":"https://github.com/iterative/workshop-e2e-gitops-ml","last_synced_at":"2025-07-20T17:04:50.254Z","repository":{"id":266451842,"uuid":"709213675","full_name":"iterative/workshop-e2e-gitops-ml","owner":"iterative","description":"A generated repository for a workshop on End2End GitOps-based ML","archived":false,"fork":false,"pushed_at":"2023-10-24T08:58:26.000Z","size":241,"stargazers_count":2,"open_issues_count":0,"forks_count":1,"subscribers_count":5,"default_branch":"main","last_synced_at":"2025-07-18T10:57:43.463Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/iterative.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-10-24T08:58:23.000Z","updated_at":"2024-12-13T17:40:21.000Z","dependencies_parsed_at":"2024-12-04T10:40:53.235Z","dependency_job_id":"847488ce-47e7-4fdb-b8cf-b5f37b6b7123","html_url":"https://github.com/iterative/workshop-e2e-gitops-ml","commit_stats":null,"previous_names":["iterative/workshop-e2e-gitops-ml"],"tags_count":0,"template":false,"template_full_name":"iterative/e2e-workshop-template","purl":"pkg:github/iterative/workshop-e2e-gitops-ml","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/iterative%2Fworkshop-e2e-gitops-ml","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/iterative%2Fworkshop-e2e-gitops-ml/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/iterative%2Fworkshop-e2e-gitops-ml/releases","manifests_url":"
https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/iterative%2Fworkshop-e2e-gitops-ml/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/iterative","download_url":"https://codeload.github.com/iterative/workshop-e2e-gitops-ml/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/iterative%2Fworkshop-e2e-gitops-ml/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":266161906,"owners_count":23885928,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-06-18T02:38:09.740Z","updated_at":"2025-07-20T17:04:50.247Z","avatar_url":"https://github.com/iterative.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Workshop - End2End GitOps ML pipeline\n\nThis is a Computer Vision (CV) project that solves the problem of segmenting out \nswimming pools from satellite images. \n\nWe use a slightly modified version of the [BH-Pools dataset](http://patreo.dcc.ufmg.br/2020/07/29/bh-pools-watertanks-datasets/):\nwe split the original 4k images into tiles of 1024x1024 pixels.\n\n\n## Initial setup\n\nPython 3.8+ is required to run code from this repo.\n\n\n1. Send the presenter the e-mail address associated with your GitHub account. You will be invited to collaborate on the workshop repository.\n1. Clone the repository locally\n\n    ```console\n    $ git clone https://github.com/iterative/workshop-e2e-gitops-ml\n    $ cd workshop-e2e-gitops-ml\n    ```\n\n1. 
Now let's install the requirements and create a virtual environment. You can use the included makefile and simply call\n\n    ```console\n    $ make venv\n    ```\n\n1. (Optional) Connect your account to DVC Studio for live experiment sharing\n\n    The workshop host should invite you to a team in DVC Studio using the GitHub-associated e-mail address. You can then simply use your GitHub account to log in to Studio.\n\n    \u003cimg src=\"sign-in-studio.png\" width=\"300\"\u003e\n\n\n    Once you are signed in, click on the icon in the top-right corner of the Studio UI and select Profile. Scroll down to Studio Access Token and copy it.\n\n    \u003cimg src=\"studio-token.png\" width=\"500\"\u003e\n\n\n    Then call the following command with `***` replaced by your token\n    ```console\n    dvc config --global studio.token ***\n    ```\n    This will allow DVC to communicate with Studio and share progress of live (ongoing) experiments with others.\n\n## 1. Getting raw data\n\nImport the raw data with DVC from our public dataset registry:\n\n```console\n$ dvc import https://github.com/iterative/dataset-registry use-cases/pool_data -o data/pool_data\n```\n\n## 2. Running your own experiments during the workshop\n\n1. Open the `params.yaml` file in the root of the repository and edit some values\n1. Run `dvc exp run`\n1. Observe the results in Studio, in the VS Code extension or on the command line\n1. Persist the experiment and add it to the (local and remote) Git history by calling\n    ```console\n    dvc exp push origin exp-name\n    ```\n    with `exp-name` replaced with your experiment name.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fiterative%2Fworkshop-e2e-gitops-ml","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fiterative%2Fworkshop-e2e-gitops-ml","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fiterative%2Fworkshop-e2e-gitops-ml/lists"}