https://github.com/jisnap/dvc_pipeline
This repository manages multiple datasets and ML models using DVC
https://github.com/jisnap/dvc_pipeline
dvc dvc-pipeline
Last synced: 8 months ago
JSON representation
This repository manages multiple datasets and ML models using DVC
- Host: GitHub
- URL: https://github.com/jisnap/dvc_pipeline
- Owner: JisnaP
- Created: 2025-02-26T15:28:49.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2025-02-26T16:07:07.000Z (over 1 year ago)
- Last Synced: 2025-02-26T17:20:14.747Z (over 1 year ago)
- Topics: dvc, dvc-pipeline
- Language: Python
- Homepage:
- Size: 20.5 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Overview
This project demonstrates how to use DVC (Data Version Control) with Git Tags to manage different versions of datasets and models. By leveraging git checkout and dvc checkout, you can seamlessly switch between different versions of data and models while maintaining reproducibility.
Prerequisites
Ensure you have the following installed:
Git
DVC (pip install dvc)
A remote storage (Google Drive, S3, Azure, or a local path) for DVC data tracking
This repository uses https://github.com/iterative/example-versioning and this has been created for thoroughly understanding how dvc can be used for version control data ,model and pipelines.