Projects in Awesome Lists tagged with data-version-control
A curated list of projects in awesome lists tagged with data-version-control.
https://github.com/dolthub/dolt
Dolt – Git for Data
command-line data-version-control data-versioning database database-version-control database-versioning decentralized-database git git-database git-for-data git-for-databases git-sql golang immutable-database mariadb mysql sql version-controlled-database
Last synced: 12 May 2025
https://github.com/iterative/dvc
🦉 Data Versioning and ML Experiments
ai data-science data-version-control developer-tools machine-learning reproducibility unstructured-data
Last synced: 12 May 2025
https://github.com/treeverse/lakefs
lakeFS - Data version control for your data lake | Git for data
apache-spark apache-sparksql aws-s3 azure-blob-storage azure-storage data-engineering data-lake data-quality data-version-control data-versioning datalake datalakes git-for-data go golang google-cloud-storage hadoop-filesystem lakefs object-storage
Last synced: 24 Dec 2025
https://github.com/treeverse/lakeFS
lakeFS - Data version control for your data lake | Git for data
apache-spark apache-sparksql aws-s3 azure-blob-storage azure-storage data-engineering data-lake data-quality data-version-control data-versioning datalake datalakes git-for-data go golang google-cloud-storage hadoop-filesystem lakefs object-storage
Last synced: 20 Mar 2025
https://github.com/quiltdata/quilt
Quilt is a data mesh for connecting people with actionable data
data data-engineering data-version-control data-versioning parquet python serialization
Last synced: 13 May 2025
https://github.com/data-drift/data-drift
Metrics Observability & Troubleshooting
analytics bigquery context data-diffing data-governance data-lineage data-monitoring data-observability data-quality data-reliability data-version-control dbt dbt-metrics dbt-packages drill-down metrics reconciliation redshift semantic-layer snowflake
Last synced: 08 Oct 2025
https://github.com/splitgraph/sgr
sgr (command line client for Splitgraph) and the splitgraph Python library
data data-version-control developer-tools postgres postgresql python sql
Last synced: 13 Apr 2025
https://github.com/GitDataAI/jiaozifs
A Git-like Version Control File System for AI & Data Product Management.
aiops data-collaboration data-lake data-lineage data-product data-version-control data-versioning dataops digital-twins federated-learning git git-filesystem git-for-data git-interface jiaozifs jzfs mlops version-controlled-filesystem
Last synced: 03 Mar 2025
https://github.com/GitDataAI/jzfs
A Git-like Version Control File System for AI & Data Product Management.
aiops data-collaboration data-lake data-lineage data-product data-version-control data-versioning dataops digital-twins federated-learning git git-filesystem git-for-data git-interface jiaozifs jzfs mlops version-controlled-filesystem
Last synced: 04 Apr 2025
https://github.com/ropensci/gittargets
Data version control for reproducible analysis pipelines in R with {targets}.
data-science data-version-control data-versioning r r-package reproducibility reproducible-research rstats targets workflow
Last synced: 21 Aug 2025
https://github.com/zincware/zntrack
Create, visualize, run & benchmark DVC pipelines in Python & Jupyter notebooks.
data-science data-version-control developer-tools dvc git machine-learning python reproducibility
Last synced: 22 Jul 2025
https://github.com/zincware/ZnTrack
Create, visualize, run & benchmark DVC pipelines in Python & Jupyter notebooks.
data-science data-version-control developer-tools dvc git machine-learning python reproducibility
Last synced: 07 May 2025
https://github.com/shuyib/data-version-ctrl
Data version control with Makefile and DVC for a regression task to estimate insurance costs for certain individuals.
continuous-training data-version-control datascience docker dvc insurance-cost jupyter-notebook machinelearning makefiles mlem python regression-analysis
Last synced: 19 Apr 2025
https://github.com/ruifilipecampos/git-datasets
Declaratively create, transform, manage and version ML datasets.
ai data-version-control datasets git machine-learning
Last synced: 29 Oct 2025
https://github.com/md-emon-hasan/dvc-turotial
📂 Comprehensive guide on using DVC for efficient and reproducible machine learning projects, covering essential commands and workflows.
data-engineering data-management data-science data-version-control dvc mlops version-control
Last synced: 02 Mar 2025
https://github.com/blaz-cerpnjak/dvc-git-example
DVC - Data Version Control Basics
data-version-control dvc dvc-google-drive
Last synced: 16 Mar 2025
https://github.com/ajithvcoder/dvc-aws-s3-bucket-workflow-setup
Tutorial on connecting DVC to AWS S3 and running GitHub Actions
aws-s3 data-version-control dvc github-actions workflow
Last synced: 07 Oct 2025
https://github.com/pirocheto/template-ml-dvc
Template for machine-learning project with DVC
cookiecutter data-science data-version-control dvc machine-learning mlops python template
Last synced: 06 Apr 2025
https://github.com/marpogaus/dvc-stage
Stop programming common dvc stages. Configure them.
ai data-science data-version-control developer-tools dvc git machine-learning python reproducibility
Last synced: 17 Jun 2025
https://github.com/lsjsj92/data_version_control
Practice with Data Version Control (DVC)
data-version-control data-versioning dvc git github python
Last synced: 04 Oct 2025
https://github.com/harshstats/chicken-disease-classification-using-mlops-dvc-pipeline-
The Chicken Disease Classification Using MLOps DVC Pipeline project utilizes the VGG16 architecture to analyze images of chicken fecal matter, enabling early disease detection and reducing economic losses in poultry farming.
agritech aws azure ci-cd computer-vision data-version-control deep-learning disease-detection docker dvc github-actions image-classification keras machine-learning mlflow mlops poultry-health python tensorflow vgg16
Last synced: 06 Mar 2025
https://github.com/cr21/dvc-pytorch-lightning-mlops
Data Version Control (DVC), Hydra, and PyTorch Lightning integration for MLOps
cicd data-version-control dvc hydra mlops-workflow pytorch-lightning
Last synced: 04 Apr 2025
https://github.com/ajithvcoder/dvc-gdrive-workflow-setup
Tutorial on connecting DVC to Google Drive and running GitHub Actions
data-version-control dvc-pipeline github-actions github-workflows google-drive google-drive-api
Last synced: 08 Apr 2025
https://github.com/wlandau/user-conf-2022
useR! 2022 talk
data-version-control reproducibility targets targets-pipeline
Last synced: 08 Apr 2025