An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with webdataset

A curated list of projects in awesome lists tagged with webdataset.

https://github.com/webdataset/webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

data-augmentation deep-learning pytorch webdataset webdataset-format

Last synced: 11 Dec 2025

https://github.com/huggingface/chug

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

computer-vision dataloading datasets distributed-training document-understanding multi-modal-learning pdf-document webdataset

Last synced: 14 Oct 2025

https://github.com/robvanvolt/DALLE-tools

DALLE-tools provides useful dataset utilities to improve your workflow with WebDatasets.

dataset-preparation datasets webdataset

Last synced: 08 May 2025

https://github.com/mlfoundations/webdataset-resharder

Efficiently process webdatasets

webdataset webdataset-format

Last synced: 21 Apr 2025

https://github.com/hemumanju/carla-data-collector

Scripts to collect data from CARLA and save it in WebDataset format

carla-data pytorch webdataset

Last synced: 05 Jan 2026