Projects in Awesome Lists tagged with webdataset
A curated list of projects in awesome lists tagged with webdataset.
https://github.com/webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
data-augmentation deep-learning pytorch webdataset webdataset-format
Last synced: 11 Dec 2025
https://github.com/huggingface/chug
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
computer-vision dataloading datasets distributed-training document-understanding multi-modal-learning pdf-document webdataset
Last synced: 14 Oct 2025
https://github.com/robvanvolt/DALLE-tools
DALLE-tools provides useful dataset utilities to improve your workflow with WebDatasets.
dataset-preparation datasets webdataset
Last synced: 08 May 2025
https://github.com/mlfoundations/webdataset-resharder
Efficiently process webdatasets
Last synced: 21 Apr 2025
https://github.com/hemumanju/carla-data-collector
Scripts to collect data from CARLA and save it in WebDataset format
Last synced: 05 Jan 2026