https://github.com/fal-ai/lavender-data

Load & evolve datasets efficiently




Please visit our docs at docs.lavenderdata.com for more information.

## Quick Start

### Installation

```bash
pip install lavender-data
```

### Start the server

```bash
lavender-data server start --init
```

```
lavender-data is running on 0.0.0.0:8000
UI is running on http://localhost:3000
API key created: la-...
```

Save the API key to use it in the next steps.

```bash
export LAVENDER_API_URL=http://0.0.0.0:8000
export LAVENDER_API_KEY=la-...
```
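
Before moving on, it can help to confirm that both variables are visible to Python. The sketch below assumes the client reads `LAVENDER_API_URL` and `LAVENDER_API_KEY` from the environment, which the export commands above suggest but do not state outright. `missing_lavender_env` is a hypothetical helper written for this illustration, not part of lavender-data.

```python
import os

# Variable names taken from the export commands above.
REQUIRED_VARS = ("LAVENDER_API_URL", "LAVENDER_API_KEY")


def missing_lavender_env(env=None):
    """Return the names of required variables that are unset or empty.

    Hypothetical helper; `env` defaults to the real process environment.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_VARS if not env.get(name)]


if __name__ == "__main__":
    missing = missing_lavender_env()
    if missing:
        print("Missing environment variables:", ", ".join(missing))
    else:
        print("Environment looks ready.")
```

Passing a plain dictionary as `env` makes the check easy to try without touching the real environment.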

### Create an example dataset

```bash
lavender-data client \
  datasets create \
  --name my_dataset \
  --uid-column-name id \
  --shardset-location https://docs.lavenderdata.com/example-dataset/images/
```
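
If you would rather drive this step from a script, the same command can be assembled in Python. This is a minimal sketch that only reuses the subcommands and flags shown above; `build_create_command` is a hypothetical helper, and the actual `subprocess.run` call is left commented out because it needs the CLI installed and the server running.

```python
import shlex
import subprocess


def build_create_command(name, uid_column_name, shardset_location):
    """Assemble the argument list for `lavender-data client datasets create`.

    Hypothetical helper; it only mirrors the flags from the shell example.
    """
    return [
        "lavender-data", "client", "datasets", "create",
        "--name", name,
        "--uid-column-name", uid_column_name,
        "--shardset-location", shardset_location,
    ]


cmd = build_create_command(
    "my_dataset",
    "id",
    "https://docs.lavenderdata.com/example-dataset/images/",
)
print(shlex.join(cmd))  # show the command that would be run

# Uncomment to actually run it (requires the CLI and a running server):
# subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting issues if a dataset name or location ever contains spaces.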

### Iterate over the dataset

```python
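import lavender_data.client as lavender

# Initialize the client before creating a loader.
lavender.init()

# Build a shuffled loader over the dataset created in the previous step.
iteration = lavender.LavenderDataLoader(
    dataset_name="my_dataset",
    shuffle=True,
    shuffle_block_size=10,
)

# Each sample is indexed by column name; "id" is the UID column chosen earlier.
for i in iteration:
    print(i["id"])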
```
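
When trying this out, you may only want to look at the first few samples rather than walk the whole dataset. The sketch below assumes nothing beyond what the loop above shows, namely that the loader is iterable and each sample can be indexed by column name. `first_ids` is a hypothetical helper, and the stand-in generator is only there so the example runs without a server.

```python
from itertools import islice


def first_ids(samples, limit=5, key="id"):
    """Collect `key` from at most `limit` samples of any iterable of mappings."""
    return [sample[key] for sample in islice(samples, limit)]


# Stand-in data so the sketch runs without a server; with a real
# loader you would pass `iteration` from the example above instead.
fake_samples = ({"id": n} for n in range(100))
print(first_ids(fake_samples, limit=3))  # → [0, 1, 2]
```

`islice` stops pulling from the iterable once `limit` samples have been read, so the remaining samples are never requested by this helper.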

