https://github.com/treeverse/lakefs-playground-utils
A set of utilities to work interactively with a lakeFS playground from notebook environments
https://github.com/treeverse/lakefs-playground-utils
Last synced: about 2 months ago
JSON representation
A set of utilities to work interactively with a lakeFS playground from notebook environments
- Host: GitHub
- URL: https://github.com/treeverse/lakefs-playground-utils
- Owner: treeverse
- License: apache-2.0
- Created: 2022-11-22T09:19:04.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2022-12-01T11:13:02.000Z (over 3 years ago)
- Last Synced: 2025-02-10T14:03:14.246Z (over 1 year ago)
- Language: Python
- Size: 38.1 KB
- Stars: 0
- Watchers: 7
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# lakeFS Playground Utilities
## Using in a notebook:
```python
>>> !pip install lakefs-playground-utils # Syntax used in Google Colab, might be different for Jupyter, Databricks, etc.
>>>
>>> import playground
>>> conn = playground.get_or_create('myemail@example.com') # Creates a lakeFS playground environment, or returns an existing one
>>> playground.mount(conn) # Sets up a `lakefs://` protocol handler for pandas, pre-configured to read+write from our playground environment
```
Once set up, we can use pandas to read and write from a lakeFS repository on our playground installation:
```python
>>> import pandas as pd
>>> pd.read_parquet('lakefs://repo/branch/path/to/part-00000.snappy.parquet')
```
## License
Apache 2.0
See [LICENSE](./LICENSE)