Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/developmentseed/object-store-rs
A Python interface and pyo3 integration to the object-store crate
https://github.com/developmentseed/object-store-rs
azure google-cloud-storage object-store python rust s3
Last synced: 3 months ago
JSON representation
A Python interface and pyo3 integration to the object-store crate
- Host: GitHub
- URL: https://github.com/developmentseed/object-store-rs
- Owner: developmentseed
- License: mit
- Created: 2024-10-17T19:46:34.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2024-10-21T04:35:06.000Z (3 months ago)
- Last Synced: 2024-10-21T06:13:39.315Z (3 months ago)
- Topics: azure, google-cloud-storage, object-store, python, rust, s3
- Language: Rust
- Homepage: https://developmentseed.org/object-store-rs
- Size: 6.02 MB
- Stars: 8
- Watchers: 2
- Forks: 1
- Open Issues: 8
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# object-store-rs
A Python interface and [pyo3](https://github.com/PyO3/pyo3) integration to the Rust [`object_store`](https://docs.rs/object_store) crate, providing a uniform API for interacting with object storage services and local files.
Run the same code in multiple clouds via a simple runtime configuration change.
- Easy to install with no Python dependencies.
- Sync and async API.
- Streaming downloads with configurable chunking.
- Automatically supports [multipart uploads](https://docs.aws.amazon.com/AmazonS3/latest/userguide/mpuoverview.html) under the hood for large file objects.
- The [underlying Rust library](https://docs.rs/object_store) is production quality and used in large scale production systems, such as the Rust package registry [crates.io](https://crates.io/).
- Simple API with static type checking.
- Helpers for constructing from environment variables and `boto3.Session` objectsSupported object storage providers include:
- Amazon S3 and S3-compliant APIs like Cloudflare R2
- Google Cloud Storage
- Azure Blob Gen1 and Gen2 accounts (including ADLS Gen2)
- Local filesystem
- In-memory storage## Installation
```sh
pip install object-store-rs
```## Documentation
[Full documentation is available on the website](https://developmentseed.org/object-store-rs).
## Usage
### Constructing a store
Classes to construct a store are exported from the `object_store_rs.store` submodule:
- [`S3Store`](https://developmentseed.org/object-store-rs/latest/api/store/aws/): Configure a connection to Amazon S3.
- [`GCSStore`](https://developmentseed.org/object-store-rs/latest/api/store/gcs/): Configure a connection to Google Cloud Storage.
- [`AzureStore`](https://developmentseed.org/object-store-rs/latest/api/store/azure/): Configure a connection to Microsoft Azure Blob Storage.
- [`HTTPStore`](https://developmentseed.org/object-store-rs/latest/api/store/http/): Configure a connection to a generic HTTP server
- [`LocalStore`](https://developmentseed.org/object-store-rs/latest/api/store/local/): Local filesystem storage providing the same object store interface.
- [`MemoryStore`](https://developmentseed.org/object-store-rs/latest/api/store/memory/): A fully in-memory implementation of ObjectStore.#### Example
```py
import boto3
from object_store_rs.store import S3Storesession = boto3.Session()
store = S3Store.from_session(session, "bucket-name", config={"AWS_REGION": "us-east-1"})
```#### Configuration
Each store class above has its own configuration, accessible through the `config` named parameter. This is covered in the docs, and string literals are in the type hints.
Additional [HTTP client configuration](https://developmentseed.org/object-store-rs/latest/api/store/config/) is available via the `client_options` named parameter.
### Interacting with a store
All methods for interacting with a store are exported as **top-level functions** (not methods on the `store` object):
- [`copy`](https://developmentseed.org/object-store-rs/latest/api/copy/): Copy an object from one path to another in the same object store.
- [`delete`](https://developmentseed.org/object-store-rs/latest/api/delete/): Delete the object at the specified location.
- [`get`](https://developmentseed.org/object-store-rs/latest/api/get/): Return the bytes that are stored at the specified location.
- [`head`](https://developmentseed.org/object-store-rs/latest/api/head/): Return the metadata for the specified location
- [`list`](https://developmentseed.org/object-store-rs/latest/api/list/): List all the objects with the given prefix.
- [`put`](https://developmentseed.org/object-store-rs/latest/api/put/): Save the provided bytes to the specified location
- [`rename`](https://developmentseed.org/object-store-rs/latest/api/rename/): Move an object from one path to another in the same object store.There are a few additional APIs useful for specific use cases:
- [`get_range`](https://developmentseed.org/object-store-rs/latest/api/get/#object_store_rs.get_range): Get a specific byte range from a file.
- [`get_ranges`](https://developmentseed.org/object-store-rs/latest/api/get/#object_store_rs.get_ranges): Get multiple byte ranges from a single file.
- [`list_with_delimiter`](https://developmentseed.org/object-store-rs/latest/api/list/#object_store_rs.list_with_delimiter): List objects within a specific directory.
- [`sign`](https://developmentseed.org/object-store-rs/latest/api/sign/): Create a signed URL.All methods have a comparable async method with the same name plus an `_async` suffix.
#### Example
```py
import object_store_rs as obsstore = obs.store.MemoryStore()
obs.put(store, "file.txt", b"hello world!")
response = obs.get(store, "file.txt")
response.meta
# {'path': 'file.txt',
# 'last_modified': datetime.datetime(2024, 10, 21, 16, 19, 45, 102620, tzinfo=datetime.timezone.utc),
# 'size': 12,
# 'e_tag': '0',
# 'version': None}
assert response.bytes() == b"hello world!"byte_range = obs.get_range(store, "file.txt", offset=0, length=5)
assert byte_range == b"hello"obs.copy(store, "file.txt", "other.txt")
assert obs.get(store, "other.txt").bytes() == b"hello world!"
```All of these methods also have `async` counterparts, suffixed with `_async`.
```py
import object_store_rs as obsstore = obs.store.MemoryStore()
await obs.put_async(store, "file.txt", b"hello world!")
response = await obs.get_async(store, "file.txt")
response.meta
# {'path': 'file.txt',
# 'last_modified': datetime.datetime(2024, 10, 21, 16, 20, 36, 477418, tzinfo=datetime.timezone.utc),
# 'size': 12,
# 'e_tag': '0',
# 'version': None}
assert await response.bytes_async() == b"hello world!"byte_range = await obs.get_range_async(store, "file.txt", offset=0, length=5)
assert byte_range == b"hello"await obs.copy_async(store, "file.txt", "other.txt")
resp = await obs.get_async(store, "other.txt")
assert await resp.bytes_async() == b"hello world!"
```## Comparison to object-store-python
[Read a detailed comparison](https://github.com/roeap/object-store-python/issues/24#issuecomment-2422689636) to [`object-store-python`](https://github.com/roeap/object-store-python), a previous Python library that also wraps the same Rust `object_store` crate.