https://github.com/dde-labs/self-lake
Self sandbox for interacting with any lake storage such as Delta Lake, Iceberg, Hudi, etc.
deltalake hudi iceberg py311 self-learning
Last synced: 7 months ago
- Host: GitHub
- URL: https://github.com/dde-labs/self-lake
- Owner: dde-labs
- License: mit
- Created: 2025-01-14T02:11:17.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2025-03-05T06:56:09.000Z (7 months ago)
- Last Synced: 2025-03-05T07:31:53.837Z (7 months ago)
- Topics: deltalake, hudi, iceberg, py311, self-learning
- Language: Python
- Homepage:
- Size: 9.77 KB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Self: lakeHouse
This project was created to research and develop with lake storage engines such as
Delta Lake, Iceberg, etc.

**This project will focus on**:

- Open table formats such as Delta Lake, Iceberg, and Hudi (I think I can add
  Hive to this project.)
- Storage for these open table formats, such as MinIO and HDFS.

## :round_pushpin: Prerequisite

Iceberg config:

```dotenv
PYICEBERG_CATALOG__SANDBOX__TYPE=sql
PYICEBERG_CATALOG__SANDBOX__URI=sqlite:///./tmp/pyiceberg.db
PYICEBERG_CATALOG__SANDBOX__WAREHOUSE=file://./tmp
PYICEBERG_CATALOG__SANDBOX__INIT_CATALOG_TABLES=true
PYICEBERG_CATALOG__SANDBOX__POOL_PRE_PING=true
PYICEBERG_CATALOG__SANDBOX__ECHO=false
```

:fast_forward: Read more in the [official documentation](https://py.iceberg.apache.org/).

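PyIceberg picks up `PYICEBERG_*` settings from environment variables, so the values above have to be exported before the catalog is loaded. The sketch below is one possible way to do that with only the standard library. The helper names (`parse_dotenv`, `load_dotenv_file`, `open_sandbox_catalog`) and the `.env` file name are assumptions for illustration and are not part of this repository. The parser handles only plain `KEY=VALUE` lines and does not strip quotes.

```python
import os
from pathlib import Path


def parse_dotenv(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines; skip blank lines and # comments."""
    values: dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first "=" only, so values may contain "=" themselves.
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def load_dotenv_file(path: str = ".env") -> dict[str, str]:
    """Export the variables from a dotenv file into os.environ."""
    values = parse_dotenv(Path(path).read_text(encoding="utf-8"))
    os.environ.update(values)
    return values


def open_sandbox_catalog():
    """Load the catalog named "sandbox" (needs pyiceberg with SQL catalog support)."""
    # Imported lazily so the variables are already in the environment.
    # Assumption: PyIceberg may read its configuration at import time.
    from pyiceberg.catalog import load_catalog

    # The SQLite URI in the config points into ./tmp, so create it if missing.
    Path("tmp").mkdir(exist_ok=True)
    return load_catalog("sandbox")
```

Calling `load_dotenv_file()` and then `open_sandbox_catalog()` should return a catalog configured from the `SANDBOX` entries, provided PyIceberg and its SQL dependencies are installed.
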
## :speech_balloon: Contribute
I do not expect this project to see wide adoption because it has a specific purpose,
and for a long-term solution you can write your own code without depending on this
project. For now, you can open [a GitHub issue on this project :raised_hands:](https://github.com/dde-labs/self-lake/issues)
to report a bug or request a new feature.