https://github.com/mrjsj/msfabricutils
Spark-free Python utilities for Microsoft Fabric focused on Data Engineering using Polars and delta-rs
- Host: GitHub
- URL: https://github.com/mrjsj/msfabricutils
- Owner: mrjsj
- License: mit
- Created: 2024-12-01T15:13:54.000Z (2 months ago)
- Default Branch: main
- Last Pushed: 2024-12-29T17:18:32.000Z (about 1 month ago)
- Last Synced: 2025-01-29T23:46:04.766Z (3 days ago)
- Topics: data-engineering, dataframe, delta-rs, deltalake, duckdb, fabric, microsoft-fabric, polars, python
- Language: Python
- Homepage: https://mrjsj.github.io/msfabricutils
- Size: 1.38 MB
- Stars: 18
- Watchers: 1
- Forks: 4
- Open Issues: 7
Metadata Files:
- Readme: README.md
- License: LICENSE
# MSFabricUtils
A collection of Spark-free Python utilities for working with Microsoft Fabric in Python Notebooks.
Mainly focused on Data Engineering tasks utilising [Polars](https://github.com/pola-rs/polars) and [delta-rs](https://github.com/delta-io/delta-rs).

![How to select Python Notebook](assets/images/select-python-notebooks.png)

## Installation
```bash
pip install msfabricutils
```

## Features
### Local development first
- Aims to provide a local development experience "within" Fabric: you can develop and run code locally while reading from and writing to Fabric Lakehouses. You can also read from and write to Azure Data Lake or local storage.

### ETL
- Read from delta tables or parquet files with automatic authentication to Azure Data Lake or OneLake
- Common transformations (add audit columns, reorder columns, deduplicate etc.)
- Load delta tables with one of the provided load methods (upsert, overwrite, append etc.)

### Thin Fabric API wrapper
Integrated functions to manage workspaces, lakehouses, notebooks, environments, libraries, etc.
For more details, see the [Fabric API Reference](https://mrjsj.github.io/msfabricutils/core/fabric-api/).

### Fabric API as CLI
CLI for managing workspaces, lakehouses, notebooks, environments, libraries, etc.

To get started, run:
```bash
msfu --help
```

![msfu CLI help](assets/images/cli-help.png)

### Fabric DuckDB Connection
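One feature listed in this section is table name referencing with 1-part to 4-part names. The sketch below shows one way such names could be resolved, under the assumption (not stated in this README) that the parts map to `workspace.lakehouse.schema.table`, `lakehouse.schema.table`, `lakehouse.table` and `table`. `TableRef` and `parse_table_name` are hypothetical names for illustration; the library's own resolution rules may differ.

```python
from typing import NamedTuple, Optional


class TableRef(NamedTuple):
    workspace: Optional[str]
    lakehouse: Optional[str]
    schema: Optional[str]
    table: str


def parse_table_name(name: str) -> TableRef:
    """Split a dot-separated name into its parts (assumed mapping)."""
    parts = name.split(".")
    if not 1 <= len(parts) <= 4 or any(not p for p in parts):
        raise ValueError(f"expected 1 to 4 dot-separated parts, got {name!r}")
    if len(parts) == 1:
        return TableRef(None, None, None, parts[0])
    if len(parts) == 2:
        # Assumption: a 2-part name is lakehouse.table, not schema.table.
        return TableRef(None, parts[0], None, parts[1])
    if len(parts) == 3:
        return TableRef(None, parts[0], parts[1], parts[2])
    return TableRef(*parts)


print(parse_table_name("sales"))
print(parse_table_name("my_workspace.my_lakehouse.dbo.sales"))
```

Parts left as `None` would be filled from the current connection's defaults. Quoted identifiers and names that themselves contain dots are not handled in this sketch.
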
Seamless integration between DuckDB and Microsoft Fabric Lakehouses for data exploration and analysis.

- Cross-workspace and cross-lakehouse querying capabilities
- Automatic table registration
- Reading from and writing to OneLake, both outside and inside Fabric
- Support for Delta Lake tables
- Flexible table name referencing (1-part to 4-part names)

## Documentation
For quick start and detailed documentation, examples, and API reference, visit our [GitHub Pages documentation](https://mrjsj.github.io/msfabricutils/).

## Contributing
Contributions are welcome! Here are some ways you can contribute:

- Report bugs and feature requests through GitHub issues
- Submit pull requests for bug fixes or new features
- Improve documentation
- Share ideas for new utilities

## License
This project is licensed under the MIT License - see the LICENSE file for details.