https://github.com/alimghmi/blob-sync
A tiny app for extracting file metadata from Azure Blob Storage and synchronizing it with an MSSQL database.
- Host: GitHub
- URL: https://github.com/alimghmi/blob-sync
- Owner: alimghmi
- Created: 2024-01-29T10:12:57.000Z
- Default Branch: main
- Last Pushed: 2024-07-16T13:57:04.000Z
- Last Synced: 2025-01-11T16:50:37.658Z
- Topics: azure-blob-storage, cloud-storage, data-automation, file-indexing, file-management, mssql, python
- Language: Python
- Homepage:
- Size: 15.6 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# blob-sync
blob-sync is a Python application that fetches file metadata from Azure Blob Storage and stores it in an MSSQL database. It is particularly useful for building an index of specific file types (e.g., PDFs) stored in Azure Blob Storage.
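The fetch, filter, and store flow can be pictured with a minimal sketch. This is not the project's actual code: it assumes the `azure-storage-blob` and `pyodbc` packages, and the names `BlobRecord`, `fetch_records`, `filter_by_extension`, `to_rows`, `store_rows`, and the `blob_files` table with its columns are all hypothetical. It also does a plain insert rather than a full synchronization.

```python
from dataclasses import dataclass
from datetime import datetime
from pathlib import PurePosixPath
from typing import Iterable, Iterator, List, Optional, Tuple


@dataclass(frozen=True)
class BlobRecord:
    """Hypothetical row shape; the project's real schema may differ."""
    name: str
    size: int
    last_modified: Optional[datetime]


def fetch_records(connection_string: str, container_name: str) -> Iterator[BlobRecord]:
    """List blobs in a container and yield their basic metadata."""
    # Imported lazily so the pure helpers below work without the Azure SDK installed.
    from azure.storage.blob import ContainerClient

    container = ContainerClient.from_connection_string(connection_string, container_name)
    for props in container.list_blobs():
        yield BlobRecord(props.name, props.size, props.last_modified)


def filter_by_extension(blobs: Iterable[BlobRecord], extension: str = ".pdf") -> Iterator[BlobRecord]:
    """Yield only blobs whose name ends with the extension (case-insensitive)."""
    wanted = extension.lower()
    for blob in blobs:
        if PurePosixPath(blob.name).suffix.lower() == wanted:
            yield blob


def to_rows(blobs: Iterable[BlobRecord]) -> List[Tuple[str, int, Optional[datetime]]]:
    """Convert records into parameter tuples for a parameterized INSERT."""
    return [(b.name, b.size, b.last_modified) for b in blobs]


def store_rows(odbc_connection_string: str, rows: list, table: str = "blob_files") -> None:
    """Insert rows into an assumed table (name, size_bytes, last_modified)."""
    import pyodbc  # assumption: the database is reached through ODBC

    # The table name is interpolated, so it must come from trusted configuration.
    sql = f"INSERT INTO {table} (name, size_bytes, last_modified) VALUES (?, ?, ?)"
    conn = pyodbc.connect(odbc_connection_string)
    try:
        cursor = conn.cursor()
        cursor.executemany(sql, rows)
        conn.commit()
    finally:
        conn.close()
```

Under these assumptions, a run would chain the pieces as `store_rows(db_conn_str, to_rows(filter_by_extension(fetch_records(blob_conn_str, container))))`, with the connection strings read from the `.env` file.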
## Features
- Fetch file metadata from Azure Blob Storage.
- Filter files based on extension (default is set to .pdf).
- Store file metadata in an MSSQL database.
- Customizable for various file types and database schemas.

## Usage
- Install the required packages:
```
pip install -r requirements.txt
```
- Create a .env file according to .env.sample
- To run the application, execute the following command:
```
python main.py
```

## Contributing
Contributions to blob-sync are welcome. Please follow these guidelines:
- Fork the repository.
- Create a new branch for each feature or improvement.
- Send a pull request from each feature branch to the main branch.