https://github.com/m-farag/etlworkers
a Data Engineering package
https://github.com/m-farag/etlworkers
dataengineering python
Last synced: 10 months ago
JSON representation
a Data Engineering package
- Host: GitHub
- URL: https://github.com/m-farag/etlworkers
- Owner: M-Farag
- License: mit
- Created: 2021-08-07T20:46:40.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2021-08-13T21:56:25.000Z (over 4 years ago)
- Last Synced: 2025-03-14T22:18:17.845Z (10 months ago)
- Topics: dataengineering, python
- Language: Python
- Homepage:
- Size: 48.8 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
## ETL Workers
It is my way to express my love for python and my life passion for data science in general and explicitly for data engineering.
### About the package name:
I named it after the famous data-engineering pipeline approach: Extract, Transform, and Load.
### About the idea and package structure
It will contain multiple worker classes, And Each worker will fulfill certain data operations segmented by type.
For example, now it contains one worker called FileWorker, And it is performing file-related operations.
In the future, I will add more workers and more functions per worker. So it can perform complex operations under any applicable built with python.
### How to install & use
```
# In your terminal
pip install EtlWorkers
# In your python application
import EtlWorkers as ew
my_file = ew.workers.FileWorker('file_location_and_name.csv')
my_file.replace_a_word_in_each_line('old_word','new_word')
```