https://github.com/pcpp94/parsing_wrongly_formatted_excels
A tailored solution for a specific folder containing Excel files with varied and incorrect extensions (.xml, .xlsx, .xls). This repository automatically identifies and processes files with inconsistent formats, applies smart cleaning functions, and standardizes data structures, ensuring reliable data extraction and cleanup from an updated folder
https://github.com/pcpp94/parsing_wrongly_formatted_excels
etl excel
Last synced: about 1 year ago
JSON representation
A tailored solution for a specific folder containing Excel files with varied and incorrect extensions (.xml, .xlsx, .xls). This repository automatically identifies and processes files with inconsistent formats, applies smart cleaning functions, and standardizes data structures, ensuring reliable data extraction and cleanup from an updated folder
- Host: GitHub
- URL: https://github.com/pcpp94/parsing_wrongly_formatted_excels
- Owner: pcpp94
- Created: 2024-11-11T14:13:50.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-11-11T14:27:46.000Z (over 1 year ago)
- Last Synced: 2025-01-26T19:11:14.285Z (over 1 year ago)
- Topics: etl, excel
- Language: Python
- Homepage:
- Size: 7.81 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
**Automated Data Cleaner for Mixed-Format Excel Folder**
A tailored solution for a specific folder containing Excel files with varied and incorrect extensions (.xml, .xlsx, .xls). This repository automatically identifies and processes files with inconsistent formats, applies smart cleaning functions, and standardizes data structures, ensuring reliable data extraction and cleanup from a constantly updated folder with mixed file types.