Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ndonkohenri/metro.de-scraper
Scraping the metro.de website with python.
https://github.com/ndonkohenri/metro.de-scraper
Last synced: 26 days ago
JSON representation
Scraping the metro.de website with python.
- Host: GitHub
- URL: https://github.com/ndonkohenri/metro.de-scraper
- Owner: ndonkoHenri
- Created: 2023-08-07T18:41:33.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2023-09-22T09:18:10.000Z (about 1 year ago)
- Last Synced: 2024-10-08T05:59:52.437Z (about 1 month ago)
- Language: Python
- Size: 68.4 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
## Report: Scraping metro.de shopping website
Install the requirements (`splinter` - web automation, `openpyxl`, `pandas` - eases work with excel files, and `flet` - GUI) using:
```bash
pip install -r requirements.txt
```### GUI Capture
TBA### How to try?
TBA[//]: # (Download the files for your OS from the releases section.)
### Notes
- Running `utils.py` will run the automation, but without showing a GUI. If you want the GUI, then run `gui.py` ([flet](https://flet.dev) is used for it, so it should be installed).
- Required files if not in GUI mode: `source.xlsx`, `gui.py`, `utils.py`, `logs.txt`
- The project must contain a source file with named `source.xlsx` (hardcoded), which contains at least two columns with headers `"Metro Artikelnummer"` and `"Link"` (equally hardcoded). The code could be modified though to work without the first column, but the second is absolutely necessary(contains the direct link to each item).
- A new Excel file named `Results.xlsx` will be created at the end of the automation/scraping and contains the scraped results.
Made with ❤ by TheEthicalBoy!