https://github.com/devprojectekla/webscraping
My first steps in web scraping using bs4 (BeautifulSoup) library on python. This one only allows you to scrap wallpapers from an hardcoded website. It allows you to download every wallpaper providing a keyword like 'landscape' or 'star wars' (or iterates on a list of keywords written in a .txt file). It iterates over all pages of the given keyword
https://github.com/devprojectekla/webscraping
gtk gui scraping web
Last synced: over 1 year ago
JSON representation
My first steps in web scraping using bs4 (BeautifulSoup) library on python. This one only allows you to scrap wallpapers from an hardcoded website. It allows you to download every wallpaper providing a keyword like 'landscape' or 'star wars' (or iterates on a list of keywords written in a .txt file). It iterates over all pages of the given keyword
- Host: GitHub
- URL: https://github.com/devprojectekla/webscraping
- Owner: DevprojectEkla
- License: agpl-3.0
- Created: 2023-01-08T09:01:54.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-01-08T10:12:11.000Z (over 3 years ago)
- Last Synced: 2025-01-27T16:47:45.822Z (over 1 year ago)
- Topics: gtk, gui, scraping, web
- Language: Python
- Homepage:
- Size: 35.2 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# WebScraping
My first steps in web scraping using bs4 (BeautifulSoup) library on python. This one only allows you to scrap wallpapers from an hardcoded website.
It allows you to download every wallpaper providing a keyword like 'landscape' or 'star wars' (or iterates on a list of keywords written in a .txt file).
It iterates over all pages of the given keyword.
There is two branches:
'main' is the CLI version with a simple user input to type one keyword (ex:'landscape', 'airplane', 'stars') and the number of pages you want to scrap.
'GTK' is the GUI version of the same app but it allows you to 'stop' scraping as you wish and restart with other keywords. You can also open a list of keyword or dynamically create a list, whereas the CLI version only load the default 'list.txt' if you just press enter instead of typing a keyword.
USAGE:
main branch: (make sure to create an environment with all requirements) just run 'python WebSiteScraping.py'
GTK branch: (it gives you a very basic User Interface): run the command 'python _GTK_MainWindow.py'