https://github.com/nathancordeiro/web-scraper
A Python GUI web scraper made with Beautiful Soup and PyQt5.
beautifulsoup4 pyqt5 python python-gui web-scraper
Last synced: 6 months ago
- Host: GitHub
- URL: https://github.com/nathancordeiro/web-scraper
- Owner: NathanCordeiro
- License: mit
- Created: 2024-04-27T06:58:41.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-05-26T18:25:52.000Z (over 1 year ago)
- Last Synced: 2025-02-07T00:20:20.404Z (8 months ago)
- Topics: beautifulsoup4, pyqt5, python, python-gui, web-scraper
- Language: Python
- Homepage:
- Size: 976 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
---
PYTHON WEB SCRAPER
---
A powerful and user-friendly web scraping tool built with Python, Beautiful Soup, and PyQt5.
About the Project
The Web Scraper Application is a desktop tool for scraping web pages through a graphical interface. Built with Python, Beautiful Soup, and PyQt5, it lets users fetch the HTML content of a website and save it for further analysis.
Features
- User-Friendly Interface: The application features an intuitive user interface, making it easy for users of all skill levels to navigate and use its features.
- Scraping Capabilities: Users can enter a URL and scrape the content of the corresponding webpage with a single click. The scraped content is displayed in real time within the application.
- HTML Content Saving: The application allows users to save the scraped HTML content to a text file for future reference or analysis.
- Task Bar Navigation: With a built-in task bar, users can switch between different functionalities such as scraping, viewing, and accessing the about section.
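The scraping step described in the features above can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the function names `fetch_html`, `prettify_html`, and `scrape`, as well as the 10-second timeout, are assumptions made for the example. Only standard `requests` and `bs4` calls are used.

```python
import requests
from bs4 import BeautifulSoup


def fetch_html(url, timeout=10.0):
    """Download the raw HTML of a page (hypothetical helper)."""
    response = requests.get(url, timeout=timeout)
    # Raise an exception for 4xx/5xx responses instead of returning an error page.
    response.raise_for_status()
    return response.text


def prettify_html(html):
    """Parse HTML with Beautiful Soup and return it re-indented."""
    soup = BeautifulSoup(html, "html.parser")
    return soup.prettify()


def scrape(url):
    """Fetch a page and return its prettified HTML as a string."""
    return prettify_html(fetch_html(url))
```

For example, `scrape("https://example.com")` would return the indented HTML of that page as a string, which a GUI could then show in a text widget.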
Getting Started
Prerequisites
- Python 3.x
- PyQt5 library
- BeautifulSoup4 library
- Requests library
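One way to check these prerequisites before launching the application is a small script like the one below. It is a sketch only and is not part of the repository: the names `REQUIRED_MODULES`, `missing_packages`, and `python_ok` are hypothetical, and the mapping assumes the usual import names (`PyQt5`, `bs4`, `requests`) for the listed libraries.

```python
import importlib.util
import sys

# Assumed mapping from pip package name to importable module name.
REQUIRED_MODULES = {
    "PyQt5": "PyQt5",
    "beautifulsoup4": "bs4",
    "requests": "requests",
}


def missing_packages(required=None):
    """Return the pip names of packages whose module cannot be found."""
    if required is None:
        required = REQUIRED_MODULES
    return [
        package
        for package, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


def python_ok(minimum=(3, 0)):
    """Check that the running interpreter is at least the given version."""
    return sys.version_info >= minimum
```

If `missing_packages()` returns a non-empty list, those packages would still need to be installed.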
Installation
- Clone the repository:
  git clone https://github.com/NathanCordeiro/WEB-SCRAPER.git
- Navigate to the project directory:
  cd WEB-SCRAPER
- Install the required dependencies:
  pip install -r requirements.txt
Usage
- Run the application by executing the main.py file:
  python main.py
- Enter the URL of the website you want to scrape in the designated input field.
- Click on the "Scrape" button to initiate the scraping process.
- The scraped HTML content will be displayed in the application's view section. Additionally, it will be saved to a text file named scraped_content.html.
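The saving step mentioned above could look something like the sketch below. The file name `scraped_content.html` comes from this README. The function name `save_html` and the choice of UTF-8 encoding are assumptions for illustration, not the application's confirmed behavior.

```python
from pathlib import Path


def save_html(html, path="scraped_content.html"):
    """Write scraped HTML to a file and return the path that was written.

    The default file name matches the one named in the usage notes;
    UTF-8 is an assumed encoding.
    """
    target = Path(path)
    target.write_text(html, encoding="utf-8")
    return target
```

Calling `save_html(html)` with the default path would overwrite any existing `scraped_content.html` in the current working directory, so a different path can be passed to keep earlier results.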
Contributing
Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
- Fork the Project
- Create your Feature Branch (git checkout -b feature/AmazingFeature)
- Commit your Changes (git commit -m 'Add some AmazingFeature')
- Push to the Branch (git push origin feature/AmazingFeature)
- Open a Pull Request
License
Distributed under the MIT License. See LICENSE for more information.
Contact
Nathan Cordeiro - nathanjohncordeiro@gmail.com
Project Link: https://github.com/NathanCordeiro/WEB-SCRAPER