https://github.com/nathancordeiro/web-scraper

A python GUI web scraper made with beautiful soup and PyQt5.
https://github.com/nathancordeiro/web-scraper

beautifulsoup4 pyqt5 python python-gui web-scraper

Last synced: 11 months ago
JSON representation

A python GUI web scraper made with beautiful soup and PyQt5.

Host: GitHub
URL: https://github.com/nathancordeiro/web-scraper
Owner: NathanCordeiro
License: mit
Created: 2024-04-27T06:58:41.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2024-05-26T18:25:52.000Z (almost 2 years ago)
Last Synced: 2025-02-07T00:20:20.404Z (about 1 year ago)
Topics: beautifulsoup4, pyqt5, python, python-gui, web-scraper
Language: Python
Homepage:
Size: 976 KB
Stars: 2
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

---

PYTHON WEB SCRAPER

---

A powerful and user-friendly web scraping tool built with Python beautiful soup and PyQt5.

About the Project

The Web Scraper Application is a versatile and intuitive tool designed to make web scraping tasks effortless. Built using Python and PyQt5, it offers a sleek interface coupled with powerful functionality, allowing users to easily scrape content from websites and save it for further analysis.

Features

User-Friendly Interface: The application features an intuitive user interface, making it easy for users of all skill levels to navigate and utilize its features.

Scraping Capabilities: Users can enter a URL and scrape the content of the corresponding webpage with a single click. The scraped content is displayed in real-time within the application.

HTML Content Saving: The application allows users to save the scraped HTML content to a text file for future reference or analysis.

Task Bar Navigation: With a built-in task bar, users can seamlessly switch between different functionalities such as scraping, viewing, and accessing the about section.

Getting Started

Prerequisites

Python 3.x

PyQt5 library

BeautifulSoup4 library

Requests library

Installation

Clone the repository:

git clone https://github.com/NathanCordeiro/WEB-SCRAPER.git

Navigate to the project directory:

cd WEB-SCRAPER

Install the required dependencies:

pip install -r requirements.txt

Usage

Run the application by executing the main.py file.

python main.py

Enter the URL of the website you want to scrape in the designated input field.

Click on the "Scrape" button to initiate the scraping process.

The scraped HTML content will be displayed in the application's view section. Additionally, it will be saved to a text file named scraped_content.html.

Contributing

Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

Fork the Project

Create your Feature Branch (git checkout -b feature/AmazingFeature)

Commit your Changes (git commit -m 'Add some AmazingFeature')

Push to the Branch (git push origin feature/AmazingFeature)

Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.

Contact

Nathan Cordeiro - nathanjohncordeiro@gmail.com

Project Link: https://github.com/NathanCordeiro/WEB-SCRAPER

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/nathancordeiro/web-scraper

Awesome Lists containing this project

README

PYTHON WEB SCRAPER

---

A powerful and user-friendly web scraping tool built with Python beautiful soup and PyQt5.

About the Project

Features

Getting Started

Prerequisites

Installation

Usage

Contributing

License

Contact

https://github.com/nathancordeiro/web-scraper

Awesome Lists containing this project

README

PYTHON WEB SCRAPER --- A powerful and user-friendly web scraping tool built with Python beautiful soup and PyQt5.

About the Project

Features

Getting Started

Prerequisites

Installation

Usage

Contributing

License

Contact

PYTHON WEB SCRAPER

---

A powerful and user-friendly web scraping tool built with Python beautiful soup and PyQt5.