An open API service indexing awesome lists of open source software.

https://github.com/nathancordeiro/web-scraper

A python GUI web scraper made with beautiful soup and PyQt5.
https://github.com/nathancordeiro/web-scraper

beautifulsoup4 pyqt5 python python-gui web-scraper

Last synced: 6 months ago
JSON representation

A python GUI web scraper made with beautiful soup and PyQt5.

Awesome Lists containing this project

README

          

---


PYTHON WEB SCRAPER

---


A powerful and user-friendly web scraping tool built with Python beautiful soup and PyQt5.


About the Project


The Web Scraper Application is a versatile and intuitive tool designed to make web scraping tasks effortless. Built using Python and PyQt5, it offers a sleek interface coupled with powerful functionality, allowing users to easily scrape content from websites and save it for further analysis.


Features




  • User-Friendly Interface: The application features an intuitive user interface, making it easy for users of all skill levels to navigate and utilize its features.


  • Scraping Capabilities: Users can enter a URL and scrape the content of the corresponding webpage with a single click. The scraped content is displayed in real-time within the application.


  • HTML Content Saving: The application allows users to save the scraped HTML content to a text file for future reference or analysis.


  • Task Bar Navigation: With a built-in task bar, users can seamlessly switch between different functionalities such as scraping, viewing, and accessing the about section.


Getting Started


Prerequisites



  • Python 3.x

  • PyQt5 library

  • BeautifulSoup4 library

  • Requests library

Installation



  1. Clone the repository:

  2. git clone https://github.com/NathanCordeiro/WEB-SCRAPER.git
  3. Navigate to the project directory:

  4. cd WEB-SCRAPER
  5. Install the required dependencies:

  6. pip install -r requirements.txt


Usage



  1. Run the application by executing the main.py file.

  2. python main.py
  3. Enter the URL of the website you want to scrape in the designated input field.

  4. Click on the "Scrape" button to initiate the scraping process.

  5. The scraped HTML content will be displayed in the application's view section. Additionally, it will be saved to a text file named scraped_content.html.


Contributing


Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.



  1. Fork the Project

  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)

  3. Commit your Changes (git commit -m 'Add some AmazingFeature')

  4. Push to the Branch (git push origin feature/AmazingFeature)

  5. Open a Pull Request


License


Distributed under the MIT License. See LICENSE for more information.


Contact


Nathan Cordeiro - nathanjohncordeiro@gmail.com


Project Link: https://github.com/NathanCordeiro/WEB-SCRAPER