Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/diusmontenegro/web-scraper-tool

This project is a web scraping program that uses the BeautifulSoup library to extract data from a given URL. It extracts the href and text attributes from all the links on the page and stores them in a list.
https://github.com/diusmontenegro/web-scraper-tool

python3 tool web-scraper

Last synced: about 2 months ago
JSON representation

This project is a web scraping program that uses the BeautifulSoup library to extract data from a given URL. It extracts the href and text attributes from all the links on the page and stores them in a list.

Awesome Lists containing this project

README

        

# Web Scraper
This is a Python script that scrapes a webpage and extracts all the links in it, saving the data to a JSON file. It uses the following libraries:

requests
BeautifulSoup
json
tkinter
Installation
First, make sure that you have Python 3 and the necessary libraries installed. You can install them using pip:

## Copy code
pip install requests beautifulsoup4

## Usage
To use this script, follow these steps:

Run the script in your Python environment.
Enter the URL of the webpage you want to scrape in the URL entry widget.
Click the "Browse" button and select the location and name of the file you want to save the data to.
Click the "Start Scraping" button to begin the scraping process.
As the script runs, it will update a progress bar to show the progress of the scraping process. Once the scraping is complete, the data will be saved to the file you selected and the console will display a message indicating the file name and location.

### Note: this script is intended for educational purposes only and should not be used for unethical or illegal activities.

~Dre(Dius)