Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ababaee1/simplescrapertool
Python tool for extracting contents from websites using Selenium
- Host: GitHub
- URL: https://github.com/ababaee1/simplescrapertool
- Owner: ababaee1
- License: mit
- Created: 2024-10-08T21:52:09.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-10-08T22:00:25.000Z (4 months ago)
- Last Synced: 2024-10-29T04:20:39.347Z (3 months ago)
- Topics: python, selenium-webdriver, webscraping, webscraping-data
- Language: Python
- Homepage:
- Size: 4.88 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Web Scraping Tool for Contact Information
This Python-based utility extracts required information from websites using Selenium. It’s designed for educational and lawful purposes and should be used responsibly.

## ⚠️ Legal and Ethical Disclaimer
Before using this tool:

- Check Permissions: Review the website’s privacy policy and terms of service to confirm that data extraction is permitted.
- Privacy Compliance: Scrape responsibly and respect privacy laws (e.g., GDPR, CCPA).
- User Responsibility: Ensure compliance with all applicable laws and website policies. Misuse is solely the user’s responsibility.

## Features
- Flexible CSS Selectors: Customize selectors to target various websites.
- Automated Interaction: Clicks and extracts contact data, saving it to a CSV.
- Headless Operation: Runs in the background for efficient scraping.

## Installation
### Install Dependencies:
```bash
pip install selenium
```
### Setup:
- Update `driver_path` with the path to `chromedriver`.
- Customize CSS selectors in the script.

### Run the Script:
```bash
python scraper.py
```
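
### Example Sketch:
The contents of `scraper.py` are not shown in this README, so the following is only a minimal sketch of how the pieces described above (a `driver_path`, headless Chrome, customizable CSS selectors, CSV output) might fit together with the Selenium 4 API. The function names, the selectors, the URL, and the output filename are all hypothetical placeholders, not the repository's actual code.

```python
import csv

# Same string value as selenium.webdriver.common.by.By.CSS_SELECTOR,
# spelled out so the helpers below do not need Selenium at import time.
CSS_SELECTOR = "css selector"


def build_driver(driver_path):
    """Create a headless Chrome driver (assumes Selenium 4 is installed)."""
    from selenium import webdriver
    from selenium.webdriver.chrome.options import Options
    from selenium.webdriver.chrome.service import Service

    options = Options()
    options.add_argument("--headless=new")  # no visible browser window
    service = Service(executable_path=driver_path)
    return webdriver.Chrome(service=service, options=options)


def extract_fields(driver, url, selectors):
    """Load `url` and map each field name to the text of its first match."""
    driver.get(url)
    row = {}
    for field, selector in selectors.items():
        matches = driver.find_elements(CSS_SELECTOR, selector)
        # Use an empty string when the selector matches nothing.
        row[field] = matches[0].text.strip() if matches else ""
    return row


def save_rows(rows, fieldnames, csv_path):
    """Write a list of row dicts to a CSV file with a header line."""
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)


def main():
    driver_path = "/path/to/chromedriver"  # placeholder: set to your chromedriver
    # Hypothetical selectors: adjust to the target site's markup.
    selectors = {"name": ".contact-name", "email": ".contact-email"}
    driver = build_driver(driver_path)
    try:
        row = extract_fields(driver, "https://example.com/contact", selectors)
    finally:
        driver.quit()  # always release the browser process
    save_rows([row], list(selectors), "contacts.csv")
```

Calling `main()` would run one page through the pipeline. Keeping the selectors in a dictionary means that retargeting another site only requires editing that mapping.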