An open API service indexing awesome lists of open source software.

https://github.com/s-r-e-e-r-a-j/webextractor

WebExtractor is a powerful OSINT and ethical hacking tool developed in Python. It is used to extract email addresses, phone numbers, and links from a target website
https://github.com/s-r-e-e-r-a-j/webextractor

bugbounty-tool emailscraper information-gathering information-gathering-tool information-gathering-tools informationgathering linkscraper linux osint osint-python osint-tool osint-tools phonenumber-scrapping reconaissance termux termux-tool webscraper webscraping

Last synced: about 2 months ago
JSON representation

WebExtractor is a powerful OSINT and ethical hacking tool developed in Python. It is used to extract email addresses, phone numbers, and links from a target website

Awesome Lists containing this project

README

          

## WebExtractor
WebExtractor is a powerful **OSINT** and **ethical hacking tool developed in Python**. It is used to extract **email addresses**, **phone numbers**, and **links** (including visible, hidden, and social media links) from a target website. Designed for cybersecurity professionals, bug bounty hunters, and ethical hackers, it helps gather critical intelligence from web pages.

The extracted links can also assist in identifying potential vulnerabilities in the website, such as SQL injection (SQLi) points, open directories, exposed admin panels, or unvalidated input fields. These links serve as entry points for further vulnerability assessments and exploitation attempts during ethical hacking or penetration testing.

## Features
- **Extracts:**
- Emails
- Phone Numbers
- All Links (including visible, hidden, and social media links)

- Saves the extracted information for further analysis.

- Clean and organized output

- Works on Linux and Termux

- Simple CLI interface

- Lightweight and fast

## Compatibility
- Linux (Debian, RHEL, Arch, etc.)
- Termux (Android)

The tool automatically detects the environment and installs itself accordingly.

## Disclaimer
This tool is intended for educational and ethical OSINT purposes only. Use it only on websites you own or have explicit permission to analyze. The developer is not responsible for any misuse of this tool.

## Installation
**Step 1: Clone the Repository**
```bash
git clone https://github.com/s-r-e-e-r-a-j/WebExtractor.git
```
**step2: Navigate to the WebExtractor directory**
```bash
cd WebExtractor
```
**Step 3: Install Dependencies**
```bash
pip3 install -r requirements.txt
```
**Note for Kali, Parrot, Ubuntu 23.04+, Arch Linux users:**

If you see an error like:
```go
error: externally-managed-environment
```
then use:
```bash
pip3 install -r requirements.txt --break-system-packages
```

**Step 4: Run Installer (Linux or Termux)**
```bash
python3 install.py
```
**Then type `y` for install**

**Step 5: Run the Tool**
```bash
webextractor
```

## Usage
**Just run the tool:**
```bash
webextractor
```
1. Provide a valid URL when prompted.

2. Choose whether to extract email addresses, phone numbers, links (including visible, hidden, and social media links), or all three

3. It will display the **extracted emails, phone numbers, and links (including visible, hidden, and social media links)** in a clean format.

4. Optionally save the extracted data to a folder.

## Uninstallation
**Run the install.py script**
```bash
python3 install.py
```
Then type `n` for uninstall
## License
This project is licensed under the MIT License