Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/josh0xA/darkdump
Open Source Intelligence Interface for Deep Web Scraping
Last synced: 1 day ago
- Host: GitHub
- URL: https://github.com/josh0xA/darkdump
- Owner: josh0xA
- License: mit
- Created: 2021-02-11T01:51:44.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2024-09-13T19:53:18.000Z (4 months ago)
- Last Synced: 2024-09-14T10:48:51.367Z (4 months ago)
- Language: Python
- Homepage:
- Size: 1.75 MB
- Stars: 806
- Watchers: 24
- Forks: 154
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- cybersources - Darkdump - line search engine for exploring the dark web. (🛠️ Tools)
README
# darkdump
## About
Darkdump is an OSINT interface for carrying out deep web investigations, written in Python. Users enter a search query, and Darkdump scrapes .onion sites relating to that query to try to extract emails, metadata, keywords, images, social media, etc. Darkdump retrieves sites via Ahmia.fi and scrapes those .onion addresses when connected via the Tor network.

## Installation
1) ``git clone https://github.com/josh0xA/darkdump``
2) ``cd darkdump``
3) ``python3 -m pip install -r requirements.txt``
4) ``python3 darkdump.py --help``

### Tor Configuration
To use Darkdump effectively, you need to configure Tor so that the script can control it via the Tor control port. Here's how to set up your `torrc` file and verify that Tor is running:

#### Step 1: Install Tor
If Tor is not already installed on your system, install it with your platform's package manager:

Debian/Kali/Ubuntu: `sudo apt install tor`
MacOS: `brew install tor`

#### Step 2: Configure the Tor torrc File
Locate your torrc file. This file is usually found at `/etc/tor/torrc` on Linux and sometimes Mac.

Add the following lines to your torrc to enable the control port and set a control port password:
```
ControlPort 9051
HashedControlPassword [YourHashedPasswordHere]
```
Replace `[YourHashedPasswordHere]` with a hashed password, which can be generated using the `tor --hash-password` command: `tor --hash-password "my_password"`

#### Step 3: Start Tor Service
Linux: `sudo systemctl start tor.service`
MacOS: `brew services start tor`

### Example Queries:
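The `--proxy` queries in this section assume that Tor is already running. As a quick preflight, the sketch below checks whether anything is accepting TCP connections on a given local port. It is not part of darkdump; the helper name `port_is_open` is hypothetical, and the only port taken from this README is the control port `9051` configured in the torrc step.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds.

    Hypothetical helper for illustration; darkdump does not ship this function.
    """
    try:
        # create_connection raises OSError (refused, unreachable, timed out) on failure.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Calling `port_is_open("127.0.0.1", 9051)` after starting the Tor service should return `True` if the `ControlPort 9051` line took effect. A `True` result only shows that some process is listening on that port; it does not confirm that the listener is Tor or that the control password is correct.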
`python3 darkdump.py -q "hacking" -a 10 --scrape --proxy` - search for 10 links and scrape each site
`python3 darkdump.py -q "free movies" -a 25` - don't scrape, just return 25 links for that query (does not require tor)
`python3 darkdump.py -q "marketplaces" -a 15 --scrape --proxy -i` - search for 15 links and scrape each site, as well as find and store images.

## Menu
```
     _            _       _                            __
  __| | __ _ _ __| | ____| |_   _ _ __ ___  _ __      / /
 / _` |/ _` | '__| |/ / _` | | | | '_ ` _ \| '_ \    / /
| (_| | (_| | |  |   < (_| | |_| | | | | | | |_) |  / /
 \__,_|\__,_|_|  |_|\_\__,_|\__,_|_| |_| |_| .__/  /_/   v3 by Josh Schiavone
                                           |_|

usage: darkdump.py [-h] [-v] [-q QUERY] [-a AMOUNT] [-p] [-i] [-s]

Darkdump is an interface for scraping the deepweb through Ahmia. Made by yours truly.

options:
  -h, --help            show this help message and exit
  -v, --version         returns darkdump's version
  -q QUERY, --query QUERY
                        the keyword or string you want to search on the deepweb
  -a AMOUNT, --amount AMOUNT
                        the amount of results you want to retrieve
  -p, --proxy           use tor proxy for scraping
  -i, --images          scrape images and visual content from the site
  -s, --scrape          scrape the actual site for content and look for keywords
```
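For readers who want to script around these flags, the help text above can be approximated with Python's standard `argparse` module. This is a reconstruction from the menu only, not darkdump's actual source: the function name `build_parser`, the `int` type on `--amount`, and the use of `store_true` for the switches are all assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the flags in darkdump's help menu (illustrative only)."""
    parser = argparse.ArgumentParser(
        prog="darkdump.py",
        description="Darkdump is an interface for scraping the deepweb through Ahmia.",
    )
    # Flag names and help strings are copied from the menu; types and actions are assumed.
    parser.add_argument("-v", "--version", action="store_true",
                        help="returns darkdump's version")
    parser.add_argument("-q", "--query", type=str,
                        help="the keyword or string you want to search on the deepweb")
    parser.add_argument("-a", "--amount", type=int,
                        help="the amount of results you want to retrieve")
    parser.add_argument("-p", "--proxy", action="store_true",
                        help="use tor proxy for scraping")
    parser.add_argument("-i", "--images", action="store_true",
                        help="scrape images and visual content from the site")
    parser.add_argument("-s", "--scrape", action="store_true",
                        help="scrape the actual site for content and look for keywords")
    return parser
```

Parsing `["-q", "hacking", "-a", "10", "--scrape", "--proxy"]` with this parser yields `query="hacking"`, `amount=10`, and `scrape` and `proxy` set to `True`, matching the first example query.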
## Visual
## Ethical Notice
The developer of this program, Josh Schiavone, is not responsible for misuse of this data gathering tool. Do not use darkdump to navigate websites that take part in any activity that is identified as illegal under the laws and regulations of your government. May God bless you all.

## License
MIT License
Copyright (c) Josh Schiavone