Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/threatcode/proxy-spider
https://github.com/threatcode/proxy-spider
Last synced: 1 day ago
JSON representation
- Host: GitHub
- URL: https://github.com/threatcode/proxy-spider
- Owner: threatcode
- License: mit
- Created: 2023-12-30T01:59:25.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2025-02-07T02:33:57.000Z (4 days ago)
- Last Synced: 2025-02-07T03:25:50.324Z (4 days ago)
- Language: Python
- Size: 383 KB
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 7
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# proxy-spider
[![CI](https://github.com/threatcode/proxy-spider/actions/workflows/ci.yml/badge.svg)](https://github.com/threatcode/proxy-spider/actions/workflows/ci.yml)
![Screenshot](screenshot.png)
HTTP, SOCKS4, SOCKS5 proxies scraper and checker.
- Asynchronous.
- Uses regex to search for proxies (ip:port format) on a web page, allowing proxies to be extracted even from json without making changes to the code.
- It is possible to specify the URL to which to send a request to check the proxy.
- Can sort proxies by speed.
- Supports determining the geolocation of the proxy exit node.
- Can determine if the proxy is anonymous.You can get proxies obtained using this script in [threatcode/proxy-list](https://github.com/threatcode/proxy-list).
## Installation and usage
### Desktop
- Download and unpack [the archive with the program](https://github.com/threatcode/proxy-spider/archive/refs/heads/main.zip).
- Edit `config.ini` to your preference.
- Install [Python](https://python.org/downloads) (minimum required version is 3.7).
- Run the script that installs dependencies and starts `proxy-spider`:
- On Windows run `start.cmd`
- On Unix-like operating systems run `start.sh`### Termux
To use `proxy-spider` in Termux, knowledge of the Unix command-line interface is required.
- Download Termux from [F-Droid](https://f-droid.org/en/packages/com.termux/). [Don't download it from Google Play](https://github.com/termux/termux-app#google-play-store-deprecated).
- Run the following command (it will automatically update Termux packages, install Python, and download and install `proxy-spider`):
```bash
bash <(curl -fsSL 'https://raw.githubusercontent.com/threatcode/proxy-spider/main/install-termux.sh')
```
- Edit `~/proxy-spider/config.ini` to your preference using a text editor (vim/nano).
- To run `proxy-spider` use the following command:
```bash
cd ~/proxy-spider && sh start-termux.sh
```## Checking local proxy lists
To check the local proxy lists, start the Python HTTP server on your local machine by running the `python -m http.server --bind localhost` command in the folder with the proxy lists. After that, add links to the appropriate files in `config.ini`.
## Folders description
When the script finishes running, the following folders will be created (this behavior can be changed in the config):
- `proxies` - proxies with any anonymity level.
- `proxies_anonymous` - anonymous proxies.
- `proxies_geolocation` - same as `proxies`, but includes exit-node's geolocation.
- `proxies_geolocation_anonymous` - same as `proxies_anonymous`, but includes exit-node's geolocation.Geolocation format is `ip:port|Country|Region|City`.
## License
[MIT](LICENSE)