Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jackrobison/bittorrent-dht-spider
A program to crawl the bittorrent dht, resulting in a list of magnet links and their resolved metadata
https://github.com/jackrobison/bittorrent-dht-spider
Last synced: 3 months ago
JSON representation
A program to crawl the bittorrent dht, resulting in a list of magnet links and their resolved metadata
- Host: GitHub
- URL: https://github.com/jackrobison/bittorrent-dht-spider
- Owner: jackrobison
- Created: 2015-12-08T14:54:39.000Z (about 9 years ago)
- Default Branch: master
- Last Pushed: 2015-12-08T15:13:15.000Z (about 9 years ago)
- Last Synced: 2023-03-01T13:01:54.818Z (almost 2 years ago)
- Language: Python
- Homepage:
- Size: 7.81 KB
- Stars: 15
- Watchers: 5
- Forks: 5
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
**Disclaimer:**
This program was designed for experimental purposes. It blindly downloads magnet links to acquire metadata and cannot verify the legitimacy of the publisher or the content. Use it at your own risk.
**Dependencies:**
mongod
pymongo
libtorrent-rasterbar
bencode**To start this program:**
1. Start mongod using whatever dbpath you'd like
2. Navigate to this folder and run 'python spider.py', use -h to for more about optional argumentsThe database this program builds has two collections, 'dht' and 'torrent'.
'dht' is populated by posts in the format: {'_id': infohash, 'address': [list of peer IP addresses]}
IPs are recorded to approximate popularity, which is necessary to determine how to prioritize metadata aquisition.'torrent' is populated by posts in the format:
{'magnet': magnet link, 'name':, torrent description, 'info': [{'path': file name, 'size': file size}]}