An open API service indexing awesome lists of open source software.

https://github.com/moredure/drum

Golang implementation of the disk repository with update management (DRUM) framework as presented by Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, and Dmitri Loguinov in the paper "IRLbot: Scaling to 6 Billion Pages and Beyond"
https://github.com/moredure/drum

drum golang url webcrawler

Last synced: about 1 month ago
JSON representation

Golang implementation of the disk repository with update management (DRUM) framework as presented by Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, and Dmitri Loguinov in the paper "IRLbot: Scaling to 6 Billion Pages and Beyond"

Awesome Lists containing this project

README

          

# DRUM

[![GoDoc](https://godoc.org/github.com/moredure/drum?status.png)](https://godoc.org/github.com/moredure/drum)

Golang implementation of the disk repository with update management (DRUM) framework as presented by Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, and Dmitri Loguinov in the paper "IRLbot: Scaling to 6 Billion Pages and Beyond". Disk-based bloom filter alternative with storage capabilities.

Credits
- [DRUM - A C++ Implementation for the URL-seen Test of a Web Crawler](https://www.codeproject.com/Articles/36221/DRUM-A-C-Implementation-for-the-URL-seen-Test-of-a)
- [Scaling to 6 Billion Pages and Beyond](https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.145.7075&rep=rep1&type=pdf)
- [JDRUM](https://github.com/RovoMe/JDrum)