
# Multiprocess Email Address Scraper
![badge][badge-python]
![badge][badge-selenium]
![badge][badge-chrome]

This project is a **multiprocess email address scraper** for the De La Salle University website staff directory.

This is the major course output in an advanced operating systems class for master's students under Mr. Gregory G. Cu of the Department of Software Technology, De La Salle University. The task is to create an email address scraper that employs parallel programming techniques. The complete project specifications can be found in the document [`Project Specifications.pdf`](https://github.com/memgonzales/parallel-email-scraper/blob/master/Project%20Specifications.pdf).

- **Technical Paper:** [`Technical Paper.pdf`](https://github.com/memgonzales/parallel-email-scraper/blob/master/Technical%20Paper.pdf)
- **Video Demonstration:** https://www.youtube.com/watch?v=zYA5TIbF9UE

## Approach
Combining both functional and data decomposition, our proposed approach models the scraping task as a **multiple producer – multiple consumer problem**:
- The set of personnel IDs in the staff directory is divided by department, and multiple producers are mapped to different department directories. Each producer retrieves the personnel IDs from its assigned department directory and stores them in a synchronized queue.
- Concurrently, consumer subprocesses dequeue the IDs, visit the corresponding staff members' individual web pages, scrape the pertinent details (names, email addresses, and departments), and store these details in another queue.
- A dedicated writer subprocess dequeues the details from this queue and writes them to the output file (a minimal sketch of this pipeline is shown below).
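
To make this pipeline concrete, below is a minimal sketch of the multiple producer – multiple consumer structure using Python's `multiprocessing` queues. The department names, the consumer count, and the `fetch_ids_for_department` and `scrape_profile` helpers are hypothetical placeholders standing in for the repository's actual directory-parsing and Selenium code:

```python
from multiprocessing import Process, Queue

DEPARTMENTS = ["dept-a", "dept-b"]  # placeholder department directory names
NUM_CONSUMERS = 2                   # placeholder consumer count

def fetch_ids_for_department(department):
    # Placeholder: the real scraper parses one department directory page here.
    return [f"{department}-{i}" for i in range(3)]

def scrape_profile(personnel_id):
    # Placeholder: the real scraper drives Selenium to the staff member's page.
    return (f"Name {personnel_id}", f"{personnel_id}@example.com", "Department")

def produce(department, id_queue):
    # Producer: enqueue the personnel IDs of one assigned department directory.
    for personnel_id in fetch_ids_for_department(department):
        id_queue.put(personnel_id)

def consume(id_queue, result_queue):
    # Consumer: dequeue IDs, scrape the corresponding pages, enqueue the details.
    while True:
        personnel_id = id_queue.get()
        if personnel_id is None:  # sentinel: no more IDs to process
            break
        result_queue.put(scrape_profile(personnel_id))

def write(result_queue, path):
    # Dedicated writer: drain the results queue into the output file.
    with open(path, "w") as f:
        while True:
            record = result_queue.get()
            if record is None:    # sentinel: all consumers have finished
                break
            f.write(",".join(record) + "\n")

if __name__ == "__main__":
    id_queue, result_queue = Queue(), Queue()
    producers = [Process(target=produce, args=(d, id_queue)) for d in DEPARTMENTS]
    consumers = [Process(target=consume, args=(id_queue, result_queue))
                 for _ in range(NUM_CONSUMERS)]
    writer = Process(target=write, args=(result_queue, "Scraped_Emails.csv"))

    for p in producers + consumers + [writer]:
        p.start()
    for p in producers:
        p.join()
    for _ in consumers:           # one stop sentinel per consumer
        id_queue.put(None)
    for c in consumers:
        c.join()
    result_queue.put(None)        # tell the writer to stop
    writer.join()
```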

Running our proposed approach with five threads achieves a **7.22× superlinear speedup** compared to serial execution. Further experiments show that it achieves better scalability and performance than baseline parallel programming approaches that scrape from the root directory.


## Running the Scraper
1. Create a copy of this repository:

- If [git](https://git-scm.com/downloads) is installed, run the following command in the terminal:

```
git clone https://github.com/memgonzales/parallel-email-scraper
```

- If git is not installed, click the green `Code` button near the top right of the repository and choose [`Download ZIP`](https://github.com/memgonzales/parallel-email-scraper/archive/refs/heads/master.zip). Once the zipped folder has been downloaded, extract its contents.

2. Install [Google Chrome](https://www.google.com/chrome/). It is recommended to retain the default installation directory.

3. Install the necessary [dependencies](https://github.com/memgonzales/parallel-email-scraper#built-using). All the dependencies can be installed via `pip`.
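
   For instance, the pinned versions listed under [Built Using](https://github.com/memgonzales/parallel-email-scraper#built-using) can be installed with:

```
pip install selenium==4.7.2 webdriver-manager==3.8.5
```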

4. Run the following command in the terminal:
```
python scraper.py
```

5. The following output files will be produced once the program is finished running:
- `Scraped_Emails.csv` - A CSV file containing the scraped details (names, email addresses, and departments)
- `Website_Statistics.txt` - A text file containing the number of pages scraped, the number of email addresses found, and the URLs scraped

Sample [screenshots of the running program](https://github.com/memgonzales/parallel-email-scraper/tree/master/Sample%20Screenshots) and [output files](https://github.com/memgonzales/parallel-email-scraper/tree/master/Sample%20Output%20Files) are also provided in this repository.

## Built Using
This project was built using **Python 3.8**, with the following libraries and modules used:

Libraries/Modules | Description | License
--- | ---| ---
[Selenium 4.7.2](https://pypi.org/project/selenium/) | Provides functions for enabling web browser automation | Apache License 2.0
[Webdriver Manager 3.8.5](https://pypi.org/project/webdriver-manager/) | Simplifies management of binary drivers for different browsers | Apache License 2.0
[`multiprocessing`](https://docs.python.org/3/library/multiprocessing.html) | Offers both local and remote concurrency, effectively side-stepping the Global Interpreter Lock | Python Software Foundation License
[`time`](https://docs.python.org/3/library/time.html) | Provides various time-related functions | Python Software Foundation License

*The descriptions are taken from their respective websites.*
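
For reference, the snippet below shows how Selenium and Webdriver Manager are typically wired together in Selenium 4. This is a minimal sketch of the usual pattern, and the repository's `scraper.py` may differ in its exact setup and target pages:

```python
from selenium import webdriver
from selenium.webdriver.chrome.service import Service
from webdriver_manager.chrome import ChromeDriverManager

# Webdriver Manager downloads a chromedriver binary matching the installed
# Chrome version, so no manual driver management is needed.
driver = webdriver.Chrome(service=Service(ChromeDriverManager().install()))
driver.get("https://www.dlsu.edu.ph/")  # illustrative page, not the exact directory URL
print(driver.title)
driver.quit()
```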

## Authors
- Mark Edward M. Gonzales

mark_gonzales@dlsu.edu.ph

gonzales.markedward@gmail.com


- Hans Oswald A. Ibrahim

hans_oswald_ibrahim@dlsu.edu.ph

hans.ibrahim2001@gmail.com

[badge-python]: https://img.shields.io/badge/python-3670A0?style=flat&logo=python&logoColor=ffdd54
[badge-selenium]: https://img.shields.io/badge/Selenium-43B02A?style=flat&logo=Selenium&logoColor=white
[badge-chrome]: https://img.shields.io/badge/Google%20Chrome-4285F4?style=flat&logo=GoogleChrome&logoColor=white