Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/adagio/ivoox_categories
Extract categories and subcategories from iVoox, using concurrent processes
async beautifulsoup beautifulsoup4 concurrency concurrent futures
- Host: GitHub
- URL: https://github.com/adagio/ivoox_categories
- Owner: adagio
- Created: 2018-11-25T01:46:02.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2018-12-01T22:51:18.000Z (about 6 years ago)
- Last Synced: 2024-11-07T09:11:36.669Z (3 months ago)
- Topics: async, beautifulsoup, beautifulsoup4, concurrency, concurrent, futures
- Language: Python
- Homepage:
- Size: 5.86 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Scraping iVoox Categories
This project extracts iVoox categories and subcategories.
## BeautifulSoup
BeautifulSoup4 is used to parse the HTML and extract the data.
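
As a rough illustration of the parsing step, here is a minimal sketch. The markup in `SAMPLE_HTML`, the `ul.categories` selector, and the `extract_categories` helper are all hypothetical; the real iVoox page structure and the repository's own code may look quite different.

```python
from bs4 import BeautifulSoup

# Hypothetical markup for illustration only; not taken from iVoox.
SAMPLE_HTML = """
<ul class="categories">
  <li><a href="/category/science">Science</a></li>
  <li><a href="/category/history">History</a></li>
</ul>
"""


def extract_categories(html):
    """Return a list of {"name", "url"} dicts, one per category link."""
    # "html.parser" is the parser bundled with Python, so no extra
    # parser package is needed.
    soup = BeautifulSoup(html, "html.parser")
    return [
        {"name": link.get_text(strip=True), "url": link["href"]}
        # Assumed selector: links inside a list with class "categories".
        for link in soup.select("ul.categories a[href]")
    ]
```

Calling `extract_categories(SAMPLE_HTML)` gives one dictionary per link, which could then be fed to a second pass that collects subcategories.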
## Async scraping
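Pages are scraped concurrently with `ProcessPoolExecutor` from `concurrent.futures`.
Results are collected with `as_completed`, which yields each future as soon as it finishes.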
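
The pattern of submitting work to a process pool and collecting results with `as_completed` could look roughly like the sketch below. `FAKE_SITE`, `fetch_subcategories`, `scrape_all`, and the `executor_cls` parameter are hypothetical names introduced for illustration. The worker reads from an in-memory dictionary instead of downloading pages, so this is not the repository's actual implementation.

```python
from concurrent.futures import ProcessPoolExecutor, as_completed

# Hypothetical stand-in data; a real worker would download and parse a page.
FAKE_SITE = {
    "Science": ["Physics", "Biology"],
    "History": ["Ancient", "Modern"],
}


def fetch_subcategories(category):
    """Worker: return the subcategories of one category (no network here)."""
    return FAKE_SITE[category]


def scrape_all(categories, executor_cls=ProcessPoolExecutor, max_workers=2):
    """Run one worker call per category and gather the results in a dict."""
    results = {}
    with executor_cls(max_workers=max_workers) as executor:
        # Remember which category each future belongs to.
        future_to_category = {
            executor.submit(fetch_subcategories, category): category
            for category in categories
        }
        # as_completed yields futures in completion order,
        # which may differ from submission order.
        for future in as_completed(future_to_category):
            results[future_to_category[future]] = future.result()
    return results
```

When run as a script, the call to `scrape_all(["Science", "History"])` should sit under an `if __name__ == "__main__":` guard, because process pools may re-import the main module on some platforms. The worker also has to be a module-level function so it can be pickled and sent to the worker processes.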