https://github.com/chubek/potpourri
Parallelized web scraping library
https://github.com/chubek/potpourri
concurrency google google-programmable-search-engine google-search library parallel pip python python-library requests web web-api web-request-handling web-scraping
Last synced: 15 days ago
JSON representation
Parallelized web scraping library
- Host: GitHub
- URL: https://github.com/chubek/potpourri
- Owner: Chubek
- License: other
- Created: 2022-02-20T19:08:08.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2022-02-20T19:10:35.000Z (over 4 years ago)
- Last Synced: 2025-10-29T14:50:41.914Z (8 months ago)
- Topics: concurrency, google, google-programmable-search-engine, google-search, library, parallel, pip, python, python-library, requests, web, web-api, web-request-handling, web-scraping
- Language: Python
- Homepage:
- Size: 55.7 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGES.txt
- License: LICENSE
Awesome Lists containing this project
README
# Potpourri
## Scraping API in Form of a Library
Potpourri is a Python library that scrapes HTML pages, searches google for the page, and all that hooya.
Install:
```
pip3 install git+git@github.com:Chubek/potpourri.git
OR
pip3 install git+https://github.com/Chubek/potpourri.git
```
Make sure to provide environment variables.
(I will write a fully-fledged documentation for this, meanwhile you can ask me on Discord as of how to use it Chubak#7400)