https://github.com/chubek/potpourri

Parallelized web scraping library
https://github.com/chubek/potpourri

concurrency google google-programmable-search-engine google-search library parallel pip python python-library requests web web-api web-request-handling web-scraping

Last synced: about 1 month ago
JSON representation

Parallelized web scraping library

Host: GitHub
URL: https://github.com/chubek/potpourri
Owner: Chubek
License: other
Created: 2022-02-20T19:08:08.000Z (over 4 years ago)
Default Branch: master
Last Pushed: 2022-02-20T19:10:35.000Z (over 4 years ago)
Last Synced: 2025-10-29T14:50:41.914Z (9 months ago)
Topics: concurrency, google, google-programmable-search-engine, google-search, library, parallel, pip, python, python-library, requests, web, web-api, web-request-handling, web-scraping
Language: Python
Homepage:
Size: 55.7 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGES.txt
- License: LICENSE

Awesome Lists containing this project

README

# Potpourri

## Scraping API in Form of a Library

Potpourri is a Python library that scrapes HTML pages, searches google for the page, and all that hooya.

Install:
```
pip3 install git+git@github.com:Chubek/potpourri.git
OR
pip3 install git+https://github.com/Chubek/potpourri.git
```

Make sure to provide environment variables.

(I will write a fully-fledged documentation for this, meanwhile you can ask me on Discord as of how to use it Chubak#7400)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/chubek/potpourri

Awesome Lists containing this project

README