https://github.com/elliotgao2/htmlfetcher
No pain HTML fetching library.
https://github.com/elliotgao2/htmlfetcher
ajax fetcher python selenium
Last synced: 5 months ago
JSON representation
No pain HTML fetching library.
- Host: GitHub
- URL: https://github.com/elliotgao2/htmlfetcher
- Owner: elliotgao2
- Created: 2018-02-28T07:13:24.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2018-03-02T13:35:00.000Z (over 8 years ago)
- Last Synced: 2025-09-29T16:55:23.113Z (9 months ago)
- Topics: ajax, fetcher, python, selenium
- Language: Python
- Size: 2.15 MB
- Stars: 2
- Watchers: 3
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# HTML Parsing
No pain HTML fetching library.
A light wrapper of selenium.
For page with a lot of ajax.
## Installation
```python
pip install htmlfetcher
```
download browser: [https://github.com/mozilla/geckodriver/releases](https://github.com/mozilla/geckodriver/releases)
## Usage
```python
from htmlfetcher import HTMLFetcher
fetcher = HTMLFetcher(browser='/home/bug/桌面/geckodriver')
text = fetcher.get('http://zhaopin.baidu.com/quanzhi?tid=4139&ie=utf8&oe=utf8&query=python&city_sug=%E6%88%90%E9%83%BD')
fetcher.close()
print(text)
```