Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jmcarp/robobrowser
https://github.com/jmcarp/robobrowser
Last synced: about 2 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/jmcarp/robobrowser
- Owner: jmcarp
- License: bsd-3-clause
- Created: 2014-02-08T21:29:49.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2020-09-10T18:41:47.000Z (almost 4 years ago)
- Last Synced: 2024-04-25T15:43:47.467Z (about 2 months ago)
- Language: Python
- Size: 563 KB
- Stars: 3,693
- Watchers: 111
- Forks: 340
- Open Issues: 59
-
Metadata Files:
- Readme: README.rst
- Changelog: HISTORY.rst
- Contributing: docs/contributing.rst
- License: LICENSE
Lists
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python-cn - RoboBrowser
- awesome-crawler - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Python)
- Python-Awesome - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- Awesome-Python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling & Web Scraping)
- awesome-fullstack - Robo Browser
- awesome-fullstack - Robo Browser
- python-awesome-case1 - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- fucking-awesome-python - :octocat: robobrowser - :star: 3680 :fork_and_knife: 343 - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-stars - jmcarp/robobrowser - (Python)
- awesome-python-master - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome_python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling & Web Scraping)
- awesome-stars - robobrowser
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python-cn - RoboBrowser
- join-awesome-python-interview-topics - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling & Web Scraping)
- awesome-stars - jmcarp/robobrowser - (Python)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python-cn - 官网
- awesome-python-clone - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python4 - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python-resources-all - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling & Web Scraping)
- awesome-stars - jmcarp/robobrowser - (Python)
- fucking-awesome-python - :octocat: robobrowser - :star: 3495 :fork_and_knife: 328 - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- my-awesome-github-stars - jmcarp/robobrowser - (Python)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python-resources - GitHub - 58% open · ⏱️ 07.06.2015): (HTML 处理)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-crawlers - RoboBrowser - 08-23 | A simple, Pythonic library for browsing the web without a standalone web browser. | (All)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-stars - jmcarp/robobrowser - (Python)
- awesome-stars - jmcarp/robobrowser - (Python)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling & Web Scraping)
- awesome_python_with_star - jmcarp/robobrowser
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python-cn - 官网
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- git-github.com-vinta-awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling & Web Scraping)
- awesome-python-master - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- python-awesome - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesomePython - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling & Web Scraping)
- awesome-python-zh - robobrowser - 一个简单的Pythonic库,用于在没有独立web浏览器的情况下浏览web。 (Web爬行)
- my-awesome-stars - jmcarp/robobrowser - (Python)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- fucking_awesome_python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- Mpaperlee-awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-crawler-cn - RoboBrowser - 一个简单的,不基于Web浏览器的基于Python的Web 浏览器. (Python)
- awesome_python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling & Web Scraping)
- awesome-crawler - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Python)
- starred-awesome - robobrowser - (Python)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling)
- awesome-python - RoboBrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling & Web Scraping)
- awesome-python - robobrowser - A simple, Pythonic library for browsing the web without a standalone web browser. (Web Crawling & Web Scraping)
- awesome-stars - jmcarp/robobrowser - `★3700` (Python)
- my-awesome-stars - jmcarp/robobrowser - (Python)
- awesome-python - robobrowser - Millions of developers and companies build, ship, and maintain their software on GitHub — the largest and most advanced development platform in the world. ` 📝 6 years ago ` (Web Crawling [🔝](#readme))
README
RoboBrowser: Your friendly neighborhood web scraper
===============================================.. image:: https://badge.fury.io/py/robobrowser.png
:target: http://badge.fury.io/py/robobrowser.. image:: https://travis-ci.org/jmcarp/robobrowser.png?branch=master
:target: https://travis-ci.org/jmcarp/robobrowser.. image:: https://coveralls.io/repos/jmcarp/robobrowser/badge.png?branch=master
:target: https://coveralls.io/r/jmcarp/robobrowserHomepage: `http://robobrowser.readthedocs.org/ `_
RoboBrowser is a simple, Pythonic library for browsing the web without a standalone web browser. RoboBrowser
can fetch a page, click on links and buttons, and fill out and submit forms. If you need to interact with web services
that don't have APIs, RoboBrowser can help... code-block:: python
import re
from robobrowser import RoboBrowser# Browse to Genius
browser = RoboBrowser(history=True)
browser.open('http://genius.com/')# Search for Porcupine Tree
form = browser.get_form(action='/search')
form #
form['q'].value = 'porcupine tree'
browser.submit_form(form)# Look up the first song
songs = browser.select('.song_link')
browser.follow_link(songs[0])
lyrics = browser.select('.lyrics')
lyrics[0].text # \nHear the sound of music ...# Back to results page
browser.back()# Look up my favorite song
song_link = browser.get_link('trains')
browser.follow_link(song_link)# Can also search HTML using regex patterns
lyrics = browser.find(class_=re.compile(r'\blyrics\b'))
lyrics.text # \nTrain set and match spied under the blind...RoboBrowser combines the best of two excellent Python libraries:
`Requests `_ and
`BeautifulSoup `_.
RoboBrowser represents browser sessions using Requests and HTML responses
using BeautifulSoup, transparently exposing methods of both libraries:.. code-block:: python
import re
from robobrowser import RoboBrowserbrowser = RoboBrowser(user_agent='a python robot')
browser.open('https://github.com/')# Inspect the browser session
browser.session.cookies['_gh_sess'] # BAh7Bzo...
browser.session.headers['User-Agent'] # a python robot# Search the parsed HTML
,
browser.select('div.teaser-icon') # [
# ...
browser.find(class_=re.compile(r'column', re.I)) #
#