https://github.com/jfilter/get-wayback-machine
Fetch a URL via the latest Wayback Machine snapshot
https://github.com/jfilter/get-wayback-machine
wayback-machine web-scraping
Last synced: 30 days ago
JSON representation
Fetch a URL via the latest Wayback Machine snapshot
- Host: GitHub
- URL: https://github.com/jfilter/get-wayback-machine
- Owner: jfilter
- License: mit
- Created: 2018-12-14T16:55:54.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2022-12-09T05:18:20.000Z (over 2 years ago)
- Last Synced: 2025-06-10T08:07:45.938Z (about 1 month ago)
- Topics: wayback-machine, web-scraping
- Language: Python
- Homepage:
- Size: 19.5 KB
- Stars: 4
- Watchers: 2
- Forks: 1
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# get-wayback-machine [](https://travis-ci.com/jfilter/get-wayback-machine) [](https://pypi.org/project/get-wayback-machine/) [](https://pypi.org/project/get-wayback-machine/)
Fetch a URL via the latest Wayback Machine Snapshot.
## Why?
Occasionally, you have a given URL that it is not online anymore. You may still access it's content via the Internet Archive's [Wayback Machine](https://archive.org/web/).
## Install
```bash
pip install get_wayback_machine
```## Usage
```python
import get_wayback_machineresponse = get_wayback_machine.get('https://en.wikipedia.org')
if response:
print(response.status_code)
```The response is either `None` (for fails) or a [Requests](http://docs.python-requests.org/en/master/) response. This module uses [get-retries](https://github.com/jfilter/get-retries) internally to fetch the data.
## Related
- https://github.com/hartator/wayback-machine-downloader
- https://github.com/sangaline/wayback-machine-scraper
- https://github.com/jsvine/waybackpack## License
MIT.