https://github.com/scribblemaniac/wayback-save
A simple command-line utility save URLs to the Wayback Machine
https://github.com/scribblemaniac/wayback-save
cli geckodriver python selenium wayback-machine
Last synced: 2 months ago
JSON representation
A simple command-line utility save URLs to the Wayback Machine
- Host: GitHub
- URL: https://github.com/scribblemaniac/wayback-save
- Owner: scribblemaniac
- License: mit
- Created: 2017-03-17T23:36:16.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2018-08-05T05:07:29.000Z (almost 8 years ago)
- Last Synced: 2025-03-13T15:15:44.997Z (over 1 year ago)
- Topics: cli, geckodriver, python, selenium, wayback-machine
- Language: Python
- Size: 2.93 KB
- Stars: 3
- Watchers: 2
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Wayback Save
Wayback Save is simple command-line utility save URLs to the Wayback Machine.
### Prerequisites
Wayback Save requires [Python 3](https://www.python.org/), [Selenium for Python 3](http://docs.seleniumhq.org/), and [geckodriver](https://github.com/mozilla/geckodriver).
On macOS, these can be installed easily with [Homebrew](https://brew.sh/) using the following commands:
```
brew install python3 geckodriver
pip3 install selenium
```
### Usage
The usage is simply:
```
wayback-save URL [URL...]
```
### For Many Pages
This script is very slow and is only practical for a very small number of web pages. If you wish to archive full websites then [Archive Team's ArchiveBot](https://www.archiveteam.org/index.php?title=ArchiveBot) is a more practical solution for you. It can crawl websites with up to a few hundred thousand urls.