https://github.com/dlenski/wp2git
Downloads and imports Wikipedia page histories to a git repository
https://github.com/dlenski/wp2git
git mediawiki wikipedia-dump
Last synced: 3 months ago
JSON representation
Downloads and imports Wikipedia page histories to a git repository
- Host: GitHub
- URL: https://github.com/dlenski/wp2git
- Owner: dlenski
- License: other
- Fork: true (CyberShadow/wp2git)
- Created: 2014-10-14T03:19:13.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2024-12-14T17:20:24.000Z (4 months ago)
- Last Synced: 2024-12-14T18:27:11.727Z (4 months ago)
- Topics: git, mediawiki, wikipedia-dump
- Language: Python
- Homepage: http://blog.thecybershadow.net/2010/06/16/import-wikipedia-page-history-to-git/
- Size: 69.3 KB
- Stars: 33
- Watchers: 4
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
wp2git
======This program allows you to download and convert any Wikipedia article's history to a `git` repository, for easy browsing, [annotation](https://git-scm.com/docs/git-annotate),
and [bisecting](https://git-scm.com/docs/git-annotate) (etc.) of older revisions.### Requirements
Requires Python 3.x and `git` accessible in your `PATH`, and the [`mwclient` package](https://github.com/mwclient/mwclient)
(which will be auto-installed by `pip`).### Quick installation
For the latest release, install with:
```
pip3 install https://github.com/dlenski/wp2git/archive/v2.0.zip
```For the latest development build, install with
```
pip3 install https://github.com/dlenski/wp2git/archive/master.zip
```### Usage
```
$ wp2git [--lang XY] article_name
````wp2git` will create a directory, in which a new `git` repository will be created.
The repository will contain a single file named `article_name.mw`, along with the entire edit history
of that article on `XY.wikipedia.org`. (If unspecified, the default language is guessed according to
your locale.)Use `wp2git --help` to show more options.
### Entirely based on
[CyberShadow's version](https://github.com/CyberShadow/wp2git) written in the D language.