https://github.com/macbre/mediawiki-tags-cloud
Generates a tag cloud from a MediaWiki XML content dump
- Host: GitHub
- URL: https://github.com/macbre/mediawiki-tags-cloud
- Owner: macbre
- License: MIT
- Created: 2018-11-18T17:48:14.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2022-12-09T05:26:26.000Z (over 3 years ago)
- Last Synced: 2025-01-03T14:50:31.057Z (over 1 year ago)
- Topics: fandom, tag-cloud, wikia, wikipedia, wikipedia-dump
- Language: Python
- Size: 2.38 MB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# mediawiki-tags-cloud
Generates a tag cloud from a [MediaWiki XML content dump](https://github.com/macbre/mediawiki-dump) using the [wordcloud module](https://github.com/amueller/word_cloud).
### Gallery
#### [The Muppet Wiki](http://muppet.wikia.com)

#### [Poznańska Wiki](http://poznan.wikia.com)

#### [GTA Wiki](http://gta.wikia.com)

#### [Polish Wikipedia](https://pl.wikipedia.org)

### Install
```
pip install pipenv
pipenv install
```
### Example
Generate a tag cloud for [The Muppet Wiki](http://muppet.wikia.com):
```
python frequencies_from_dump.py | tee muppet.json
python cloud_from_frequencies.py muppet.json
```
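The format of the intermediate JSON file is not described here. As a minimal sketch, assuming it is a plain mapping from word to occurrence count, the frequency step could look roughly like the code below. The `word_frequencies` helper, its `min_length` parameter, and the sample `pages` list are hypothetical and are not taken from `frequencies_from_dump.py`; only the standard library is used.

```python
import json
import re
from collections import Counter


def word_frequencies(texts, min_length=3):
    """Count lowercase words that are at least min_length characters long."""
    counter = Counter()
    for text in texts:
        # Split on non-word characters; \w also matches non-ASCII letters.
        words = re.findall(r"\w+", text.lower())
        counter.update(word for word in words if len(word) >= min_length)
    return dict(counter)


# Hypothetical article texts standing in for pages read from a dump.
pages = [
    "Kermit the Frog hosts The Muppet Show.",
    "Miss Piggy adores Kermit.",
]

frequencies = word_frequencies(pages)

# Assumed output shape: a JSON object mapping each word to its count.
print(json.dumps(frequencies, indent=2, sort_keys=True))
```

A mapping of this shape is what the wordcloud module's `WordCloud.generate_from_frequencies` method accepts, so a file like `muppet.json` can be loaded with `json.load` and passed to it directly, provided the assumption about the file format holds.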
### Troubleshooting
If you get `AttributeError: python: undefined symbol: archive_errno`, please refer to the [`mediawiki-dump` package README](https://github.com/macbre/mediawiki-dump#dependencies).