Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/matkoniecz/osm_wiki_tag_api

Obtains data from OSM Wiki infoboxes and data items, compares them and finds where edits are needed
https://github.com/matkoniecz/osm_wiki_tag_api

mediawiki openstreetmap openstreetmap-wiki parsing wikibase

Last synced: 13 days ago
JSON representation

Obtains data from OSM Wiki infoboxes and data items, compares them and finds where edits are needed

Awesome Lists containing this project

README

        

This repository contains

- [compare.py](compare.py) runnable by using `python3 compare.py` comparing info in data items and in infoboxes, listing mismatches and cases where infobox is not using its parameters but is silently displaying info from data item. It can be run to validate OSM Wiki and found some automatically detectable issues.
- [code](extract_infobox_data.py) to obtain relevant parameters from infobox of OSM Wiki tag page (OSM Wiki is a Mediawiki wiki, so standard tools such as `pywikibot` and `mwparserfromhell` were used)
- this code is fairly trivial, but may be useful for others as finding correct tool for job was not trivial
- [code](data_item.py) to obtain statements from data item (from Wikibase setup)
- this code is fairly trivial and is parsing in a weird way (as workaround for [bug](https://phabricator.wikimedia.org/T269635) that hopefully is now fixed as you read this), but finding even this way was not so easy so maybe it will be useful for someone
- mismatches often indicate that something is wrong, and use of data item instead of infobox parameters is unwanted. Especially in cases where someone deleted parameter from infobox ecause it was wrong but incorrect data from data item still haunts us.

# Limitations

If you need anything of things listed below - feel free to make a pull request!

Ideally, open an issue first if you want to add something not listed below.

* Full parsing of data items is missing.

* Missing documentation and proper tests

It is also existing as bunch of code and is not published as `pip` library. If OSM Wiki template extraction, data item extraction or other code would be useful for you in a library - please open an issue for that.

# Dependencies

Use Python command matching your install:

`python3.10 -m pip install -r requirements.txt`

# Alternatives

## Taginfo code (Ruby)

This can be used as an alternative way of getting parsed infoboxes if you need a Ruby code.

https://github.com/taginfo/taginfo/blob/master/sources/wiki/lib/mediawikiapi.rb

https://github.com/taginfo/taginfo/blob/master/sources/wiki/get_wiki_data.rb

```
require_relative 'mediawikiapi.rb'

api = MediaWikiAPI::API.new('/w/index.php?')
title = "Tag:highway=motorway"
returned = api.get({ :title => title, :action => 'raw' }).body
puts(returned)
```

# Published

Right now it is published on https://github.com/matkoniecz/osm_wiki_tag_api