Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://davidoesch.github.io/geoservice_harvester_poc/

Open Geoservice scraper proof of concept to extract info of each dataset contained in an OGC compliant Geoservice
https://davidoesch.github.io/geoservice_harvester_poc/

Last synced: 17 days ago
JSON representation

Open Geoservice scraper proof of concept to extract info of each dataset contained in an OGC compliant Geoservice

Awesome Lists containing this project

README

        

[![GitHub commit](https://img.shields.io/github/last-commit/davidoesch/geoservice_harvester_poc)](https://github.com/davidoesch/geoservice_harvester_poc/commits/master)
[![Run scraper](https://github.com/davidoesch/geoservice_harvester_poc/actions/workflows/run-scraper.yml/badge.svg)](https://github.com/davidoesch/geoservice_harvester_poc/actions/workflows/run-scraper.yml)

# Geoservice Harvester POC open geo services reported by the Swiss Gov Agencies and Third parties

### Important note
> Find and use **high quality data published by our data colleagues of the [SWISS FEDERAL ADMINSTRATION](https://www.geo.admin.ch) [CANTONS](https://www.geodienste.ch)** for all Cantons and FL:
> 1. GeoServiceHarvesterPOC visualized (Dashboard): https://davidoesch.github.io/geoservice_harvester_poc
> 2. geo.admin.ch visualized (Dashboard): https://docs.google.com/spreadsheets/d/1QdxYv6RYWe9PFIq5XQ-BNLnKfgtFcDkfZ58ftZ_Va9I/edit?usp=sharing
> 3. all published as 'open government data': https://opendata.swiss/de/dataset/?groups=geography

## Aim of this repository

The aim of this repository is to provide a POC to open OGC Compliant geodata services provided by the Swiss Confederation, Cantons Municipalities, the Principality of Liechtenstein and third parties. Inspired by the pre-POC [wmsChecker](https://github.com/davidoesch/wmschecker) and driven by the [ 4th Geounconference Workshop: Sujet / Thema 16 –– Service-Verzeichnis](https://github.com/GeoUnconference/discussions/discussions/38). For key findings, dive into the [ blog post](https://www.linkedin.com/pulse/geoharvesting-unexploiting-seo-treasures-geoservices-david-oesch/). Updates of services are infrequent.

If you have any questions, please don't hestitate to contact us:

- https://twitter.com/davidoesch (follow us, we send you a private Direct Message, thanks!)

## Datasets of all services : geoservices_CH.csv

**General description**

This data is generated and validated weekly using automated procedures. Note that we only publish data that are OGC compliant. Thus, gaps might be the case.

**Data**

>**https://github.com/davidoesch/geoservice_harvester_poc/blob/main/data/geoservices_CH.csv**

>*Description:* Data description for each layer separately

>*Spatial unit:* Swiss cantons and Principality of Liechtenstein covered

>*Updated:* weekly

>*Format:* csv

>*Additional remark*: )

| Field Name | Description | Format | Note |
| ----------- | ---------------------------------------------------------------------------------------------------------------- | --------------------------- | --------------------------- |
| OWNER | Both Owner type and Owner Name are reflected,they corespond as well to the corrsponing py file with scraper info.The owner of the data, likely an organization or agency that created or manages the data | Text | |
| TITLE | The title of the dataset, which provides a brief description of what the data represents. | Text | |
| NAME | The name or identifier for this particular data layer or feature within the larger dataset | Text | |
| TREE | Layertree derived from service info.A hierarchical category or grouping for this data layer | Text , tree separator „/“ | |
| GROUP | Group Name of agreggated datasets. A grouping or category for this data layer, which may be used for organization or visualization purposes | Text | only applicable for WMS |
| ABSTRACT | A brief summary or abstract of the data, which may provide additional context or details beyond the title. | Text | |
| KEYWORDS | A list of relevant keywords or tags associated with this data layer, which can help users discover and filter relevant data. | keywords, commaseparated | |
| LEGEND | Link to legend for the symbols or colors used to represent this data layer, which can aid in interpreting the data. | URL to Image | |
| CONTACT |The contact information for the individual or organization responsible for maintaining or providing access to the da | Text, email | |
| SERVICELINK | Link to Get Capabilities | URL to Image | |
| METADATA | A link or URL to additional metadata or documentation about the data, such as data dictionaries or technical specifications | URL | |
| UPDATE | Publication Date | Text | |
| SERVICETYPE | OGC Service type | Text | WMTS WMS and WFS |
| MAX ZOOM | Zoom level (mapzoom) on which the data is visible | Int | |
| CENTER\_LAT | Lat center of data WGS84 | Float | |
| CENTER\_LON | Lon center of data WGS84 | Float | |
| MAPGEO | permalink to map.geo.admin.ch | URL | |
| BBOX | The bounding box or extent of this data layer, represented as a list of four coordinates in the order west, south, east, and north. | Textt | |

## Unified data : geodata_CH.csv

**General description**

This data is generated and validated weekly using automated procedures based on geoservices_CH.csv (1NF / normalization). Note that we only publish data that are OGC compliant. Thus, gaps might be the case.

**Data**

>**https://github.com/davidoesch/geoservice_harvester_poc/blob/main/data/geodata_CH.csv**

>*Description:* This dataset has been aggregated to include data from all of its different services. This means that the information is now more comprehensive and includes all relevant occurrences from each of the services that contribute to the dataset.

>*Spatial unit:* Swiss cantons and Principality of Liechtenstein covered

>*Updated:* weekly

>*Format:* csv

>*Additional remark*: )

| Field Name | Description | Format | Note |
| ----------- | ---------------------------------------------------------------------------------------------------------------- | --------------------------- | --------------------------- |
| OWNER | Both Owner type and Owner Name are reflected,they corespond as well to the corrsponing py file with scraper info | Text | |
| TITLE | Title of the dataset | Text | |
| NAME | Name of the dataset | Text | |
| MAPGEO | permalink to map.geo.admin.ch | URL | |
| CONTACT | Contact info | Text, email | |
| WMSGetCap | Link to WMSGetCap | Link | |
| WMTSGetCap | Link to WMTSGetCap | Link | |
| WFSGetCap | Link to WFSGetCap | Link | |

## Dataset title : geodata_simple_CH.csv

**General description**

This data is generated and validated weekly using automated procedures based on geodata_CH.csv. Note that we only publish data that are OGC compliant. Thus, gaps might be the case.

**Data**

>**https://github.com/davidoesch/geoservice_harvester_poc/blob/main/data/geodata_simple_CH.csv**

>*Description:* This dataset contains only the title of the datset and a link to map.geo.admin.ch. (no link if only WFS is availbale) . It's sole pupose was to serve as source for https://davidoesch.github.io/geoservice_harvester_poc

>*Spatial unit:* Swiss cantons and Principality of Liechtenstein covered

>*Updated:* weekly

>*Format:* csv

>*Additional remark*: will be decomissioned soon, since it is not needed)

| Field Name | Description | Format | Note |
| ----------- | ---------------------------------------------------------------------------------------------------------------- | --------------------------- | --------------------------- |
| OWNER | Both Owner type and Owner Name are reflected,they corespond as well to the corrsponing py file with scraper info | Text | |
| TITLE | Title of the dataset | Text | |
| MAPGEO | permalink to map.geo.admin.ch | URL | in case of a WFS only: link will not work

## Dataset title : geodata_stats_CH.csv

**General description**

This data is generated and validated weekly using automated procedures based on geoservices_CH.csv and is used to show overall [quality stats parameter](https://davidoesch.github.io/geoservice_harvester_poc/#anchor-QualityControl)

**Data**

>**https://github.com/davidoesch/geoservice_harvester_poc/blob/main/data/geodata_stats_CH.csv**

>*Description:* This dataset contains OWNER and the total number datasets and the corresponing completness / existence of parameters. It's sole pupose is to serve as source for https://davidoesch.github.io/geoservice_harvester_poc/#anchor-QualityContro

>*Spatial unit:* Swiss cantons and Principality of Liechtenstein covered

>*Updated:* weekly

>*Format:* csv

>*Additional remark*: )

## How to fix / add additonal WMS WMTS Services
1. Fix / add your service to [sources.csv](https://github.com/davidoesch/geoservice_harvester_poc/sources.csv) following the OWNER Naming Convention and URL (only https) to the service endpoint
2. make a pull request

## Current status Provider & Services

| Provider | Status | Notes |
|:---------: |-------- |------- |
| AG | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS 'ok') | Source: https://www.ag.ch/de/verwaltung/dfr/geoportal/geodienste-(wms) |
| AI | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS 'ok') | Source: https://www.ai.ch/themen/planen-und-bauen/geodaten-und-plaene/geobasisdaten some layers do cover AI AR SG |
| AR | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS 'ok') | Source: reverse engineered from AI some layers do cover AI AR SG |
| BE | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS 'ok') | Source https://www.agi.dij.be.ch/de/start/geoportal/geodienste/angebot-an-geodiensten.html FR to be done |
| BL | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WMTS 'ok') | Source: https://www.baselland.ch/politik-und-behorden/direktionen/volkswirtschafts-und-gesundheitsdirektion/amt-fur-geoinformation/geoportal/geodienste |
| BS | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS-WMTS 'ok') | Source: https://www.geo.bs.ch/geodaten/geodienste.html |
| FR | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS 'ok') |Source https://geo.fr.ch/ags/rest/services |
| GE | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS 'ok') | Source: https://ge.ch/sitg/services/services-carto/open-data, 3 datasets can not be parsed, see error log in /tools |
| GL | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS 'ok') | Source https://www.gl.ch/verwaltung/bau-und-umwelt/hochbau/raumentwicklung-und-geoinformation/geoportal-kanton-glarus.html/808 Drops WFS warnings, seems to be ok see error log in /tools |
| GR | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS 'ok') | Source https://geo.gr.ch/geodienste/katalog |
| JU | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS-WMTS 'ok') |Source: reverse engineered from the once working NE, no love for WebmerCator |
| LU | ![ok](https://placehold.jp/de77ae/000000/200x50.png?text=no-service 'ok') | |
| NE | ![ok](https://placehold.jp/de77ae/000000/200x50.png?text=no-service 'ok') | |
| NW | ![ok](https://placehold.jp/de77ae/000000/200x50.png?text=no-service 'ok') | |
| OW | ![ok](https://placehold.jp/de77ae/000000/200x50.png?text=no-service 'ok') | |
| SG | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS-WMTS 'ok') |Source: extracted the PDf from https://www.sg.ch/bauen/geoinformation/gi/geodienste.html |
| SH | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS 'ok') | Soiurce: https://sh.ch/CMS/Webseite/Kanton-Schaffhausen/Beh-rde/Verwaltung/Volkswirtschaftsdepartement/Amt-f-r-Geoinformation-2303920-DE.html |
| SO | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS-WMTS 'ok') | Source: https://so.ch/verwaltung/bau-und-justizdepartement/amt-fuer-geoinformation/geoportal/geodienste/wmts-web-map-tile-service/ |
| SZ | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS-WMTS 'ok') |Source: https://www.sz.ch/behoerden/vermessung-geoinformation/geoportal/daten-und-dienste.html/72-416-414-1762-1761 |
| TG | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS 'ok') | Source: scraped them from opendata.swiss API with https://ckan.opendata.swiss/api/3/action/package_search?fq=organization:kanton-thurgau%20AND%20res_format:WMS&rows=10000 and https://ckan.opendata.swiss/api/3/action/package_search?fq=organization:kanton-thurgau%20AND%20res_format:WFS&rows=10000 |
| TI | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS-WMTS 'ok') |Source https://www4.ti.ch/dt/sg/sai/ugeo/temi/geoportale-ticino/geoportale/geoservizi/ |
| UR | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS 'ok') | Source: https://oereb.ur.ch/?basemap=AV&lat=46.87491213706447&lng=8.645001065327628&zoom=13.75 sources and www.geo.ur.ch |
| VD | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS 'ok') |Source: https://www.ogc.vd.ch/public/services/OGC/wmsVD/Mapserver/WMSServer? |
| VS | ![ok](https://placehold.jp/de77ae/000000/200x50.png?text=no-service 'nok') | |
| ZG | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS-WMTS 'ok') | Source https://www.zg.ch/behoerden/direktion-des-innern/geoportal/geodaten-einbinden |
| ZH | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS 'ok') | Source https://ckan.opendata.swiss/api/3/action/package_search?fq=organization:geoinformation-kanton-zuerich%20AND%20res_format:WMS&rows=10000 and https://ckan.opendata.swiss/api/3/action/package_search?fq=organization:geoinformation-kanton-zuerich%20AND%20res_format:WFS&rows=10000 |
| LI | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS 'ok') | Source https://www.llv.li/inhalt/11694/amtsstellen/internet-kartendienst-geowebservices |
| Geodienste |![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WFS 'ok') | Source: Source https://github.com/geoadmin/mf-geoadmin3/blob/master/src/js/ImportController.js FR IT to be done |
| Bund | ![ok](https://placehold.jp/b8e186/000000/200x50.png?text=WMS-WMTS 'ok') | Source: Source https://www.geo.admin.ch/de/geo-dienstleistungen/geodienste/darstellungsdienste-webmapping-webgis-anwendungen.html FR IT EN RM to be done |

## Operation
Automated daily run of [scraper.py](https://github.com/davidoesch/geoservice_harvester_poc/scraper.py) via GithubAction [scheduler](https://github.com/davidoesch/geoservice_harvester_poc/blob/main/.github/workflows/scheduler-scraper.yml). The scraper results are logged in [debug.log](https://github.com/davidoesch/geoservice_harvester_poc/blob/main/tools/debug.log), faulty or offline services in [sources.csv](https://github.com/davidoesch/geoservice_harvester_poc/blob/main/sources.csv) are logged in [tools](https://github.com/davidoesch/geoservice_harvester_poc/tree/main/tools). Harvested data in [geoservices_CH.csv](https://github.com/davidoesch/geoservice_harvester_poc/blob/main/data/geoservices_CH.csv)

## Roadmap and Ideas
Are collected in [Issues](https://github.com/davidoesch/geoservice_harvester_poc/issues)