https://github.com/culturecreates/artsdata-planet-gtq
Import pipeline from Grand Théâtre de Québec to Artsdata
https://github.com/culturecreates/artsdata-planet-gtq
artsdata import planet
Last synced: 5 days ago
JSON representation
Import pipeline from Grand Théâtre de Québec to Artsdata
- Host: GitHub
- URL: https://github.com/culturecreates/artsdata-planet-gtq
- Owner: culturecreates
- Created: 2023-08-28T01:37:11.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2026-06-05T05:43:17.000Z (6 days ago)
- Last Synced: 2026-06-05T07:34:28.322Z (6 days ago)
- Topics: artsdata, import, planet
- Language: Ruby
- Homepage:
- Size: 12.7 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Grand Théâtre de Québec Website data pipelines
https://grandtheatre.qc.ca
## Mapping to Artsdata event types
This repo contains a manually created taxonomy [gtq-event-type-mapping.ttl](https://github.com/culturecreates/artsdata-planet-gtq/blob/main/gtq-event-type-mapping.ttl) to map strings from the original grandtheatre.qc.ca website to Artsdata event types. To push the taxonomy to Artsdata you can manually run the Github workflow in this repo.
Check for missing event type mapping using this [all keywords](https://s.zazuko.com/op7aN2) or [ event type mapping](https://s.zazuko.com/2DndLw9). If there is no closeMatch or relatedMatch then one should be added. To look for Artsdasta Event Types you can use this [Artsdata event types query](https://s.zazuko.com/k9sKGH). Edit the mappings in [gtq-event-type-mapping.ttl](https://github.com/culturecreates/artsdata-planet-gtq/blob/main/gtq-event-type-mapping.ttl) and pushed to Artsdata using the automatic workflow in this repo.
## Input to Artsdata
This website is crawled by an agent on the [Artsdata Huginn platform](https://huginn-staging.herokuapp.com/scenarios/26/diagram).
Here is the summary of the GTQ pipeline in Huginn:
1. Crawl events listed on https://grandtheatre.qc.ca/programmation/
2. Extract JSON-LD from each webpage
3. Transform with the following SPARQLs
'remove-blanks',
'fix-schemaorg-https-objects',
'fix-wikidata-uri',
'add-artsdata-uri-using-wikidata-bridge',
'fix-schemaorg-date-datatype',
'create-eventseries',
'copy-subevent-data-to-eventseries',
'fix-isni',
'add-artsdata-uri-using-isni-bridge',
'collapse_duplicate_contact_point_blanknodes'
1. Load graph into Artsdata using Artsdata Databus API
### Compare event images between original website and Artsdata
https://api.artsdata.ca/events?source=http://kg.artsdata.ca/culture-creates/huginn/derived-grandtheatre-qc-ca
https://grandtheatre.qc.ca/programmation/
## Output from Artsdata
Nothing is outputed from Artsdata for use by GTQ at this time.