https://github.com/dbpedia/archivo
DBpedia Archivo - Augmented Ontology Archive powered by Databus
https://github.com/dbpedia/archivo
archiver dependency-management linked-open-data ontology-lookup-services ontology-metrics ontology-repository ontology-search ontology-services ontology-verification owl rdfs semantic-versioning
Last synced: 4 months ago
JSON representation
DBpedia Archivo - Augmented Ontology Archive powered by Databus
- Host: GitHub
- URL: https://github.com/dbpedia/archivo
- Owner: dbpedia
- License: agpl-3.0
- Created: 2020-04-23T22:29:25.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2024-08-26T08:39:23.000Z (almost 2 years ago)
- Last Synced: 2025-09-04T21:47:58.382Z (9 months ago)
- Topics: archiver, dependency-management, linked-open-data, ontology-lookup-services, ontology-metrics, ontology-repository, ontology-search, ontology-services, ontology-verification, owl, rdfs, semantic-versioning
- Language: Python
- Homepage: https://archivo.dbpedia.org/
- Size: 87 MB
- Stars: 25
- Watchers: 21
- Forks: 7
- Open Issues: 29
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Citation: CITATION.cff
Awesome Lists containing this project
README
# DBpedia Archivo
Archivo is an online ontology interface and augmented archive, that discovers, crawls, versions and archives ontologies on the [DBpedia Databus](https://databus.dbpedia.org/ontologies/). Each Databus Artifact represents one certain ontology and each version represents a new version of the ontology.
## Use Cases
* **Ontology-Backup:** From time to time ontologies are unavailable at their usual location in the web. In this case Archivo can be used as a simple backup to prevent failure of services.
* **Testing & Rating:** Archivo runs some test to check the usability of a ontology, for example parsing, licenses or consistency. For this Archivo introduced a [star-rating](#Stars).
Check out for a detailed view of the versions of each ontology with the test results.
## Accessing an Ontology (Version) via Archivo
### Simple
To download the latest version use
`http://archivo.dbpedia.org/download?o={ontology-URI}&f={file-extension}`
and Archivo redirects to the latest version that is backed up.
The format can be set by using `f={ttl,owl,nt}` (optional, default: owl) or by setting the Accept-Header (first one overwrites the latter).
**Example:**
`curl -L "http://archivo.dbpedia.org/download?o=http://advene.org/ns/cinelab/ld&f=ttl"`
downloads the latest version of the Cinelab ontology as Turtle file.
### Advanced
By using a GET request with the implemented formats (`application/rdf+xml`, `application/n-triples` or `text/turtle`) as Accept-Headers returns all information about all versions of this URI as RDF.
## Browse, Find and Search Ontologies included in Archivo
There are multiple options to find an ontology (snapshot):
* [Here](http://archivo.dbpedia.org/list) is a complete list of all ontologies in Archivo with their Databus Artifact and the URL of the latest Turtle-File of each.
* Enter your ontology-URI [here](http://archivo.dbpedia.org/info) and you get information about all the versions of the ontology archived in Archivo.
* Check out the the handy collections on the DBpedia Databus:
* A collection of the latest original files: [here](https://databus.dbpedia.org/jfrey/collections/archivo-latest-original-ontology-snapshots)
* A collecttion of the latest parsed files: [here](https://databus.dbpedia.org/jfrey/collections/archivo-latest-ontology-snapshots)
## Adding an Ontology to Archivo
Ontologies can be added to Archivo using the [add-service](http://archivo.dbpedia.org/add) of the frontend. But the ontology must fulfill two requirements to be added:
* The URI must be accessible and the RDF content of the ontology must be reachable via content negotiation from there in any of these formats: RDF+XML, N-Triples, Turtle
* The URI defined in the a owl:Ontology (or skos:ConceptScheme) triple must be the same as the one provided here. If that's not the case Archivo tries to handle the new URI just like the one entered.
## Star Rating
Archivo provides a basic star-rating (not to be confused with the 5 stars of linked data).
Baseline: The minimum requirements a Ontology should fulfill.
* All of the following criteria have to be fulfilled:
1) The non-information URI resolves to a machine readable format or a machine readable version of the ontology is deterministically discoverable by other common means.
2) Download was successful
3) Uses a common format implemented by Archivo (rdf+xml, turtle or n-triples)
4) At least one format was found that parses with no or few (negligible) syntactical warnings
* A proper ontology declaration was found using `rdf:type owl:Ontology` and some form of license could be detected. A high degree of heterogeneity is permissible for this star regarding the used property/subproperty as well as object:license URI (resolvable linked data or web link),xsd:stringorxsd:anyURI
If the ontology fulfills the baseline, it can earn two further stars by using good practises:
* We require a homogenized license declaration using `dct:license` as object property with a URI (not string or anyURI).
* We measure the compatibility with currently available reasoners such as Pellet/Stardog (more to follow) and run available tasks such as consistency checks and classification.
## Archivo Documentation
### Augmentation Files (on the Databus)
Archivo provides for each version different files:
| type-values | sub-cvs | Explaination |
| -------- | -------- | -------- |
| orig | | Snapshot of the original ontology file |
| parsed | | Files parsed by rapper, available as owl, nt and ttl |
| OOPS | | OOPS-report |
| generatedDocu | | human readable documentation by LODE |
| profile | | a profile check done by [profilechecker](https://github.com/stain/profilechecker)|
| shaclReport | validates={minLicense, goodLicense, lodeMetadata} | shacl-report as turtle file|
| meta | | a JSON file containing some meta info |
| pelletInfo | imports={FULL,NONE} | the pellet info report, with and without imports |
| pelletConsistency | imports={FULL,NONE} | the pellet consistency report, with and without imports |
| diff | axioms={old,new} | These file contain the added/deleted triples |
### Ontology Sources
Archivo uses four different sources to find potential ontologies:
1. Ontology Repositories: e.g ontologies listed in [LOV](https://lov.linkeddata.es/dataset/lov/)
2. Subjects, Predicates and Objects of Ontologies: Every SPO in an Ontology can lead to a potential new ontology, so Archivo can discover new vocabularies by analyzing already listed ontologies
3. [VOID](https://www.w3.org/TR/void/) Data: Search for new ontologies by looking at VoID metadata summaries describing used classes and properties (on the DBpedia Databus)
4. [User Suggestions](http://archivo.dbpedia.org/add)