https://github.com/ivandonadello/semantic-pascal-part

This is a curated semantic version of the PASCAL-Part dataset for part-based object detection. Objects are aligned with WordNet and Yago concepts. The dataset is both in PASCAL-Voc and RDF format.

computer-vision dataset knowledge-graph object-detection ontologies ontology owl owl-ontology part-whole partof-ontology pascal-part pascal-voc rdf rdf-format rdflib semantic-web wordnet yago

Host: GitHub
URL: https://github.com/ivandonadello/semantic-pascal-part
Owner: ivanDonadello
Created: 2020-12-09T11:23:48.000Z (over 4 years ago)
Default Branch: main
Last Pushed: 2022-01-19T14:25:15.000Z (over 3 years ago)
Last Synced: 2025-05-01T12:05:21.147Z (2 months ago)
Topics: computer-vision, dataset, knowledge-graph, object-detection, ontologies, ontology, owl, owl-ontology, part-whole, partof-ontology, pascal-part, pascal-voc, rdf, rdf-format, rdflib, semantic-web, wordnet, yago
Language: Python
Homepage:
Size: 85.9 KB
Stars: 10
Watchers: 1
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md


# semantic PASCAL-Part
Semantic PASCAL-Part is the RDF version of the famous PASCAL-Part dataset used for object detection in Computer Vision. This original release joins Computer Vision with the Semantic Web, as the objects in the dataset are aligned with concepts from:
- the provided supporting ontology;
- the [WordNet](https://wordnet.princeton.edu/) database through its synsets;
- the [Yago](https://yago-knowledge.org/) ontology.

The provided Python 3 code (see the APIs below) can browse the dataset and convert it into an RDF knowledge graph. This format is intended to foster research in both the Semantic Web and Machine Learning fields.

## Differences with the original PASCAL-Part
The original [PASCAL-Part dataset](https://www.cs.stanford.edu/~roozbeh/pascal-parts/pascal-parts.html) contains objects labelled with classes of animals, vehicles, indoor objects and their parts. However, the part labels are very specific, e.g., “left lower leg” and “right hand”, and many applications of semantic image interpretation do not need such a fine-grained distinction. Therefore, we merged the image segments that refer to the same part into a single segment, e.g., two segments labelled “left lower leg” and “left front leg” of the same leg were merged into one segment labelled “leg”. Then, we converted the segments into bounding boxes.

## Structure of the semantic PASCAL-Part Dataset
Download the data [here](https://zenodo.org/record/5878773#.YegfKiwo-qB) and unzip it. The semantic PASCAL-Part dataset is organized as follows:
- `semanticPascalPart`: contains the refined images and annotations (e.g., small specific parts are merged into bigger parts) of the PASCAL-Part dataset in PASCAL VOC style.
    - `Annotations_set`: the test set annotations in `.xml` format. For further information, see the PASCAL VOC format [here](http://host.robots.ox.ac.uk/pascal/VOC/index.html).
    - `Annotations_trainval`: the train and validation set annotations in `.xml` format. For further information, see the PASCAL VOC format [here](http://host.robots.ox.ac.uk/pascal/VOC/index.html).
    - `JPEGImages_test`: the test set images in `.jpg` format.
    - `JPEGImages_trainval`: the train and validation set images in `.jpg` format.
    - `test.txt`: the 2416 image filenames in the test set.
    - `trainval.txt`: the 7687 image filenames in the train and validation set.
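
As a rough illustration of how the `.xml` annotations could be read without the provided code, the sketch below parses a hand-written sample with Python's standard `xml.etree.ElementTree`. It assumes the usual PASCAL VOC tags (`object`, `name`, `bndbox`, `xmin`, `ymin`, `xmax`, `ymax`); the sample content and the helper name `read_voc_objects` are illustrative and not taken from the dataset, so check a real annotation file before relying on it.

```python
import xml.etree.ElementTree as ET

# Hand-written sample in the usual PASCAL VOC layout (illustrative, not from the dataset).
SAMPLE_XML = """
<annotation>
  <filename>example.jpg</filename>
  <object>
    <name>Person</name>
    <bndbox><xmin>1</xmin><ymin>1</ymin><xmax>123</xmax><ymax>124</ymax></bndbox>
  </object>
  <object>
    <name>Leg</name>
    <bndbox><xmin>23</xmin><ymin>23</ymin><xmax>44</xmax><ymax>44</ymax></bndbox>
  </object>
</annotation>
"""


def read_voc_objects(xml_text):
    """Return a list of (class_name, (xmin, ymin, xmax, ymax)) tuples."""
    root = ET.fromstring(xml_text)
    objects = []
    for obj in root.findall("object"):
        name = obj.findtext("name")
        box = obj.find("bndbox")
        # Coordinates are read as floats first, since some VOC files store decimals.
        coords = tuple(
            int(float(box.findtext(tag))) for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        objects.append((name, coords))
    return objects


voc_objects = read_voc_objects(SAMPLE_XML)
print(voc_objects)
```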

## The PASCAL-Part Ontology
The PASCAL-Part OWL ontology formalizes, through logical axioms, the part-of relationship between whole objects (22 classes) and their parts (39 classes). The ontology contains 85 logical axioms in Description Logic, for example in the following form:
```
Every potted_plant has exactly 1 plant AND
has exactly 1 pot
```
We provide two versions of the ontology, with and without cardinality constraints, so that users can experiment with either. The WordNet alignment is encoded in the ontology as annotations. We further provide the `WordNet_Yago_alignment.csv` file with both WordNet and Yago alignments.
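
To make the cardinality axiom above concrete, here is a minimal sketch that checks a list of detected part classes against an "exactly n" constraint. The `EXACT_PARTS` dictionary is a hypothetical hand-written encoding of the `potted_plant` example, not something read from the OWL file, and `satisfies_cardinality` is an illustrative helper name.

```python
from collections import Counter

# Hypothetical encoding of:
#   Every potted_plant has exactly 1 plant AND has exactly 1 pot
EXACT_PARTS = {"potted_plant": {"plant": 1, "pot": 1}}


def satisfies_cardinality(whole_class, part_classes, constraints=EXACT_PARTS):
    """True if the observed parts match every 'exactly n' constraint of the whole."""
    required = constraints.get(whole_class, {})
    counts = Counter(part_classes)  # missing classes count as 0
    return all(counts[part] == n for part, n in required.items())


print(satisfies_cardinality("potted_plant", ["plant", "pot"]))
print(satisfies_cardinality("potted_plant", ["plant", "plant"]))
```

A strict check of this kind only makes sense with the cardinality-constrained version of the ontology; a whole class with no entry in the dictionary is treated as unconstrained.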

The ontology can be browsed with many Semantic Web tools such as:

- [Protégé](https://protege.stanford.edu/): a graphical tool for ontology modelling;
- [OWLAPI](http://owlapi.sourceforge.net/): Java API for manipulating OWL ontologies;
- [rdflib](https://rdflib.readthedocs.io/en/stable/): Python API for working with the RDF format;
- RDF stores: databases for storing and semantically retrieving RDF triples. See [here](https://www.w3.org/wiki/LargeTripleStores) for some examples.

## Provided code
We provide some Python 3 functions for parsing the dataset. Before loading the dataset, create an empty annotation object with:
```python
ann = PASCALPArt_annotations()
```
Then, load either the test or the trainval set with:
```python
ann.load_data(split="trainval")
```
Annotations are stored as a Python dictionary containing the file names of the images, the ids of the contained objects, and the bounding box coordinates along with the class name and the isPartOf/hasParts relationships. Here is an example:
```python
{
    "00001": {
        "1": {
            "class": "Person",
            "x_1": 1,
            "y_1": 1,
            "x_2": 123,
            "y_2": 124,
            "isPartOf": "",
            "hasParts": "2,3"
        },
        "2": {
            "class": "Leg",
            "x_1": 23,
            "y_1": 23,
            "x_2": 44,
            "y_2": 44,
            "isPartOf": "1"
        },
        "3": {
            "class": "Body",
            "x_1": 28,
            "y_1": 321,
            "x_2": 312,
            "y_2": 932,
            "isPartOf": "1"
        }
    }
}
```
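
Because each part-whole link is recorded in both directions (`hasParts` on the whole, `isPartOf` on the part), a quick sanity check is to verify that the two agree. The following sketch works on a plain dictionary shaped like the example above; `check_part_links` is an illustrative helper, not part of the provided code, and it assumes `hasParts` is a comma-separated string of ids.

```python
def check_part_links(objects):
    """Return (whole_id, part_id) pairs whose hasParts/isPartOf entries disagree."""
    problems = []
    for obj_id, obj in objects.items():
        # Parts in the example carry no "hasParts" key, so default to an empty string.
        raw = obj.get("hasParts", "")
        part_ids = [p.strip() for p in raw.split(",") if p.strip()]
        for part_id in part_ids:
            part = objects.get(part_id)
            if part is None or part.get("isPartOf") != obj_id:
                problems.append((obj_id, part_id))
    return problems


sample_objects = {
    "1": {"class": "Person", "isPartOf": "", "hasParts": "2,3"},
    "2": {"class": "Leg", "isPartOf": "1"},
    "3": {"class": "Body", "isPartOf": "1"},
}
print(check_part_links(sample_objects))
```

An empty list means every id listed under `hasParts` points back to its whole.
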
You can browse the annotation object with dedicated functions:
- `get_objects(filename)`: given the `filename` of an image, it returns a dictionary containing the objects in the image.
- `get_BB(filename, obj_id)`: given the `filename` of an image, it returns the bounding box coordinates of `obj_id`.
- `get_obj_class(filename, obj_id)`: given the `filename` of an image, it returns the ontology class of `obj_id` as a string.
- `get_isPartOf_id(filename, obj_id)`: given the `filename` of an image, it returns the id of the whole object that `obj_id` is part of.
- `get_whole_ids(filename, obj_id)`: given the `filename` of an image, it returns the list of ids of the part objects of `obj_id`.
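
To show how these accessors might be combined, the sketch below uses a small stand-in class over a plain dictionary. `DictAnnotations` is hypothetical: it only mimics the method names listed above, and the return types (for instance a 4-tuple from `get_BB`) are assumptions rather than the behaviour of the provided code.

```python
class DictAnnotations:
    """Hypothetical stand-in exposing accessors named like the documented ones."""

    def __init__(self, data):
        self.data = data

    def get_objects(self, filename):
        return self.data[filename]

    def get_BB(self, filename, obj_id):
        obj = self.data[filename][obj_id]
        # Assumed return shape: (x_1, y_1, x_2, y_2).
        return (obj["x_1"], obj["y_1"], obj["x_2"], obj["y_2"])

    def get_obj_class(self, filename, obj_id):
        return self.data[filename][obj_id]["class"]

    def get_isPartOf_id(self, filename, obj_id):
        return self.data[filename][obj_id].get("isPartOf", "")


def describe_parts(ann, filename):
    """List 'Part isPartOf Whole' strings for every part in one image."""
    sentences = []
    for obj_id in ann.get_objects(filename):
        whole_id = ann.get_isPartOf_id(filename, obj_id)
        if whole_id:  # an empty string marks a whole object
            part_class = ann.get_obj_class(filename, obj_id)
            whole_class = ann.get_obj_class(filename, whole_id)
            sentences.append(f"{part_class} isPartOf {whole_class}")
    return sentences


demo = DictAnnotations({
    "00001": {
        "1": {"class": "Person", "x_1": 1, "y_1": 1, "x_2": 123, "y_2": 124, "isPartOf": ""},
        "2": {"class": "Leg", "x_1": 23, "y_1": 23, "x_2": 44, "y_2": 44, "isPartOf": "1"},
    }
})
print(describe_parts(demo, "00001"))
```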

Finally, you can convert the annotation object into an RDF ontology:
```python
ann_rdf = ann.toRDF("trainval")
```
The function `toRDF` is built on the previous browsing functions. A single RDF file covering both the trainval and test sets can be created by instantiating the annotation object only once:
```python
ann = PASCALPArt_annotations()
ann.load_data(split="trainval")
ann.load_data(split="test")
ann_rdf = ann.toRDF()
```
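
For intuition about what such a conversion involves, here is a minimal sketch that turns an annotation dictionary into N-Triples lines using only string formatting. It is not the `toRDF` implementation: the `BASE` namespace, the `filename_id` naming of individuals, and the `to_ntriples` helper are all assumptions made for illustration.

```python
# Hypothetical namespace; the real ontology IRI may differ.
BASE = "http://example.org/semantic-pascal-part#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def to_ntriples(annotations):
    """Emit one rdf:type triple per object and one isPartOf triple per part."""
    lines = []
    for filename, objects in annotations.items():
        for obj_id, obj in objects.items():
            subject = f"<{BASE}{filename}_{obj_id}>"
            lines.append(f"{subject} <{RDF_TYPE}> <{BASE}{obj['class']}> .")
            whole_id = obj.get("isPartOf", "")
            if whole_id:  # whole objects have an empty isPartOf
                whole = f"<{BASE}{filename}_{whole_id}>"
                lines.append(f"{subject} <{BASE}isPartOf> {whole} .")
    return lines


sample = {
    "00001": {
        "1": {"class": "Person", "isPartOf": ""},
        "2": {"class": "Leg", "isPartOf": "1"},
    }
}
for line in to_ntriples(sample):
    print(line)
```

The resulting lines can be written to a `.nt` file and loaded by RDF tools that accept N-Triples.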

## Citing semantic PASCAL-Part
If you use semantic PASCAL-Part in your research, please use the following BibTeX entry:

```
@article{DBLP:journals/ia/DonadelloS16,
author = {Ivan Donadello and
Luciano Serafini},
title = {Integration of numeric and symbolic information for semantic image
interpretation},
journal = {Intelligenza Artificiale},
volume = {10},
number = {1},
pages = {33--47},
year = {2016}
}
```

## Papers using semantic PASCAL-Part
- Díaz-Rodríguez, N., Lamas, A., Sanchez, J., Franchi, G., Donadello, I., Tabik, S., ... & Herrera, F. (2022). "EXplainable Neural-Symbolic Learning (X-NeSyL) methodology to fuse deep learning representations with expert knowledge graphs: The MonuMAI cultural heritage use case." Information Fusion, 79, 58-83.
- Donadello, Ivan, Luciano Serafini, and Artur D'Avila Garcez. "Logic tensor networks for semantic image interpretation." Proceedings of the 26th International Joint Conference on Artificial Intelligence. 2017.
- Serafini, Luciano, Ivan Donadello, and Artur d'Avila Garcez. "Learning and reasoning in logic tensor networks: theory and application to semantic image interpretation." Proceedings of the Symposium on Applied Computing. 2017.
- Donadello, Ivan. Semantic image interpretation-integration of numerical data and logical knowledge for cognitive vision. Diss. University of Trento, 2018.
- Donadello, Ivan, and Luciano Serafini. "Mixing low-level and semantic features for image interpretation." European Conference on Computer Vision. Springer, Cham, 2014.
- Donadello, Ivan, and Luciano Serafini. "Integration of numeric and symbolic information for semantic image interpretation." Intelligenza Artificiale 10.1 (2016): 33-47.
- Donadello, Ivan. "Ontology Based Semantic Image Interpretation." 14th Conference of the Italian Association for Artificial Intelligence (AI*IA 2015).
