{"id":21180281,"url":"https://github.com/SANSA-Stack/SANSA-Stack","last_synced_at":"2025-07-09T23:32:13.081Z","repository":{"id":14044166,"uuid":"45770657","full_name":"SANSA-Stack/SANSA-Stack","owner":"SANSA-Stack","description":"Big Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/","archived":false,"fork":false,"pushed_at":"2025-06-13T22:30:42.000Z","size":68349,"stargazers_count":145,"open_issues_count":40,"forks_count":30,"subscribers_count":20,"default_branch":"develop","last_synced_at":"2025-06-13T23:26:04.851Z","etag":null,"topics":["apache-jena","apache-spark","distributed-computing","flink","rdf","semantic-web","spark"],"latest_commit_sha":null,"homepage":"http://sansa-stack.net","language":"Scala","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/SANSA-Stack.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2015-11-08T07:07:12.000Z","updated_at":"2025-05-03T09:48:00.000Z","dependencies_parsed_at":"2023-02-19T10:45:22.548Z","dependency_job_id":"eec93bbd-c09b-4db5-bcba-c8f5bae974dc","html_url":"https://github.com/SANSA-Stack/SANSA-Stack","commit_stats":{"total_commits":4618,"total_committers":49,"mean_commits":94.24489795918367,"dds":0.7821567778258987,"last_synced_commit":"433eee59d35b201757d93ac7604a2658faec1367"},"previous_names":["sansa-stack/sansa-rdf"],"tags_count":27,"template":false,"template_full_name":null,"purl":"pkg:github/SANSA-Stack/SANSA-Stack","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SANSA-Stack%2FSANSA-Stack","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SANSA-Stack%2FSANSA-Stack/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SANSA-Stack%2FSANSA-Stack/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SANSA-Stack%2FSANSA-Stack/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/SANSA-Stack","download_url":"https://codeload.github.com/SANSA-Stack/SANSA-Stack/tar.gz/refs/heads/develop","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/SANSA-Stack%2FSANSA-Stack/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":263440962,"owners_count":23467019,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apache-jena","apache-spark","distributed-computing","flink","rdf","semantic-web","spark"],"created_at":"2024-11-20T17:42:42.911Z","updated_at":"2025-07-09T23:32:08.071Z","avatar_url":"https://github.com/SANSA-Stack.png","language":"Scala","funding_links":[],"categories":["数据科学"],"sub_categories":[],"readme":"# SANSA-Stack\n\u003c!-- [![Maven Central](https://maven-badges.herokuapp.com/maven-central/net.sansa-stack/sansa-parent/badge.svg)](https://maven-badges.herokuapp.com/maven-central/net.sansa-stack/sansa-parent) --\u003e\n[![Build Status](https://github.com/SANSA-Stack/SANSA-Stack/workflows/CI/badge.svg)](https://github.com/SANSA-Stack/SANSA-Stack/actions?query=workflow%3ACI)\n[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)\n[![Twitter](https://img.shields.io/twitter/follow/SANSA_Stack.svg?style=social)](https://twitter.com/SANSA_Stack)\n\nThis project comprises the whole Semantic Analytics Stack (SANSA). At a glance, it features the following functionality:\n\n* Ingesting RDF and OWL data in various formats into RDDs\n* Operators for working with RDDs and data frames of RDF data at various levels (triples, bindings, graphs, etc)\n* *Transformation* of RDDs to data frames and *partitioning* of RDDs into R2RML-mapped data frames\n* Distributed SPARQL querying over R2RML-mapped data frame partitions using RDB2RDF engines (Sparqlify \u0026 Ontop)\n* Enrichment of RDDs with inferences\n* Application of machine learning algorithms\n\nFor a detailed description of SANSA, please visit http://sansa-stack.net. \n\n## Layers\nThe SANSA project is structured in the following five layers developed in their respective sub-folders:\n\n* [RDF](sansa-rdf)\n* [OWL](sansa-owl)\n* [Query](sansa-query)\n* [Inference](sansa-inference)\n* [ML](sansa-ml)\n\n## Release Cycle\nA SANSA stack release is done every six months and consists of the latest stable versions of each layer at this point. This repository is used for organising those joint releases.\n\n## Usage\n\n### Spark\n\n#### Requirements\n\nWe currently require a Spark 3.x.x with Scala 2.12 setup. A Spark 2.x version can be built from source based on the [spark2](https://github.com/SANSA-Stack/SANSA-Stack/tree/spark2) branch.\n\n#### Release Version\nSome of our dependencies are not in Maven central (yet), so you need to add following Maven repository to your project POM file `repositories` section:\n```xml\n\u003crepository\u003e\n   \u003cid\u003emaven.aksw.internal\u003c/id\u003e\n   \u003cname\u003eAKSW Release Repository\u003c/name\u003e\n   \u003curl\u003ehttp://maven.aksw.org/archiva/repository/internal\u003c/url\u003e\n   \u003creleases\u003e\n      \u003cenabled\u003etrue\u003c/enabled\u003e\n   \u003c/releases\u003e\n   \u003csnapshots\u003e\n      \u003cenabled\u003efalse\u003c/enabled\u003e\n   \u003c/snapshots\u003e\n\u003c/repository\u003e\n```\n\nIf you want to import the full SANSA Stack, please add the following Maven dependency to your project POM file:\n```xml\n\u003c!-- SANSA Stack --\u003e\n\u003cdependency\u003e\n   \u003cgroupId\u003enet.sansa-stack\u003c/groupId\u003e\n   \u003cartifactId\u003esansa-stack-spark_2.12\u003c/artifactId\u003e\n   \u003cversion\u003e$LATEST_RELEASE_VERSION$\u003c/version\u003e\n\u003c/dependency\u003e\n```\nIf you only want to use particular layers, just replace `$LAYER_NAME$` with the corresponding name of the layer\n```xml\n\u003c!-- SANSA $LAYER_NAME$ layer --\u003e\n\u003cdependency\u003e\n   \u003cgroupId\u003enet.sansa-stack\u003c/groupId\u003e\n   \u003cartifactId\u003esansa-$LAYER_NAME$-spark_2.12\u003c/artifactId\u003e\n   \u003cversion\u003e$LATEST_RELEASE_VERSION$\u003c/version\u003e\n\u003c/dependency\u003e\n```\n\n#### SNAPSHOT Version\nWhile the release versions are available on Maven Central, latest SNAPSHOT versions have to be installed from source code:\n```bash\ngit clone https://github.com/SANSA-Stack/SANSA-Stack.git\ncd SANSA-Stack\n```\nThen to build and install the full SANSA Spark stack you can do\n```bash\n./dev/mvn_install_stack_spark.sh \n```\nor for a single layer `$LAYER_NAME$` you can do\n```bash\nmvn -am -DskipTests -pl :sansa-$LAYER_NAME$-spark_2.12 clean install \n```\n\nAlternatively, you can use the following Maven repository and add it to your project POM file `repositories` section:\n```xml\n\u003crepository\u003e\n   \u003cid\u003emaven.aksw.snapshots\u003c/id\u003e\n   \u003cname\u003eAKSW Snapshot Repository\u003c/name\u003e\n   \u003curl\u003ehttp://maven.aksw.org/archiva/repository/snapshots\u003c/url\u003e\n   \u003creleases\u003e\n      \u003cenabled\u003efalse\u003c/enabled\u003e\n   \u003c/releases\u003e\n   \u003csnapshots\u003e\n      \u003cenabled\u003etrue\u003c/enabled\u003e\n   \u003c/snapshots\u003e\n\u003c/repository\u003e\n```\nThen do the same as for the release version and add the dependency:\n```xml\n\u003c!-- SANSA Stack --\u003e\n\u003cdependency\u003e\n   \u003cgroupId\u003enet.sansa-stack\u003c/groupId\u003e\n   \u003cartifactId\u003esansa-stack-spark_2.12\u003c/artifactId\u003e\n   \u003cversion\u003e$LATEST_SNAPSHOT_VERSION$\u003c/version\u003e\n\u003c/dependency\u003e\n```\n\n## How to Contribute\nWe always welcome new contributors to the project! Please see [our contribution guide](http://sansa-stack.net/contributing-to-sansa/) for more details on how to get started contributing to SANSA.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FSANSA-Stack%2FSANSA-Stack","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FSANSA-Stack%2FSANSA-Stack","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FSANSA-Stack%2FSANSA-Stack/lists"}