Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/giedomak/telepath

A path-index based graph database engine.
https://github.com/giedomak/telepath

graph-database indexing java kotlin maven

Last synced: about 1 month ago
JSON representation

A path-index based graph database engine.

Awesome Lists containing this project

README

        

Telepath
=====
[![Build Status](https://travis-ci.org/giedomak/Telepath.svg?branch=master)](https://travis-ci.org/giedomak/Telepath)
[![Codacy Badge](https://api.codacy.com/project/badge/Grade/54b77ddc30294e5ca6ce0743f50811a4)](https://www.codacy.com/app/giedomak/Telepath?utm_source=github.com&utm_medium=referral&utm_content=giedomak/Telepath&utm_campaign=Badge_Grade)
[![Maintainability](https://api.codeclimate.com/v1/badges/be773a45c811a21e1b00/maintainability)](https://codeclimate.com/github/giedomak/Telepath/maintainability)
[![codebeat badge](https://codebeat.co/badges/ffa0cab1-0edc-4900-b96c-68a17c73e3a8)](https://codebeat.co/projects/github-com-giedomak-telepath-master)
[![codecov](https://codecov.io/gh/giedomak/Telepath/branch/master/graph/badge.svg)](https://codecov.io/gh/giedomak/Telepath)

Massive graph-structured data collections are ubiquitous in contemporary data management scenarios such as social networks, linked open data, and chemical compound databases.

The selection and manipulation of paths forms the core of querying graph datasets. Path indexing techniques can speed up this core functionality of querying graph datasets.

We propose a path-index based graph database engine.

## Documentation

The documentation can be found [here](https://giedomak.github.io/Telepath/telepath/) and a schematic overview of the architecture can be found [here](https://github.com/giedomak/Telepath/tree/master/src/main/resources).

Giedo Mak. Telepath: A path-index based graph database engine.
MSc Thesis. Department of Mathematics and Computer Science, Eindhoven University of Technology. 2017. [PDF](https://github.com/giedomak/Telepath/raw/master/src/main/resources/thesis.pdf)

## Life of a Query

This section describes the essence of the life of a query within Telepath. Each heading contains links to its docs, test and source. In most cases, the test will give a clear insight into what each specific module produces.

1. __Query input__

The user gives a regular path query as input. For example:

```
a/(b/c)
```

Where `a`, `b` and `c` are edge labels, and `/` is interpreted as the concatenation logical operator.

2. __Parse the input__ [(docs)](https://giedomak.github.io/Telepath/telepath/com.github.giedomak.telepath.staticparser/-static-parser-r-p-q/index.html) [(test)](https://github.com/giedomak/Telepath/blob/master/src/test/java/com/github/giedomak/telepath/staticparser/StaticParserRPQTest.kt#L19) [(source)](https://github.com/giedomak/Telepath/blob/master/src/main/java/com/github/giedomak/telepath/staticparser/StaticParserRPQ.kt#L18)

The query input is parsed into our internal representation of a logical plan. Our internal representation uses a tree datastructure:

CONCATENATION
/ \
a CONCATENATION
/ \
b c

3. __Generate the cheapest physical plan__ [(docs)](https://giedomak.github.io/Telepath/telepath/com.github.giedomak.telepath.planner/-dynamic-programming-planner/index.html) [(test)](https://github.com/giedomak/Telepath/blob/master/src/test/java/com/github/giedomak/telepath/planner/DynamicProgrammingPlannerTest.kt#L29) [(source)](https://github.com/giedomak/Telepath/blob/master/src/main/java/com/github/giedomak/telepath/planner/DynamicProgrammingPlanner.kt#L20)

Our planner uses the `DPsize` [algorithm](https://scholar.google.nl/scholar?q=Analysis+of+two+existing+and+one+new+dynamic+programming+algorithm+for+the+generation+of+optimal+bushy+join+trees+without+cross+products&btnG=&hl=en&as_sdt=0%2C5) as inspiration, which calculates the cheapest physical plan in a bottom-up fashion.

Since this phase is one of the main contributions, an in-depth explanation can be found [here](https://github.com/giedomak/Telepath/blob/master/src/main/java/com/github/giedomak/telepath/planner).

INDEX_LOOKUP
/ | \
a b c

4. __Evaluate the physical plan__

The physical plan is evaluated in a bottom-up fashion. All intermediate results are materialized through our MemoryManager [(docs)](https://giedomak.github.io/Telepath/telepath/com.github.giedomak.telepath.memorymanager/-memory-manager/index.html) [(test)](https://github.com/giedomak/Telepath/blob/master/src/test/java/com/github/giedomak/telepath/memorymanager/SimpleMemoryManagerTest.kt#L25) [(source)](https://github.com/giedomak/Telepath/blob/master/src/main/java/com/github/giedomak/telepath/memorymanager/SimpleMemoryManager.kt#L23).

Using [PathDB](https://github.com/maxsumrall/PathDB) to gather the paths satisfying our query:

kPathIndex.search(
PathPrefix(
physicalPlan.pathIdOfChildren()
)
)

5. __Visualize results__

At the time of writing, results will be shown to the user through a command-line interface.

```
Telepath: >>>>> Results:
Telepath: Path(pathId=9, nodes=[Node(id=10), Node(id=12), Node(id=14)])
Telepath: Path(pathId=9, nodes=[Node(id=10), Node(id=12), Node(id=8772)])
Telepath: Number of results: 2, after 5 ms
Telepath: ----------------------------
```

## Want to contribute?

The [contributing guide](https://github.com/giedomak/Telepath/blob/master/CONTRIBUTING.md)
is a good place to start. If you have questions, feel free to ask.

## Authors
[![Giedo Mak](https://avatars0.githubusercontent.com/u/6235566?v=4&s=100)](https://github.com/giedomak) | [![Max Sumrall](https://avatars2.githubusercontent.com/u/628843?v=4&s=100)](https://github.com/maxsumrall) | [![Nikolay Yakovets](https://avatars3.githubusercontent.com/u/5265191?v=4&s=100)](https://github.com/nikk186) |
---|---|---|---
[Giedo Mak](https://github.com/giedomak) | [Max Sumrall](https://github.com/maxsumrall) | [Nikolay Yakovets](https://github.com/nikk186) | [George Fletcher](https://github.com/HiroshiLyda)