https://github.com/KNowledgeOnWebScale/walder

Walder offers an easy way to set up a website or Web API on top of decentralized knowledge graphs.
https://github.com/KNowledgeOnWebScale/walder
knowledge-graph linked-data rdf solid web-api website
Last synced: 8 months ago
JSON representation
Walder offers an easy way to set up a website or Web API on top of decentralized knowledge graphs.
Host: GitHub
URL: https://github.com/KNowledgeOnWebScale/walder
Owner: KNowledgeOnWebScale
License: mit
Created: 2020-08-18T08:30:46.000Z (almost 5 years ago)
Default Branch: development
Last Pushed: 2024-04-29T12:43:56.000Z (about 1 year ago)
Last Synced: 2024-08-08T18:26:51.836Z (11 months ago)
Topics: knowledge-graph, linked-data, rdf, solid, web-api, website
Language: JavaScript
Homepage:
Size: 28.6 MB
Stars: 62
Watchers: 7
Forks: 8
Open Issues: 29
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project

awesome-starred - KNowledgeOnWebScale/walder - Walder offers an easy way to set up a website or Web API on top of decentralized knowledge graphs. (website)
README

        ![logo of Walder](logo/logo.png)

Walder offers an easy way to set up a website or Web API on top of decentralized knowledge graphs.

Knowledge graphs incorporate data together with the meaning of that data.

This makes it possible to combine data from multiple knowledge graphs,

even if different, independent parties maintain or host them.

Knowledge graphs can be hosted via 

[Solid](https://solidproject.org/) PODs, 

[SPARQL](https://www.w3.org/TR/sparql11-query/) endpoints, 

[Triple Pattern Fragments](https://linkeddatafragments.org/) interfaces, 

[RDF](https://www.w3.org/TR/rdf11-concepts/) files, 

and so on.

Using content negotiation, 

Walder makes the data in these knowledge graphs available to clients 

via HTML, RDF, and JSON-LD.

Users define in a configuration file which data Walder uses and 

how it processes this data.

Find out which APIs are built with Walder [here](#built-with-walder).

**Table of contents**

- [Installation](#installation)

- [Usage](#usage)

  - [CLI](#cli)

  - [Library](#library)

  - [Config file structure](#config-file-structure)

    - [Resources](#resources)

    - [Path entry](#path-entry)

  - [Example](#example)

  - [Options for GraphQL-LD queries](#options-for-graphql-ld-queries)

  - [Multiple config files](#multiple-config-files)

  - [Content negotiation](#content-negotiation)

    - [RDF](#rdf)

  - [HTML templates](#html-templates)

    - [Accessing query results in view templates](#accessing-query-results-in-view-templates)

    - [Using layouts in view templates](#using-layouts-in-view-templates)

    - [Accessing front-matter metadata in view templates and layout templates](#accessing-front-matter-metadata-in-view-templates-and-layout-templates)

- [Input validation](#input-validation)

- [Error handling](#error-handling)

  - [Errors](#errors)

    - [Global](#global)

    - [Pipe modules](#pipe-modules)

    - [GraphQL-LD](#graphql-ld)

  - [Example](#example-1)

- [Developing your website](#developing-your-website)

- [Dependencies](#dependencies)

- [Tests](#tests)

- [Built with Walder](#built-with-walder)

- [License](#license)

## Installation

Install Walder globally via `yarn global add walder`.

For development, follow these steps:

1. Clone the repository.

2. Install dependencies via `yarn install`.

3. Install Walder globally via `$ yarn global add file:$(pwd)`.

## Usage

Walder is available as a [CLI](#cli) and JavaScript [library](#library).

### CLI

```bash

Usage: walder [options]

Options:

  -v, --version              output the version number

  -c, --config   YAML configuration file input

  -p, --port [portNumber]    server port number (default: 3000)

  -l, --log [level]          enable logging and set logging level (one of [error, warn, info, verbose, debug]) (default: "info")

  --no-cache                 disable Comunica default caching

  --lenient                  turn Comunica errors on invalid data into warnings

  -h, --help                 output usage information

```

### Library

```js

// From the root directory

const Walder = require('.');

const configFilePath = 'path/to/configfile';

const port = 9000; // Defaults to 3000 

const logging = 'info'; // Defaults to 'info' 

const cache = false;  // Defaults to true

const lenient = true; // Defaults to false 

const walder = new Walder(configFilePath, {port, logging, cache, lenient, cwd});

walder.activate();    // Starts the server

walder.deactivate();  // Stops the server

```

### Config file structure

You write a config file in YAML following the [OpenAPI 3.0 specification](https://swagger.io/docs/specification/basic-structure/).

The config file must have the following structure:

```yaml

openapi: 3.0.2

info:  # OpenAPI metadata

  title: 'Example site'

  version: 0.1.0

x-walder-resources:  # Directories used by Walder - OPTIONAL

  root:  # Path to the root folder of the directories used by Walder (absolute or relative to the directory containing the config file) - OPTIONAL (default: .)

  views:  # Path to directory containing view template files (absolute or relative to the root folder) - OPTIONAL (default: views)

  pipe-modules:  # Path to directory containing local pipe modules (absolute or relative to the root folder) - OPTIONAL (default: pipe-modules)

  public:  # Path to directory containing all files that should be available statically (e.g. stylesheets) (absolute or relative to the root folder) - OPTIONAL (default: public)

  layouts: # Path to directory containing all layout template files that can be used by view template files (absolute or relative to the root folder) - OPTIONAL (default: layouts)

x-walder-datasources:  # Default list of data sources

  - ...  # E.g. link to SPARQL endpoint or a GraphQL-LD query

paths:  # List of path entries

  path-entry-1:

    ...

  path-entry-2:

    ...

x-walder-errors: # Default error page views - status codes with files containing the HTML view template (absolute path or relative to the views directory)

  404: ...

  500: ...

  ...

```

#### Resources

The `x-walder-resources` key of the config file contains paths to directories used by Walder.

This key and it's values are optional. 

If a user does not give paths, 

Walder uses the following default values relative to the directory of the config file.

```yaml

root: .

views: views

pipe-modules: pipe-modules

public: public

layouts: layouts

```

To prevent Walder from making the wrong files public, 

when a user does not give a path to the `public` field, 

Walder creates a new directory `public` if it does not find this directory in the current working directory and uses that one.

#### Path entry

A path entry defines a route and has the following structure:

```yaml

path:  # The path linked to this query

  request:  # The HTTP request type (GET, POST, etc.)

    summary: ...  # Short description

     parameters:  # Path variables/Query parameters

        - in: ...  # 'path' or 'query'

          name: ...  # Name of the parameter

          schema:

            type: ... # Type of the parameter

          description: ...  # Description of the parameter

    x-walder-query:

      graphql-query: ...  # One or more GraphQL queries

      json-ld-context: ...  # The JSON-LD corresponding to the GraphQL query

      sparql-query: ... # One or more SPARQL queries

      json-ld-frame: ... # A JSON-LD frame that should be applied to the result of a SPARQL query 

      options: # Global options that will be applied to all the graphql-queries of this path (OPTIONAL)

      datasources:  # Query specific datasources (OPTIONAL)

        additional: ...  # Boolean stating that the following datasources are meant to be used on top of the default ones

        sources:  # List of query specific datasources

          - ...  # E.g. link to SPARQL endpoint

    x-walder-postprocessing:  # The (list of) pipe modules used for postprocessing

      module-id:  # Identifier of the pipe module

        source: ...  # Path leading to source code of the pipe module (absolute path or relative to the pipe-modules directory)

        parameters: # the parameters for the pipe module (OPTIONAL)

          - _data # (DEFAULT) this gives all the data, but all paths in the data object are supported (e.g. _data.0.employee)

          - ... # Additional parameters if your function supports those (OPTIONAL)

    responses:  # Status codes with files containing the HTML view template (absolute path or relative to the views directory)

      200: ...  # (REQUIRED)

      500: ...  # (OPTIONAL)

```

### Example

The following command starts a server on port 3000 (default) using an example config file.

`$ walder -c example/config.yaml `

This will start a server on `localhost:3000` with the following routes:

*  - Returns a list of directors of movies starring Brad Pitt.

*  - Returns a list of songs by a given musician.

For example,  returns a list of songs by Lady Gaga.

*  - Returns a list of a given artist's songs 

selecting those written by a specific person identified by name. 

For example,  returns a list of 

a given artist's songs written by a specific person.

*  - Returns a list of songs by a given artist 

that have 'star' in the title, using pipe modules.

For example,  returns a list of songs by David Bowie that 

have 'star' in the title.

### Options for GraphQL-LD queries

In the path entry above, 

the user defined `options` as a global (optional) identifier that Walder uses for every query of that path.

We have two options where we can choose from: 

`sort` and `remove-duplicates`. 

With given syntax:

```yaml

options:

  sort: # Enable sorting on the data (OPTIONAL)

    object: # JSONPath to the object you want to sort for

    selectors: # The values inside the object over which you want to sort

      - ... # The default option when you want ascending order, just give the value (JSONPath notation supported for further nesting)

      - value: ...  # When you want descending order, specify the value/order (JSONPath notation supported for further nesting)

        order: desc

  remove-duplicates: # Enable the removal of duplicates of the data (OPTIONAL)

    object: ... # The JSONPath tot the object that you want to compare

    value: ... # The value that has to be compared to determine whether it's duplicate (JSONPath notation is also supported for further nesting)

```

If you do not want `options` to be global for the whole path, 

you can define `options` per query.

```yaml

path:  # The path linked to this query

  request:  # The HTTP request type (GET, POST, etc.)

    summary: ...  # Short description

     parameters:  # Path variables/Query parameters

        - in: ...  # 'path' or 'query'

          name: ...  # Name of the parameter

          schema:

            type: ... # Type of the parameter

          description: ...  # Description of the parameter

    x-walder-query:

      graphql-query: ...  # One or more GraphQL queries

        name:

          query: ... # The GraphQL query

          options: # options that will be applied only to this specific graphql-query (OPTIONAL)

...

```

The following command starts a server using [this](example/config-sorting-duplicates.yaml) config file.

`$ walder -c example/config-sorting-duplicates.yaml`

This will start a server on `localhost:3000` with the following routes:

*  - 

Returns a list of songs with their artists written by a given musician. 

Walder sorts the songs in descending order by song name.

For example,  returns a list of such songs written by Marcella Detroit.

*  - 

Returns a list of songs with their artists written by a given musician.

Walder removes duplicate song names from the list.

For example,  returns a list of such songs by John Marcella Detroit.

*  - 

Returns a list of songs with their artists written by a given musician.

Walder sorts the songs in descending order by song name and removes duplicate song names.

For example,  returns a list of such songs by Marcella Detroit.

*  - 

Returns a list of song names and movie URLs for a given artist. 

Walder removes duplicate songs based on their names and 

sorts movies in descending order by their URLs.

For example,  returns a list of such songs and movies.

### Multiple config files

You can split a config file in multiple files, 

using the `$ref` keyword. 

We follow the [OpenAPI 3.0 spec](https://swagger.io/docs/specification/using-ref/) 

that explains how to use the referencing.

When first referenced you need to use the path beginning from the directory of the config file, 

but if the referenced file has references itself, 

it can use paths relative to its own location, as shown below.

The actual config file referencing its paths

```yaml

openapi: 3.0.2

info:

  title: 'Example site'

  version: 0.1.0

x-walder-resources:

  path: ./

  views: views

  pipe-modules: pipeModules

  public: public

x-walder-datasources:

  - http://fragments.dbpedia.org/2016-04/en

paths:

  /music/{musician}:

    $ref: './paths/music_musician.yaml'

  /movies/{actor}:

    $ref: './paths/movies_actor.yaml'

x-walder-errors:

  404:

    description: page not found error

    x-walder-input-text/html: error404.html

  500:

    description: internal server error

    x-walder-input-text/html: error500.html

```

Below you see `./example/paths/movies_actor.yaml` with reference with path relative to its own location

```yaml

get:

  summary: Returns a list of the all movies the given actor stars in

  parameters:

    - in: path

      name: actor

      required: true

      schema:

        type: string

      description: The target actor

  x-walder-query:

    $ref: '../walderQueryInfo/movies_actor_info.yaml'

  responses:

    200:

      description: list of movies

      x-walder-input-text/html: movies.pug

```

### Content negotiation

Using content negotiation, 

Walder makes the following output formats available:

* 'text/html'

* 'application/ld+json'

* 'text/turtle'

* 'application/n-triples'

* 'application/n-quads'

#### RDF

Walder uses [graphql-ld-comunica](https://www.npmjs.com/package/graphql-ld-comunica) 

to execute the GraphQL queries

and [@comunica/query-sparql](https://github.com/comunica/comunica/tree/master/engines/query-sparql) 

to execute SPARQL queries.

The result of a GraphQL-LD query is a JSON data. 

Walder first converts it into JSON-LD. 

This enables conversion to other RDF formats during content negotiation.

The result of a SPARQL query is an array of quads.

If a JSON-LD frame is specified, the quads are converted to JSON-LD.

Due to the importance of content negotiation,

only CONSTRUCT queries are supported.

### HTML templates

Walder uses [consolidate](https://www.npmjs.com/package/consolidate) 

to automatically retrieve the corresponding engine for a given template. 

This means that the [supported template engines](https://www.npmjs.com/package/consolidate#supported-template-engines) 

are dependent on consolidate.

You can use different template engines for different routes, e.g., 

[pug](https://pugjs.org/api/getting-started.html) renders one route's HTML, 

while [handlebars](https://handlebarsjs.com/) renders another route's HTML. 

Walder does this all by looking at the file extension of the given template.

Templates can be used in views as well as in layouts. So we'll name them **view templates** and **layout templates** in order to distinguish.

#### Accessing query results in view templates

The results of the queries, specified in the configuration file for a route, 

are available for rendering in view templates as data.

- If the route only has a single query, the data contains an object named `data`.

- If the route has multiple queries, 

each query's result is available in the data as a separate object 

whose name equals the query name in the configuration file.

In the case of a GraphQL-LD query,

each object will be an array, 

unless the query was [singularized](https://github.com/rubensworks/graphql-to-sparql.js#singularizing-everything).

[songs.handlebars](example/views/songs.handlebars) is an example of the consumption of the result

of the single query in the route `/music/{musician}` in [this configuration file](example/config.yaml).

[songs_movies.handlebars](example/views/songs_movies.handlebars) is an example of the consumption of the results

of the two queries in the route `/artist/{artist}` in [this configuration file](example/config-multiple-queries.yaml).

In the case of a SPARQL query, 

each object is an array of quads if no JSON-LD frame is specified.

Else it will be a JSON-LD object.

#### Using layouts in view templates

Using layouts is a great way to avoid repetition in route-specific view templates.

Reusable HTML structures such as headers, footers, navigation bars and other contents, meant to appear in multiple routes,

are preferable specified in *layout files*.

A layout template file can be specified in a view template file, 

by means of [front-matter](https://github.com/jxson/front-matter) metadata field `layout`.

It should contain a filename, available at the `layouts` location defined in the configuration file.

It may contain a relative path in front of the filename.

Example view template file specifying a layout:

```

---

layout: my-layout.pug

---

// view template continues here

```

Walder puts the inner HTML contents generated from the view template file into the data forwarded to 

the layout template file as an object named `content`.

The layout template file is yet another template. 

It usually expands these inner HTML contents at the position of its choice.

A simple *pug* example (mind the `!{content}`):

```

doctype html

html(lang="en")

    head

        title I'm based on a layout

    body !{content}

```

#### Accessing front-matter metadata in view templates and layout templates

In addition to query results, Walder adds [front-matter](https://github.com/jxson/front-matter) metadata,

specified in view templates, as additional attributes to the data.

Each additional attribute's name is equal to the metadata field name provided.

The following metadata field are reserved: `layout`, `content`, `data`, and 

the names assigned to queries in routes having multiple queries (see above).

These attributes are available to the view template and to the layout template it refers to, if any.

Example view template file specifying a front-matter metadata field and reading that field (mind the `#{a1}`):

```

--

a1: Value for FrontMatter attribute a1!

---

doctype html

html(lang="en")

    body

        main a1: #{a1}

```

Example view template file, specifying a layout template and another front-matter metadata field:

```

---

layout: layout-fm.pug

a2: Value for FrontMatter attribute a2!

---

main Lorem ipsum

```

Example corresponding layout template (layout-fm.pug) reading that field (mind the `#{a2}`):

```

doctype html

html(lang="en")

    head

        if a2

            title #{a2}

    body !{content}

```

## Input validation

While parsing the config file, 

Walder also validates the correctness and completeness of the input.

When Walder has parsed the whole config file and found errors, 

Walder returns all errors and exits.

 

At the moment, Walder validates the following:

- The config files describe all variables in the query in the parameters section.

- The config files mention only existing files in `200~x-walder-input-text/html` entries.

## Error handling

Walder binds error pages to a certain HTTP status code. 

You can define default error pages, 

but also path specific error pages by adding them to the `responses` key in the corresponding path entry.

### Errors

#### Global

* Error `404`: Page not found

* Error `500`: Internal server error

#### Pipe modules

* Error `500`: Could not apply the given pipe modules

#### GraphQL-LD

* Error `404`: Expected variable was not given

* Error `500`: Could not execute the given query

### Example

When you run Walder using the following command:

`$ walder -c example/config-errors.yaml`

the following paths lead to errors:

*  → error `404` (Global: Page not found)

*  → error `500` (Pipe modules: Could not apply the given pipe modules)

*  → error `500` (GraphQL-LD: Could not execute the given query)

The following config file excerpt will use the path specific `moviesServerError.handlebars` view template on errors leading to status code `500` when navigating to `/movies`.

When the required query parameter `actor` is not passed, Walder returns the status code `404`. 

Walder will use the default `error404.html` file since the config file has no path-specific HTML view template for the corresponding status.

```yaml

...

paths:

  /movies:

    get:

      summary: Returns a list of the all movies the given actor stars in

      parameters:

        - in: query

          name: actor

          schema:

            type: string

            minimum: 0

          description: The actor from whom the movies are requested

          required: true

      x-walder-query:

        graphql-query: >

          {

            id @single

            ... on Film {

              starring(label: $actor) @single

            }

          }

        json-ld-context: >

          {

            "@context": {

              "Film": "http://dbpedia.org/ontology/Film",

              "label": { "@id": "http://www.w3.org/2000/01/rdf-schema#label", "@language": "en" },

              "starring": "http://dbpedia.org/ontology/starring"

            }

          }

      responses:

        200:

          description: list of movies

          x-walder-input-text/html: movies.pug

        500:

          description: internal movie server error

          x-walder-input-text/html: moviesServerError.handlebars

x-walder-errors:

  404:

    description: page not found error

    x-walder-input-text/html: error404.html

  500:

    description: internal server error

    x-walder-input-text/html: error500.html

```

## Developing your website

Whilst developing your website,

you probably want your website to reload while making changes to `config.yaml`.

You can easily do this using [npm-watch](https://www.npmjs.com/package/npm-watch).

See the `package.json` snippet below on how to start

```json

{

  "watch": {

    "run": "config.yaml"

  },

  "scripts": {

    "run": "walder -c config.yaml --no-cache",

    "watch": "npm-watch"

  },

  "dependencies": {

    "walder": "^2.0.1"

  },

  "devDependencies": {

    "npm-watch": "^0.7.0"

  }

}

```

Run `npm run watch` and Walder reloads every `config.yaml` change!

## Dependencies

| Library | License |

| ------- | ------- |

| [@comunica](https://github.com/comunica/) | MIT |

| [accepts](https://www.npmjs.com/package/accepts) | MIT

| [axios](https://www.npmjs.com/package/axios) | MIT

| [chai](https://www.npmjs.com/package/chai) | MIT

| [commander](https://www.npmjs.com/package/commander) | MIT |

| [consolidate](https://www.npmjs.com/package/consolidate) | MIT |

| [debug](https://www.npmjs.com/package/debug) | MIT |

| [express](https://www.npmjs.com/package/express) | MIT |

| [front-matter](https://www.npmjs.com/package/front-matter) | MIT |

| [fs-extra](https://www.npmjs.com/package/fs-extra) | MIT |

| [graphql-ld](https://www.npmjs.com/package/graphql-ld) | MIT |

| [graphql-ld-comunica](https://www.npmjs.com/package/graphql-ld-comunica) | MIT |

| [handlebars](https://www.npmjs.com/package/handlebars) | MIT |

| [is-html](https://www.npmjs.com/package/is-html) | MIT |

| [jade-to-handlebars](https://www.npmjs.com/package/jade-to-handlebars) | MIT |

| [json-refs](https://www.npmjs.com/package/json-refs) | MIT |

| [jsonld](https://www.npmjs.com/package/jsonld) | BSD-3-Clause |

| [jsonpath](https://www.npmjs.com/package/jsonpath) | MIT |

| [markdown-it](https://www.npmjs.com/package/markdown-it) | MIT |

| [mocha](https://www.npmjs.com/package/mocha) | MIT |

| [morgan](https://www.npmjs.com/package/morgan) | MIT |

| [n3](https://www.npmjs.com/package/n3) | MIT |

| [object-path](https://www.npmjs.com/package/object-path) | MIT |

| [pug](https://www.npmjs.com/package/pug) | MIT |

| [supertest](https://www.npmjs.com/package/supertest) | MIT |

| [tmp](https://www.npmjs.com/package/tmp) | MIT |

| [winston](https://www.npmjs.com/package/winston) | MIT |

| [yaml](https://www.npmjs.com/package/yaml) | ISC |

## Tests

* Test framework: [Mocha](https://www.npmjs.com/package/mocha)

* BDD / assertion library: [Chai](https://www.npmjs.com/package/chai)

* HTTP assertions: [SuperTest](https://www.npmjs.com/package/supertest)

## Built with Walder

- [KNoWS](https://knows.idlab.ugent.be)

Did you build something Walder and

want to add it to the list?

Please create a pull request! 

## License

This code is copyrighted ©2019–2020 by [Ghent University – imec](http://idlab.ugent.be/)

and released under the [MIT license](http://opensource.org/licenses/MIT).
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/KNowledgeOnWebScale/walder

Awesome Lists containing this project

README