Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/nodestream-proj/nodestream

A Declarative framework for Building, Maintaining, and Analyzing Graph Data
https://github.com/nodestream-proj/nodestream

api athena aws cli data-engineering data-lake data-science declarative etl framework graph graphql kafka knowledge-graph neo4j python s3 security visualization yaml

Last synced: 2 days ago
JSON representation

A Declarative framework for Building, Maintaining, and Analyzing Graph Data

Awesome Lists containing this project

README

        

# Nodestream


Nodestream Logo

> A Declarative framework for Building, Maintaining, and Analyzing Graph Data 🚀

[![Continuous Integration](https://github.com/nodestream-proj/nodestream/actions/workflows/ci.yaml/badge.svg)](https://github.com/nodestream-proj/nodestream/actions/workflows/ci.yaml)
[![codecov](https://codecov.io/gh/nodestream-proj/nodestream/branch/main/graph/badge.svg?token=HAPEVKQ6OQ)](https://codecov.io/gh/nodestream-proj/nodestream)
[![ApacheV2 License](https://img.shields.io/badge/License-Apache%202.0-yellow.svg)](https://opensource.org/license/apache-2-0/)

[![All Contributors](https://img.shields.io/badge/all_contributors-12-orange.svg?style=flat-square)](#contributors-)

**Nodestream allows you to work with graphs declaratively.** With nodestream, you unlock a bounty of features purpose built for working with graphs.
Semantically model your graph and map labels and properties directly to your data.
Better yet, you are not locked into your choices.
Nodestream works with you as you evolve your application by providing migration utilities to change your data schema.
Nodestream even decouples you from the underyling database technology so you can even change databases.

#### Highlights

- Connect to data sources like [Kafka](https://nodestream-proj.github.io/docs/docs/reference/extractors/#streamextractor), [files](https://nodestream-proj.github.io/docs/docs/reference/extractors/#unifiedfileextractor), [apis](https://nodestream-proj.github.io/docs/docs/reference/extractors/#simpleapiextractor), and [more](https://nodestream-proj.github.io/docs/docs/reference/extractors/)!
- Evolve your application over time with database migrations ([Docs](https://nodestream-proj.github.io/docs/docs/tutorials-intermediate/working-with-migrations/))
- Use your favorite Graph Database to fit any tech stack ([Docs](https://nodestream-proj.github.io/docs/docs/category/database-support/))
- Clean up your own data with TTLs ([Docs](https://nodestream-proj.github.io/docs/docs/tutorials-intermediate/removing-data/#implementing-a-ttl-pipeline))
- Infinite Customizability Since Nearly Everything is Pluggable!

> [Website](https://nodestream-proj.github.io/docs/) • [Blog](https://nodestream-proj.github.io/docs/blog/) • [Discussions](https://github.com/orgs/nodestream-proj/discussions) • [Contributing](#contributing) • [Contributing Developer Guides](https://nodestream-proj.github.io/docs/docs/category/developer-reference/) • [Talks from Maintainers](https://www.youtube.com/watch?v=2F-xx4LcTng&list=PLUiAbWRQecSOorv_V6TzfUBoIZyf-6r6R&pp=gAQBiAQB)

## Features

Nodestream has a pleasant CLI interface to get new projects up and running fast.

![Demo](https://raw.githubusercontent.com/nodestream-proj/nodestream/e94d0faa024c0f8da1e83a4ff6d83746504d197e/docs/img/demo.gif)

Not a fan of the defaults? You can change out databases very easily

![Using Another Database](https://nodestream-proj.github.io/docs/assets/images/neptune-2c1c78b173e824fc1e824f54287e467f.gif)

Then you can start to model your data and nodestream will evolve your database for you. No more messing with constraints or writing database queries.

![Running Migrations](https://nodestream-proj.github.io/docs/assets/images/migrations-1ede1ab3d5438cdca24d66cfa6d66231.gif)

## Getting Started

Conviced? Install nodestream with `pip` to get started.

```bash
pip install nodestream
nodestream new --database neo4j my_project && cd my_project
nodestream run sample -v
```

We highly recommend following our tutorials [here](https://nodestream-proj.github.io/docs/docs/category/tutorial---basics/)

## Packages

Nodestream is built on a Highly Pluggable and Modular Architecture. Thus... we have a lot of packages to keep track of.

| Package | Description | Version |
|------------------------------ |------------------------------------------------------------------------------- |-------------------------------------------------------------------------------------------------------------------------------- |
| `nodestream` | The core library. Declarative ingestion. | ![PyPI Version](https://badge.fury.io/py/nodestream.svg) |
| `nodestream-plugin-neo4j` | Neo4j database connector. | ![PyPI Version](https://badge.fury.io/py/nodestream-plugin-neo4j.svg) |
| `nodestream-plugin-neptune` | AWS Neptune database connector. | ![PyPI Version](https://badge.fury.io/py/nodestream-plugin-neptune.svg) |
| `nodestream-plugin-dotenv` | Adds DotEnv integration. | ![PyPI Version](https://badge.fury.io/py/nodestream-plugin-dotenv.svg) |
| `nodestream-plugin-pedantic` | A series of lints to enforce reasonable naming standards, etc. | ![PyPI Version](https://badge.fury.io/py/nodestream-plugin-pedantic.svg) |
| `nodestream-plugin-shell` | An integration with nodestream to run shell commands. | ![PyPI Version](https://badge.fury.io/py/nodestream-plugin-shell.svg) |
| `nodestream-plugin-sbom` | Import SBOM files in CycloneDX and SPDX into an opinionated graph data model. | ![PyPI Version](https://badge.fury.io/py/nodestream-plugin-sbom.svg) |
| `nodestream-plugin-akamai` | Parse Akamai properties, redirect configs, and much more and ingests them. | ![PyPI Version](https://badge.fury.io/py/nodestream-plugin-akamai.svg) |
| `nodestream-plugin-k8s` | In incubation. A plugin that orchestrates Nodestream on k8s. | ![PyPI Version](https://badge.fury.io/py/nodestream-plugin-k8s.svg) |

## Contributors

Nodestream is a community project. We welcome all contributions.
Be sure to checkout or [Contributing Docs](https://nodestream-proj.github.io/docs/docs/category/developer-reference/) and our [Code of Conduct](./CODE_OF_CONDUCT.md) before contributing.



Zach Probst
Zach Probst

💻 👀 🚧
Chad Cloes
Chad Cloes

💻 👀 🚧
asantos4
asantos4

💻 👀 🚧
Grant Hoffman
Grant Hoffman

💻 👀
khneal
khneal

💻
orozen
orozen

💻
Sophia Don Tranho
Sophia Don Tranho

💻


bechbd
bechbd

💻
yasonk
yasonk

💻 👀
Stuart Macleod
Stuart Macleod

💻
Cole Greer
Cole Greer

💻
Austin Lucas
Austin Lucas

💻






Add your contributions



### Contributing

Need a quick reference guide on how to contribute? Here you go!

#### Getting Setup

To get started you'll need to install poery.

```bash
curl -sSL https://install.python-poetry.org | python3 -
```

You then can install the project dependencies with the following command:

```bash
poetry install
```

No need to active a virtual environment. Poetry handles that for you with `poetry run` and `poetry shell`.

#### Running Tests

To run tests for the entire project, run the following command:

```bash
poetry run pytest
```