Faust dockerized application
- Host: GitHub
- URL: https://github.com/marcosschroh/faust-docker-compose-example
- Owner: marcosschroh
- License: mit
- Created: 2019-03-16T17:28:49.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2022-12-01T12:29:43.000Z (about 3 years ago)
- Last Synced: 2025-03-18T05:43:59.222Z (10 months ago)
- Topics: data-stream-processing, docker, faust, kafka, schema-registry, zookeeper
- Language: Python
- Homepage:
- Size: 52.7 KB
- Stars: 68
- Watchers: 5
- Forks: 12
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Funding: .github/FUNDING.yml
- License: LICENSE
README
# Faust-Docker-Compose
[Build Status](https://travis-ci.org/marcosschroh/faust-docker-compose-example)
[License](https://github.com/marcosschroh/faust-docker-compose-example/blob/master/LICENSE)
An example showing how to include a `faust` project as a service using `docker compose`, with [Kafka](https://kafka.apache.org/), [Zookeeper](https://zookeeper.apache.org/) and [Schema Registry](https://docs.confluent.io/current/schema-registry/docs/index.html).
Notice that everything runs using `docker-compose`, including the faust example application. For
local development it is preferable to run the `kafka` cluster separately from the `faust` app.
If you want to generate a `faust` project from scratch, please use the [cookiecutter-faust](https://github.com/marcosschroh/cookiecutter-faust) template.
Read more about Faust here: https://github.com/robinhood/faust
## Project
The project skeleton is defined as a medium/large project according to the [faust layout](https://faust.readthedocs.io/en/latest/userguide/application.html#projects-and-directory-layout).
The `setup.py` contains the entrypoint used to resolve the [entrypoint problem](https://faust.readthedocs.io/en/latest/userguide/application.html#problem-entrypoint), as sketched below.
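As a rough sketch of what that entrypoint looks like (the package and module names `example` and `example.app:main` are illustrative, not copied from the repo), a `console_scripts` entry point exposes the Faust app as a command:

```python
# setup.py -- minimal, hedged sketch of the console-script entrypoint.
from setuptools import setup, find_packages

setup(
    name="example",
    version="0.1.0",
    packages=find_packages(exclude=["tests"]),
    entry_points={
        "console_scripts": [
            # installs an `example` command that boots the Faust app,
            # so the worker can be started as `example worker -l info`
            "example = example.app:main",
        ],
    },
)
```

The `WORKER` variable in the environment table below refers to this console-script name.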
## Applications
* *Page Views*: This application corresponds to [Tutorial: Count page views](https://faust.readthedocs.io/en/latest/playbooks/pageviews.html) (a condensed sketch of its agent follows this list)
* *Leader Election*: This application corresponds to [Tutorial: Leader Election](https://faust.readthedocs.io/en/latest/playbooks/leaderelection.html)
* *Users*: This is a custom application to demonstrate how to integrate `Faust` with `Avro Schema`.
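The *Page Views* agent, condensed from the Faust tutorial (and the one exercised by `make send-page-view-event` below), boils down to roughly the following; the broker URL shown is illustrative, under docker-compose it would point at the `kafka` service instead:

```python
import faust


class PageView(faust.Record):
    id: str
    user: str


app = faust.App("page_views", broker="kafka://localhost:9092")

page_view_topic = app.topic("page_views", value_type=PageView)
page_views = app.Table("page_views", default=int)


@app.agent(page_view_topic)
async def count_page_views(views):
    # repartition the stream by page id, then bump a counter per page
    async for view in views.group_by(PageView.id):
        page_views[view.id] += 1
```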
## Faust Project Dockerfile
The `Dockerfile` is based on `python:3.7-slim`. The most important part here is that the `entrypoint` will wait for `kafka` to be ready and then execute the script `run.sh`.
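The actual entrypoint in the repository is a shell script; purely to illustrate the wait-for-Kafka idea it implements, here is an equivalent Python sketch using the environment variables listed in the table below:

```python
# wait_for_kafka.py -- illustrative only; the repository uses a shell entrypoint.
import os
import socket
import time

host = os.environ.get("KAFKA_BOOSTRAP_SERVER_NAME", "kafka")
port = int(os.environ.get("KAFKA_BOOSTRAP_SERVER_PORT", "9092"))

# Poll the broker's TCP port until it accepts connections...
while True:
    try:
        with socket.create_connection((host, port), timeout=2):
            break
    except OSError:
        print(f"waiting for kafka at {host}:{port} ...")
        time.sleep(2)

# ...then hand control over to the worker start script.
os.execvp("sh", ["sh", "run.sh"])
```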
## Docker compose
`docker-compose.yaml` includes `zookeeper`, `kafka` and `schema-registry` services based on the `confluentinc` images.
For more information you can go to [confluentinc](https://docs.confluent.io/current/installation/docker/docs/index.html) and see the docker compose example [here](https://github.com/confluentinc/cp-docker-images/blob/master/examples/cp-all-in-one/docker-compose.yml#L23-L48).
Useful ENVIRONMENT variables that you may change:
| Variable | Description | Example |
|--------|--------------|---------|
| WORKER | Entrypoint in setup.py | `example`|
| WORKER_PORT | Worker port | `6066` |
| KAFKA_BOOSTRAP_SERVER | Kafka servers | `kafka://kafka:9092` |
| KAFKA_BOOSTRAP_SERVER_NAME | Kafka server name| `kafka` |
| KAFKA_BOOSTRAP_SERVER_PORT | Kafka server port | `9092` |
| SCHEMA_REGISTRY_SERVER | Schema registry server name | `schema-registry` |
| SCHEMA_REGISTRY_SERVER_PORT | Schema registry server port | `8081` |
| SCHEMA_REGISTRY_URL | Schema Registry Server url | `http://schema-registry:8081` |
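To make the table concrete, here is a hedged sketch of how these variables could be consumed when building the app; only the environment variable names come from the table, the settings-module layout is illustrative and not necessarily the repo's exact code:

```python
# settings-style sketch; variable names taken from the table above.
import os

import faust

KAFKA_BOOSTRAP_SERVER = os.environ.get("KAFKA_BOOSTRAP_SERVER", "kafka://kafka:9092")
SCHEMA_REGISTRY_URL = os.environ.get("SCHEMA_REGISTRY_URL", "http://schema-registry:8081")
WORKER_PORT = int(os.environ.get("WORKER_PORT", "6066"))

app = faust.App(
    "example",
    broker=KAFKA_BOOSTRAP_SERVER,  # e.g. kafka://kafka:9092
    web_port=WORKER_PORT,          # port served by the Faust web server
)
```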
## Commands
* Start application: `make run-dev`. This command starts both the *Page Views* and *Leader Election* applications
* Stop and remove containers: `make clean`
* List topics: `make list-topics`
* Send events to page_view topic/agent: `make send-page-view-event payload='{"id": "foo", "user": "bar"}'`
## Avro Schemas, Custom Codecs and Serializers
Because we want to be sure that the messages we encode are valid, we use [Avro Schemas](https://docs.oracle.com/database/nosql-12.1.3.1/GettingStartedGuide/avroschemas.html).
Avro is used to define the data schema for a record's value. This schema describes the fields allowed in the value, along with their data types.
For our demonstration in the `Users` application we are using the following schema:
```json
{
"type": "record",
"namespace": "com.example",
"name": "AvroUsers",
"fields": [
{"name": "first_name", "type": "string"},
{"name": "last_name", "type": "string"}
]
}
```
In order to use `avro schemas` with `Faust` we need to define a custom codec, a custom serializer and be able to talk with the `schema-registry`.
You can find the custom codec called `avro_users` registered using the [codec registration](https://faust.readthedocs.io/en/latest/userguide/models.html#codec-registry) approach described by faust.
The [AvroSerializer](https://github.com/marcosschroh/faust-docker-compose-example/blob/fix/replace-helpers-with-schemaregistry-library/faust-project/example/codecs/serializers.py#L8) is in charge of `encoding` and `decoding` messages using the [schema registry client](https://github.com/marcosschroh/python-schema-registry-client).
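As a rough sketch of the codec-registry step (the real `AvroSerializer` delegates the actual encoding/decoding to the schema registry client, so the method bodies here are deliberately left as stubs):

```python
# codecs/avro.py -- stubbed, illustrative sketch of registering a custom codec.
from faust.serializers import codecs


class AvroUsersCodec(codecs.Codec):
    def _dumps(self, obj) -> bytes:
        # real code: encode `obj` against the AvroUsers schema via the registry
        raise NotImplementedError

    def _loads(self, payload: bytes):
        # real code: decode the Avro payload via the registry
        raise NotImplementedError


# Register under the name the model refers to with serializer='avro_users'.
codecs.register("avro_users", AvroUsersCodec())
```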
Now the final step is to integrate the faust model with the `AvroSerializer`.
```python
# users.models
import faust


class UserModel(faust.Record, serializer='avro_users'):
    first_name: str
    last_name: str
```
Now our application is able to send and receive messages using Avro schemas! :-)
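As an illustrative follow-up (the app, topic and agent names here are hypothetical, not the repo's actual wiring), the model can be attached to a topic and consumed by an agent like this:

```python
import faust

from users.models import UserModel  # the model defined above

app = faust.App("users", broker="kafka://kafka:9092")
users_topic = app.topic("users", value_type=UserModel)


@app.agent(users_topic)
async def users_agent(stream):
    async for user in stream:
        # `user` arrives already decoded from Avro by the `avro_users` codec
        print(f"received {user.first_name} {user.last_name}")

# elsewhere, e.g. from another agent or a timer:
#   await users_agent.send(value=UserModel(first_name="Ada", last_name="Lovelace"))
```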
## Tests
Run the tests with `tox`. Make sure you have it installed.
```bash
tox
```
## Achievements
* [x] Application examples
* [x] Integration with Schema Registry
* [x] Schema Registry Client
* [x] Custom codecs
* [x] Custom Serializers
* [x] Avro Schemas
* [x] Make Schema Registry Client and Serializers a python package