Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/Linaro/step
Secure data pipeline proof of concept module for Zephyr RTOS
https://github.com/Linaro/step
Last synced: about 2 months ago
JSON representation
Secure data pipeline proof of concept module for Zephyr RTOS
- Host: GitHub
- URL: https://github.com/Linaro/step
- Owner: Linaro
- License: apache-2.0
- Created: 2021-06-12T10:17:34.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-09-19T13:50:59.000Z (over 1 year ago)
- Last Synced: 2024-11-02T16:05:36.747Z (2 months ago)
- Language: C
- Homepage:
- Size: 7.41 MB
- Stars: 11
- Watchers: 5
- Forks: 4
- Open Issues: 2
-
Metadata Files:
- Readme: README.rst
- License: LICENSE
Awesome Lists containing this project
- awesome-zephyr-rtos - linaro_sensor_pipeline - Secure data pipelines. (Libraries / Misc)
README
.. _secure_telemetry_pipeline:
Secure Telemetry Pipeline (STeP) for Zephyr
###########################################Overview
********STeP is an experimental asychronous data processing pipeline that enables
telemetry data to be represented in a generic manner (`measurements`), and
processed via one or more **processor nodes** in pipelines called
**node chains**.Processor nodes can be thought of as mini telemetry applications that can be
chained together to represent complex data processing workflows.Processor nodes and node-chains are added to a runtime-updateable
**node registry**. Incoming measurements will be evaluated againt registered
and active processor nodes to determine if there is a **filter match** between
the measurement and the first record in the node chain. If a match occurs, the
measurement will be processed by the matching node or node chain.**NOTE**: STeP is NOT intended to replace Zephyr's sensor API or sensor
drivers, but to act as an endpoint for that raw telemetry data!Presentation
************STeP was presented as a proof of concept at Linaro Connect Fall 2021
in LVC21F-303. This slidedeck provides more detailed information on STeP as of
the date that presentation was given.Click the image below to see a PDF of the slidedeck:
.. image:: doc/LVC21F-303.png
:target: doc/LVC21F-303%20Secure%20Sensor%20Data%20Pipeline.pdfStatus
******This project is a proof of concept and a work in progress. It is intended to
test the concept of a flexible telemetry pipeline that can handle common
security requirements such as sanitising, hashing, signing, encrypting and
encoding telemetry data, reducing individual operations to reusable
building blocks or mini applications.Eventually, some of these nodes, or the entire system, should be allocated to
a trusted execution environment (TF-M, etc.), and linked to an inferrence
engine for demonstration purposes. As a first step, however, implementing this
as a Zephyr module allows for a flexible CI workflow, easy emulation in QEMU,
and rapid development and testing.Adding STeP to your project via ``west``
****************************************
The recommended way is to use ``west`` to initialize this repository directly and
all its dependencies:.. code-block:: console
$ west init -m [email protected]:Linaro/step.git
$ west updateAlternatively you can add a local copy of this module by adding the following sections
to ``zephyr/west.yml``:1. In the ``manifest/remotes`` section add:
.. code-block::
remotes:
- name: linaro
url-base: https://github.com/Linaro2. In the ``manifest/projects`` section add:
.. code-block::
- name: step
remote: linaro
path: modules/lib/step
revision: main3. Save the file, and run ``west update`` from the project root to retrieve the
latest version of the library from Github, or whatever ``revision`` was
specified above.Building and Running
********************This application can be built and executed on QEMU as follows:
.. code-block:: console
$ west build -p auto -b qemu_cortex_m3 modules/lib/step/samples/shell/ -t run
To build for another board, change ``qemu_cortex_m3`` above to that board's
name, and remove the ``-t run`` appendix.Sample Output
=============.. code-block:: console
*** Booting Zephyr OS build zephyr-v2.6.0-536-g89212a7fbf5f ***
Type 'step help' for command options.
1.) Populate the processor registry: step add
2.) Publish measurement(s): step pub
3.) Check results: step stats
uart:~$ step add
[00:01:42.089,072] proc_mgr: step_pm_register: Registering node/chain (handle 0, pri 4)
[00:01:42.089,125] step_shell: step_shell_cmd_add: Took 53750 ns
uart:~$ step pub
[00:05:38.661,550] step_shell: step_shell_cmd_pub: Published 1 measurement
[00:05:38.661,580] step_shell: step_shell_cmd_pub: Took 60916 ns, excluding polling thread processing time (run 'step list').
[00:05:38.771,241] step_shell: [7496940] Received die temp: 32.00 C (handle 0:0)
[00:05:38.771,266] step_shell: cfg: mult by 10.00 (handle 0:1)
[00:05:38.771,289] step_shell: [7496940] Received die temp: 320.00 C (handle 0:1)
[00:05:38.771,316] step_shell: [7496940] Received die temp: 320.00 C (handle 0:2)
uart:~$ step stats
init: 3
evaluate: 0
matched: 1
start: 1
run: 3
stop: 1
error: 0Exit QEMU by pressing :kbd:`CTRL+A` :kbd:`x`.
Running Unit Tests
==================To run the unit tests for this module, you can run ``twister`` via:
.. code-block:: console
$ cd $ZEPHYR_BASE
$ twister --inline-logs \
-p qemu_cortex_m3 \
-T ../modules/lib/step/testsBasic Architecture
******************The Secure Telemetry Pipeline (STeP) aims to implement an extensible workflow
to process generic sensor data (**measurements**) in a content-agnostic manner.In theory, any type of measurement, using any standard SI unit and scale, and
represented in any standard C type should be expressable in a relatively
light-weight manner, keeping in mind the memory constraints of small embedded
systems.Processing of measurements happens based on one or more **processor nodes**,
which can be chained together for more complex operations.Processor nodes have an optional filter mechanism to indicate which types of
measurements they process, allowing for processing workflows to be defined on
a per-measurement-type basis.The **Secure** in STeP comes from the goal to provide basic secure processor
nodes out of the box, implementing common operations like: hash, sign,
compress, encrypt, etc.A high-level overview of the system is shown here:
.. raw:: html
Measurement Values
==================Measurements are the main component in STeP, and traverse the system starting
as inputs from a data source, are processed, and output to an appropriate
data endpoint.STeP attempts to compromise between optimising for memory in small embedded
systems, and trying to describe exactly what this measurement represents in as
expressive a manner as possible. It aims to balance the ability to precisely
represent the exact meaning of the measurement, without wasting precious memory
on that representation.Measurements make use of the following header, with a 12-byte overhead:
::
3 2 1
1 0 9 8 7 6 5 4 3 2 1 0 9 8 7 6 5 4 3 2 1 0 9 8 7 6 5 4 3 2 1 0
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Flags | Ext. M Type | Base M Type | <- Filter
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| C Type | Scale Factor | SI Unit Type | <- Unit
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Source ID | S Cnt | V | F | Payload Length | <- SrcLen
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Timestamp (optional) |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| |
| Payload |
| |
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
1
5 4 3 2 1 0 9 8 7 6 5 4 3 2 1 0
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| Res | TSt | CMP | Encod | DF | <- Flags
+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
| | | | |
| | | | +-------- Data Format (CBOR, etc.)
| | | +--------------- Encoding (BASE64, BASE45, etc.)
| | +---------------------- Compression (LZ4, etc.)
| +---------------------------- Timestamp
+--------------------------------- Reserved (version flag?)For futher technical details, see ``ìnclude/step/measurement.h``, but a
high-level summary of these three key words is shown below:Filter
------The **Filter** word allows processor nodes to determine if this measurement
interests them or not.It consists of an 8-bit **Base Measurement Type**, and an optional 8-bit
**Extended Measurement Type**, which can be used to specialise the meaning of
the base type.EXAMPLE: ``STEP_MES_TYPE_LIGHT`` is a base type, which uses a default
SI unit of ``STEP_MES_UNIT_SI_LUX``. If we wish to represent a different
measurement in the same measurement family (base type), we could indicate
``STEP_MES_EXT_TYPE_LIGHT_RADIO_RADIANCE`` as the extended type, which
represents a radiometric measurement based on W/(sr m^2).The **Flags** field indicates other important data about this measurement
packet, such as how the data has been formatted, encoded, what compression
algorithm has been used (if any), and if a timestamp is present.Unit
----The **Unit** word describes the SI unit and optional scale factor this
measurement uses, as well as how that unit is represented in memory. A 32-bit
floating-point value may use less memory in most cases, but we may require the
additional range and precision a 64-bit float provides. The ``unit`` word
allows for a flexible expression of this information on a per-measurement basis,
without an excessive amount of overhead.Standard SI units, scale factors and C types are all represented via enums in
STeP in the ``include/step/measurement`` folder.SrcLen
------The **Source/Len** word describes the size of the payload, with an option to
spread larger payloads over multiple packets.The vector size field can be set to indicate that individual samples are
vectors (versus scalars), consisting of 2, 3 or 4 components. This covers
the most common vector representations: XY coordinates (2), XYZ vectors (3),
quaternions (4), etc.It also indicates the number of samples present in this measurement payload,
in steps of power of two (2, 4, 8, 16, 32, etc., samples). This allows for
better use of system resources by hashing, signing and encrypting larger sets
of data, with only one 12-byte header as additional memory overhead. The 4-bits
reserved to indicate that multiple samples are present allows for between 2 and
16384 samples to be stored in the payload (2^n), or an arbitrary value:::
0 = 1 sample (default) 8 = 256 samples
1 = 2 samples 9 = 512 samples
2 = 4 samples 10 = 1024 samples
3 = 8 sammples 11 = 2048 samples
4 = 16 samples 12 = 4096 samples
5 = 32 samples 13 = 8192 samples
6 = 64 samples 14 = 16384 samples
7 = 128 samples 15 = Arbitrary (see below)If the sample count is set to 15 (0xF), the number of samples should be
indicated via an unsigned 32-bit integer in little-endian format at the start
of the payload, but AFTER the optional timestamp (if present).This word also contains an 8-bit **Source ID** field, which allows the
measurement value's source to be identified to retrieve further information
about the source device out of band, such as it's min/max values, sample rate,
gain setting, etc.Measurement Memory Management
=============================In order to minimize endless memcpy operations, and deal with variable length
measurements, all ``step_measurement`` records are allocated from a central
heap memory block managed by the **sample pool manager**.Allocating and freeing memory imposes a certain amount of rigor on behalf of
the developper, and heap memory fragmentation may be an issue over time, but
at present this seems like the best tradoff for an initial proof of concept.The allocation, population, consumption and release of the measurement packet
is describe in the sequence diagram below:.. raw:: html
Filter Engine
=============The **processor manager** makes uses of the ``.filter`` word in measurements to
optionally determine if registered filter nodes should or shouldn't process
the incoming measurement value(s).If the processor node's filter chain is set to ``NULL`` (default), it will
accept all incoming measurements. If one or more filters are indicated for the
processor node, the filter engine will evaluate the measurement's filter fields
against the processor node's filter value(s), to determine if there is a match.This evaluation process introduces some overhead, which can be addressed by
enabling **filter caching**, which works as follows:.. raw:: html