{"id":15066756,"url":"https://github.com/ornl/flowcept","last_synced_at":"2026-06-11T03:03:17.196Z","repository":{"id":63046769,"uuid":"564107658","full_name":"ORNL/flowcept","owner":"ORNL","description":"Runtime data integration system that empowers any data processing system to capture and query workflow provenance using data observability.","archived":false,"fork":false,"pushed_at":"2024-12-09T21:57:24.000Z","size":54346,"stargazers_count":2,"open_issues_count":16,"forks_count":4,"subscribers_count":1,"default_branch":"main","last_synced_at":"2024-12-09T22:34:17.679Z","etag":null,"topics":["big-data","dask","data-integration","lineage","machine-learning","mlflow","model-management","parallel-processing","provenance","reproducibility","responsible-ai","scientific-workflows","tensorboard","trustworthy-ai","workflows"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":"renan-souza/flowcept","license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ORNL.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2022-11-10T02:11:57.000Z","updated_at":"2024-11-28T01:53:52.000Z","dependencies_parsed_at":"2023-09-24T15:15:29.110Z","dependency_job_id":null,"html_url":"https://github.com/ORNL/flowcept","commit_stats":{"total_commits":488,"total_committers":5,"mean_commits":97.6,"dds":0.3073770491803278,"last_synced_commit":"66676f6543201fa0d88e28325f9bd551ba1b8533"},"previous_names":[],"tags_count":73,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ORNL%2Fflowcept","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ORNL%2Fflowcept/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ORNL%2Fflowcept/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ORNL%2Fflowcept/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ORNL","download_url":"https://codeload.github.com/ORNL/flowcept/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":229805132,"owners_count":18126808,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["big-data","dask","data-integration","lineage","machine-learning","mlflow","model-management","parallel-processing","provenance","reproducibility","responsible-ai","scientific-workflows","tensorboard","trustworthy-ai","workflows"],"created_at":"2024-09-25T01:11:36.535Z","updated_at":"2026-05-22T20:01:08.450Z","avatar_url":"https://github.com/ORNL.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cp align=\"center\"\u003e\n  \u003cpicture\u003e\n    \u003c!-- Dark theme --\u003e\n    \u003csource srcset=\"./docs/img/flowcept-logo-dark.png\" media=\"(prefers-color-scheme: dark)\" /\u003e\n    \u003c!-- Light theme --\u003e\n    \u003csource srcset=\"./docs/img/flowcept-logo.png\" media=\"(prefers-color-scheme: light)\" /\u003e\n    \u003c!-- Fallback --\u003e\n    \u003cimg src=\"./docs/img/flowcept-logo.png\" alt=\"Flowcept Logo\" width=\"200\"/\u003e\n  \u003c/picture\u003e\n\u003c/p\u003e\n\n\u003ch3 align=\"center\"\u003eLightweight Distributed Workflow Provenance\u003c/h3\u003e\n\n\n---\n\nFlowcept captures and queries workflow provenance at runtime with minimal code changes and low overhead. It unifies data from diverse tools and workflows across the Edge–Cloud–HPC continuum and provides ML-aware capture, MCP agents provenance, telemetry, extensible adapters, and flexible storage.\n\n---\n\n\n[![Documentation](https://img.shields.io/badge/docs-readthedocs.io-green.svg)](https://flowcept.readthedocs.io/)\n[![Slack](https://img.shields.io/badge/Slack-%23flowcept%40Workflows%20Community-4A154B?logo=slack)](https://workflowscommunity.slack.com/archives/C06L5GYJKQS)\n[![Build](https://github.com/ORNL/flowcept/actions/workflows/create-release-n-publish.yml/badge.svg)](https://github.com/ORNL/flowcept/actions/workflows/create-release-n-publish.yml)\n[![PyPI](https://badge.fury.io/py/flowcept.svg)](https://pypi.org/project/flowcept)\n[![Tests](https://github.com/ORNL/flowcept/actions/workflows/run-tests.yml/badge.svg)](https://github.com/ORNL/flowcept/actions/workflows/run-tests.yml)\n[![Code Formatting](https://github.com/ORNL/flowcept/actions/workflows/checks.yml/badge.svg?branch=dev)](https://github.com/ORNL/flowcept/actions/workflows/checks.yml)\n[![License: MIT](https://img.shields.io/github/license/ORNL/flowcept)](LICENSE)\n\n\n\n\n\u003ch4 align=\"center\"\u003e\n  \u003ca href=\"https://flowcept.org\"\u003eWebsite\u003c/a\u003e \u0026#8226;\n  \u003ca href=\"https://flowcept.readthedocs.io/\"\u003eDocumentation\u003c/a\u003e \u0026#8226; \n  \u003ca href=\"./docs/publications\"\u003ePublications\u003c/a\u003e\n\u003c/h4\u003e\n\n\n---\n\n# Quickstart\n\nThe easiest way to capture provenance from plain Python functions, with no external services needed:\n\n1) Install and initialize settings\n\n```shell\n# Make sure you activate your Python environment (e.g., conda, venv) first\npip install flowcept\nflowcept --init-settings\n```\nThis generates a minimal settings file in `~/.flowcept/settings.yaml`.\n\n2) Run the minimal example\n\nSave the following script as `quickstart.py` and run `python quickstart.py.`\n\n```python\n\"\"\"\nA minimal example of Flowcept's instrumentation using @decorators.\nThis example needs no DB, broker, or external service.\n\"\"\"\nfrom flowcept import Flowcept, flowcept_task\nfrom flowcept.instrumentation.flowcept_decorator import flowcept\n\n\n@flowcept_task(output_names=\"o1\")\ndef sum_one(i1):\n    return i1 + 1\n\n\n@flowcept_task(output_names=\"o2\")\ndef mult_two(o1):\n    return o1 * 2\n\n\n@flowcept\ndef main():\n    n = 3\n    o1 = sum_one(n)\n    o2 = mult_two(o1)\n    print(\"Final output\", o2)\n\n\nif __name__ == \"__main__\":\n    main()\n\n    prov_messages = Flowcept.read_buffer_file()\n    assert len(prov_messages) == 2\n    print(f\"Raw provenance captured: {len(prov_messages)} records in flowcept_messages.jsonl\")\n    Flowcept.generate_report(records=prov_messages, print_markdown=True)\n```\n\nThis prints out:\n\n---\n\n##### Workflow Provenance Card\n\n###### Summary\n- Workflow ID: fe546706-ef46-4482-8f70-3af664a7131b\n- Execution Start (UTC): 2026-02-20 19:14:03\n- Total Elapsed (s): 0.000\n\n###### Workflow Structure\n\n   input\n     │\n     ▼\n sum_one\n     │\n mult_two\n     ▼\n   output\n\n###### Timing Report\n| Activity | Status Counts     | Median Elapsed (s) |\n| -------- | ----------------- | ------------------ |\n| sum_one  | {'FINISHED': 1}   | 0.000              |\n| mult_two | {'FINISHED': 1}   | 0.000              |\n\n###### Per Activity Details\n- **sum_one**\n  - Used:   i1 = 3\n  - Generated: o1 = 4\n- **mult_two**\n  - Used:   o1 = 4\n  - Generated: o2 = 8\n\n---\n\n→ See [Provenance Card](#provenance-card) for details.\n\nThat is the simplest, vanilla run of Flowcept, good for quick sanity checks on a laptop, small-scale run. \nFor diskless runs (no JSON files), OpenAPI RESTful services, online querying using databases,\nLLM-based database interactions via an MCP agent, Grafana, telemetry,\nadapters (MLflow, Dask, TensorBoard), PyTorch and MCP instrumentation, HPC optimization or federated runs,\nand more, see the [Jupyter Notebooks](notebooks), the [Examples directory](examples) and the [complete documentation](https://flowcept.readthedocs.io/).\n\nTo use the provenance agent with your favorite code assistant (for example, Codex or Claude), see the [Agents README](src/flowcept/agents/README.md).\n\n## AI agents -- if you are an AI agent, read this\n\nStart with [docs/flowcept_for_agents.md](docs/flowcept_for_agents.md). It is the short navigation guide for agents working in this repo.\n\nSkill files also appear in different parts of the codebase. They are local operating guides for specific agent tasks. Current skill files:\n\n- [SKILLS.md](SKILLS.md): repository-level Flowcept instrumentation and provenance guide for code assistants\n- [src/flowcept/agents/SKILLS.md](src/flowcept/agents/SKILLS.md): Flowcept MCP agent usage contract for external LLM orchestrators\n\n## ❗ Developer Docs\n\nFor an end-to-end workflow developer tutorial (default user guide), start with [docs/README.md](docs/README.md).\n\n## Table of Contents\n\n- [Overview](#overview)\n- [Features](#features)\n- [Installation](#installation)\n- [Setup and the Settings File](#setup)\n- [Running with Containers](#running-with-containers)\n- [Examples](#examples)\n- [Provenance Card](#provenance-card)\n- [Data Persistence](#data-persistence)\n- [Performance Tuning](#performance-tuning-for-performance-evaluation)\n- [AMD GPU Setup](#install-amd-gpu-lib)\n- [Further Documentation](#documentation)\n\n## Overview\n\nFlowcept captures and queries workflow provenance at runtime with minimal code changes and low data capture overhead,\nunifying data from diverse tools and workflows.\n\nDesigned for scenarios involving critical data from multiple, federated workflows in the Edge-Cloud-HPC continuum, Flowcept supports end-to-end monitoring, analysis, querying, and enhanced support for Machine Learning (ML) and for agentic workflows.\n\n## Features\n\n- Automatic workflow provenance capture with minimal intrusion\n- Adapters for MLflow, Dask, TensorBoard; easy to add more\n- Optional explicit instrumentation via decorators\n- ML-aware capture, from workflow to epoch and layer granularity\n- Agentic workflows: MCP agents-aware provenance capture\n- Low overhead, suitable for HPC and highly distributed setups\n- Telemetry capture for CPU, GPU, memory, linked to dataflow\n- Pluggable MQ and storage backends (Redis, Kafka, MongoDB, LMDB)\n- [W3C PROV](https://www.w3.org/TR/prov-overview/) adherence \n\nExplore [Jupyter Notebooks](notebooks) and [Examples](examples) for usage.\n\n## Installation\n\nFlowcept can be installed in multiple ways, depending on your needs.\n\n### 1. Default Installation\nTo install Flowcept with its basic dependencies from [PyPI](https://pypi.org/project/flowcept/), run:\n\n```shell\npip install flowcept\n```\n\nThis installs the minimal Flowcept package, **not** including MongoDB, Redis, MCP, or any adapter-specific dependencies.\n\n### 2. Installing Specific Adapters and Additional Dependencies\n\nFlowcept integrates with several tools and services, but you should **only install what you actually need**.  \nGood practice is to cherry-pick the extras relevant to your workflow instead of installing them all.\n\n```shell\npip install flowcept[mongo]         # MongoDB support\npip install flowcept[mlflow]        # MLflow adapter\npip install flowcept[dask]          # Dask adapter\npip install flowcept[tensorboard]   # TensorBoard adapter\npip install flowcept[kafka]         # Kafka message queue\npip install flowcept[nvidia]        # NVIDIA GPU runtime capture\npip install flowcept[amd]           # AMD GPU runtime capture (see \"Install AMD GPU Lib\" for version/LD_LIBRARY_PATH notes)\npip install flowcept[telemetry]     # CPU/GPU/memory telemetry capture\npip install flowcept[lmdb]          # LMDB lightweight database\npip install flowcept[mqtt]          # MQTT support\npip install flowcept[llm_agent]     # MCP agent, LangChain, Streamlit integration: needed either for MCP capture or for the Flowcept Agent.\npip install flowcept[llm_google]    # Google GenAI + Flowcept agent support\npip install flowcept[analytics]     # Extra analytics (seaborn, plotly, scipy)\npip install flowcept[dev]           # Developer dependencies (docs, tests, lint, etc.)\n```\n\n### 3. Installing with Common Runtime Bundle\n\n```shell\npip install flowcept[extras]\n```\n\nThe `extras` group is a convenience shortcut that bundles the most common runtime dependencies.  \nIt is intended for users who want a fairly complete, but not maximal, Flowcept environment.\n\nYou might choose `flowcept[extras]` if:\n\n- You want Flowcept to run out-of-the-box with Redis, telemetry, and MongoDB.  \n- You prefer not to install each extra one by one\n\n⚠️ If you only need one of these features, install it individually instead of `extras`.\n\n### 4. Install All Optional Dependencies at Once\n\nFlowcept provides a combined all extra, but installing everything into a single environment is not recommended for users.\nMany of these dependencies are unrelated and should not be mixed in the same runtime. This option is only intended for Flowcept developers who need to test across all adapters and integrations.\n\n```\npip install flowcept[all]\n```\n\n### 5. Installing from Source\nTo install Flowcept from the source repository:\n\n```\ngit clone https://github.com/ORNL/flowcept.git\ncd flowcept\npip install .\n```\n\nYou can then install specific dependencies similarly as above:\n\n```\npip install .[optional_dependency_name]\n```\n\nThis follows the same pattern as step 2, allowing for a customized installation from source.\n\n## Setup\n\nThe [Quickstart](#quickstart) example works with just `pip install flowcept`, no extra setup is required.\n\nFor online queries or distributed capture, Flowcept relies on two optional components:\n\n- **Message Queue (MQ)** — message broker / pub-sub / data stream  \n- **Database (DB)** — persistent storage for historical queries  \n\n---\n\n#### Message Queue (MQ)\n\n- Required for anything beyond Quickstart  \n- Flowcept publishes provenance data to the MQ during workflow runs  \n- Developers can subscribe with custom consumers (see [this example](examples/consumers/simple_consumer.py).  \n- You can monitor or print messages in motion using `flowcept --stream-messages --print`.  \n\nSupported MQs:\n- [Redis](https://redis.io) → **default**, lightweight, works on Linux, macOS, Windows, and HPC (tested on [Frontier](link) and [Summit](link))  \n- [Kafka](https://kafka.apache.org) → for distributed environments or if Kafka is already in your stack  \n- [Mofka](https://mofka.readthedocs.io) → optimized for HPC runs  \n\n---\n\n#### Database (DB)\n\n- **Optional**, but required for:\n  - Persisting provenance beyond MQ memory/disk buffers  \n  - Running complex analytical queries on historical data  \n\nSupported DBs:\n- [MongoDB](https://www.mongodb.com) → default, efficient bulk writes + rich query support  \n- [LMDB](https://lmdb.readthedocs.io) → lightweight, no external service, basic query capabilities  \n\n---\n\n### Notes\n\n- Without a DB:\n  - Provenance remains in the MQ only (persistence not guaranteed)  \n  - Complex historical queries are unavailable  \n- Flowcept’s architecture is modular: other MQs and DBs (graph, relational, etc.) can be added in the future  \n- Deployment examples for MQ and DB are provided in the [deployment](deployment) directory  \n \n\n### Downloading and Starting External Services (MQ or DB)\n\nFlowcept uses external services for message queues (MQ) and databases (DB). You can start them with Docker Compose, plain containers, or directly on your host.\n\n---\n\n#### Using Docker Compose (recommended)\n\nWe provide a [Makefile](deployment/Makefile) with shortcuts:\n\n1. **Redis only (no DB)**: `make services`   (LMDB can be used in this setup as a lightweight DB)\n2. **Redis + MongoDB**: `make services-mongo`\n3. **Kafka + MongoDB**: `make services-kafka`\n4. **Mofka only (no DB)**: `make services-mofka`\n\nTo customize, edit the YAML files in [deployment](deployment/) and run `docker compose -f deployment/\u003ccompose-file\u003e.yml up -d`\n\n---\n\n#### Using Docker (without Compose)\n\nSee the [deployment/](deployment/) compose files for expected images and configurations. You can adapt them to your environment and use standard `docker pull / run / exec` commands.\n\n---\n\n#### Running on the Host (no containers)\n\n1. Install binaries for the service you need:  \n   - **macOS** users can install with [Homebrew](https://brew.sh).  \n     Example for Redis:\n     ```bash\n     brew install redis\n     brew services start redis\n     ```\n\n   - On Linux, use your distro package manager (e.g. `apt`, `dnf`, `yum`) \n   - If non-root (typically the case if you want to deploy these services locally in an HPC system), search for the installed binaries for your OS/hardware architecture, download them in a directory that you have r+w permission, and run them.\n   - On Windows, utilize [WSL](https://learn.microsoft.com/en-us/windows/wsl/install) to use a Linux distro.\n\n2. Start services normally (`redis-server`, `mongod`, `kafka-server-start.sh`, etc.).\n\n## Flowcept Settings File\n\nFlowcept uses a settings file for configuration.\n\n- To create a minimal settings file, run: `flowcept --init-settings` → creates `~/.flowcept/settings.yaml`\n\n- To copy the full sample settings file, run: `flowcept --init-settings --full` → creates `~/.flowcept/settings.yaml`\n\n- To switch runtime mode, apply a profile after creating the file:\n\n```bash\nflowcept --init-settings --full -y\nflowcept --config-profile full-online -y\n```\n\nMeaning:\n\n- `--init-settings` = minimal file with default settings.\n- `--init-settings --full` = copy `resources/sample_settings.yaml`\n- `--config-profile ...` = overlay a runtime mode on top of the existing file\n\n---\n\n#### What You Can Configure\n\n- Message queue and database routes, ports, and paths  \n- MCP agent ports and LLM API keys  \n- Buffer sizes and flush settings  \n- Telemetry capture settings  \n- Instrumentation and PyTorch details  \n- Log levels  \n- Data observability adapters  \n- And more (see [example file](resources/sample_settings.yaml))  \n\n---\n\n#### Custom Settings File\n\nFlowcept looks for its settings in the following order:\n\n1. Environment variable `FLOWCEPT_SETTINGS_PATH` — if set, Flowcept will use this path\n2. `~/.flowcept/settings.yaml` — created by running `flowcept --init-settings`  \n3. [Default sample file](resources/sample_settings.yaml) — used if neither of the above is found\n\nImportant:\n\n- environment variables can override settings values\n- use profiles for mode switches such as `full-online`, `full-offline`, `mq-only`, `mq-only-no-flush`, `full-telemetry`\n- adapter flags are additive:\n\n```bash\nflowcept --init-settings --dask -y\nflowcept --init-settings --mlflow -y\nflowcept --init-settings --tensorboard -y\n```\n\nThey add `adapters.\u003cname\u003e` to the current settings file instead of replacing the whole file.\n\n# Examples\n\n### Adapters and Notebooks\n\n See the [Jupyter Notebooks](notebooks) and [Examples directory](examples) for utilization examples.\n\n## Provenance Cards\n\nThe [Quickstart](#quickstart) example (`python quickstart.py`) shows a provenance card.\n\nFlowcept introduces the Workflow Provenance Card concept: a structured markdown summary of a workflow execution covering:\n\n- **Summary** — workflow name, IDs, execution window, elapsed time, host, git info\n- **Workflow-level Summary** — activity count, status counts, top slowest activities\n- **Workflow Structure** — ASCII diagram of the activity DAG\n- **Timing Report** — per-activity start, end, and median elapsed times with insights\n- **Per Activity Details** — aggregated inputs (`used`) and outputs (`generated`) per activity\n- **Per-activity Resource Usage** — CPU, memory, disk I/O, network, and GPU deltas (when telemetry is captured)\n- **Object Artifacts Summary** — versioned artifacts produced or consumed by the workflow\n\nCards also support **campaign-level reporting** for multi-workflow runs (replicated experiments or multi-stage pipelines):\n\n```python\n# From a JSONL buffer file (no DB needed)\nFlowcept.generate_report(input_jsonl_path=\"flowcept_messages.jsonl\")\n\n# From a live DB query\nFlowcept.generate_report(workflow_id=\"\u003cid\u003e\")\nFlowcept.generate_report(campaign_id=\"\u003cid\u003e\")\n\n# As PDF\nFlowcept.generate_report(workflow_id=\"\u003cid\u003e\", report_type=\"provenance_report\", format=\"pdf\")\n```\n\nSee [`docs/reporting.rst`](docs/reporting.rst) and [`src/flowcept/report/README.md`](src/flowcept/report/README.md) for the full reporting reference.\n\n# Summary: Observability, Instrumentation, MQs, DBs, and Querying\n\n| Category                           | Supported Options                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |\n|------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|\n| **Data Observability Adapters**    | [MLflow](https://github.com/ORNL/flowcept/blob/main/examples/mlflow_example.py), [Dask](https://github.com/ORNL/flowcept/blob/main/examples/dask_example.py), [TensorBoard](https://github.com/ORNL/flowcept/blob/main/examples/tensorboard_example.py)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |\n| **Instrumentation and Decorators** | - [@flowcept](https://github.com/ORNL/flowcept/blob/main/examples/start_here.py): encapsulate a function (e.g., a main function) as a workflow. \u003cbr\u003e - [@flowcept_task](https://github.com/ORNL/flowcept/blob/main/examples/instrumented_simple_example.py): encapsulate a function as a task. \u003cbr\u003e - `@telemetry_flowcept_task`: same as `@flowcept_task`, but optimized for telemetry capture. \u003cbr\u003e - `@lightweight_flowcept_task`: same as `@flowcept_task`, but very lightweight, optimized for HPC workloads \u003cbr\u003e - [Loop](https://github.com/ORNL/flowcept/blob/main/examples/instrumented_loop_example.py) \u003cbr\u003e - [PyTorch Model](https://github.com/ORNL/flowcept/blob/main/examples/llm_complex/llm_model.py) \u003cbr\u003e - [MCP Agent](https://github.com/ORNL/flowcept/blob/main/examples/agents/aec_agent_mock.py) |\n| **Context Manager**                | `with Flowcept():` \u003cbr/\u003e \u0026nbsp;\u0026nbsp;\u0026nbsp;`# Workflow code` \u003cbr/\u003e\u003cbr/\u003eSimilar to the `@flowcept` decorator, but more flexible for instrumenting code blocks that aren’t encapsulated in a single function and for workflows with scattered code across multiple files.                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |\n| **Custom Task Creation**           | `FlowceptTask(activity_id=\u003cid\u003e, used=\u003cinputs\u003e, generated=\u003coutputs\u003e, ...)` \u003cbr/\u003e\u003cbr/\u003eUse for fully customizable task instrumentation. Publishes directly to the MQ either via context management (`with FlowceptTask(...)`) or by calling `send()`. It needs to have a `Flowcept().start()` first (or within a `with Flowcept()` context). See [example](examples/consumers/ping_pong_example.py).                                                                                                                                                                                                                                                                                                                                                                                                                       |\n| **Message Queues (MQ)**            | - **Disabled** (offline mode: provenance events stay in an in-memory buffer, not accessible to external processes) \u003cbr\u003e - [Redis](https://redis.io) → default, lightweight, easy to run anywhere \u003cbr\u003e - [Kafka](https://kafka.apache.org) → for distributed, production setups \u003cbr\u003e - [Mofka](https://mofka.readthedocs.io) → optimized for HPC runs \u003cbr\u003e\u003cbr\u003e _Setup example:_ [docker compose](https://github.com/ORNL/flowcept/blob/main/deployment/compose.yml)                                                                                                                                                                                                                                                                                                                                                      |\n| **Databases**                      | - **Disabled** → Flowcept runs in ephemeral mode (data only in MQ, no persistence) \u003cbr\u003e - **[MongoDB](https://www.mongodb.com)** → default, rich queries and efficient bulk writes \u003cbr\u003e - **[LMDB](https://lmdb.readthedocs.io)** → lightweight, file-based, no external service, basic query support                                                                                                                                                                                                                                                                                                                                                     |\n| **Querying and Monitoring**        | - **[Grafana](deployment/compose-grafana.yml)** → dashboarding via MongoDB connector \u003cbr\u003e - **MCP Flowcept Agent** → LLM-based querying of provenance data                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            | \n| **Custom Consumer**                | You can implement your own consumer to monitor or query the provenance stream in real time. Useful for custom analytics, monitoring, debugging, or to persist the data in a different data model (e.g., graph) . See [example](examples/consumers/simple_consumer.py).                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |\n\n\n## Performance Tuning for Performance Evaluation\n\nIn the settings.yaml file, many variables may impact interception efficiency. \nPlease be mindful of the following parameters:\n\n* `mq`\n    - `buffer_size` and `insertion_buffer_time_secs`. -- `buffer_size: 1` is really bad for performance, but it will give the most up-to-date info possible to the MQ.\n    \n* `log`\n    - set both stream and files to disable\n\n* `telemetry_capture` \n  The more things you enable, the more overhead you'll get. For GPU, you can turn on/off specific metrics.\n\n* `instrumentation`\n  This will configure whether every single granular step in the model training process will be captured. Disable very granular model inspection and try to use more lightweight methods. There are commented instructions in the settings.yaml sample file.\n\nOther thing to consider:\n\n```\nproject:\n  replace_non_json_serializable: false # Here it will assume that all captured data are JSON serializable\n  db_flush_mode: offline               # This disables the feature of runtime analysis in the database.\nmq:\n  chunk_size: -1                       # This disables chunking the messages to be sent to the MQ. Use this only if the main memory of the compute notes is large enough.\n```\n\nOther variables depending on the adapter may impact too. For instance, in Dask, timestamp creation by workers add interception overhead. As we evolve the software, other variables that impact overhead appear and we might not stated them in this README file yet. If you are doing extensive performance evaluation experiments using this software, please reach out to us (e.g., create an issue in the repository) for hints on how to reduce the overhead of our software.\n\n## Install AMD GPU Lib\n\nOnly needed for AMD GPU telemetry capture. NVIDIA users use `flowcept[nvidia]` instead.\n\n**Quick install:**\n```bash\npip install flowcept[amd]\n```\n\nThis installs the latest `amdsmi` from PyPI. The `amdsmi` Python package is a thin wrapper around the system's `libamd_smi.so`, so the PyPI version must match your ROCm installation. If you get a runtime error like `undefined symbol` or `libamd_smi.so not found`, follow the steps below.\n\n**Matching the version to your ROCm:**\n\n1. Find your ROCm version:\n   ```bash\n   ls /opt/rocm-*   # e.g. /opt/rocm-6.2.4\n   # or: rocm-smi --version\n   ```\n\n2. Find the matching `amdsmi` PyPI version — the major/minor version tracks ROCm (e.g. ROCm 6.2.x → `amdsmi==6.2.*`, ROCm 7.0.x → `amdsmi==7.0.*`):\n   ```bash\n   pip index versions amdsmi   # lists all available versions\n   pip install amdsmi==\u003cX.Y.Z\u003e\n   ```\n\n3. Set `LD_LIBRARY_PATH` so Python finds the correct shared library:\n   ```bash\n   export LD_LIBRARY_PATH=/opt/rocm-\u003cX.Y.Z\u003e/lib:$LD_LIBRARY_PATH\n   ```\n   Add this to your job script or shell profile so it persists.\n\n**Verify:**\n```bash\npython -c \"from amdsmi import amdsmi_init, amdsmi_get_processor_handles; amdsmi_init(); print(len(amdsmi_get_processor_handles()), 'GPU(s) found')\"\n```\n\n## Torch Dependencies\n\nSome unit tests utilize `torch==2.2.2`, `torchtext=0.17.2`, and `torchvision==0.17.2`. They are only really needed to run some tests and will be installed if you run `pip install flowcept[ml_dev]` or `pip install flowcept[all]`. If you want to use Flowcept with Torch, please adapt torch dependencies according to your project's dependencies.\n\n## Documentation\n\nFull documentation is available on [Read the Docs](https://flowcept.readthedocs.io/).\n\n## Cite us\n\nIf you used Flowcept in your research, consider citing our paper.\n\n```\nTowards Lightweight Data Integration using Multi-workflow Provenance and Data Observability\nR. Souza, T. Skluzacek, S. Wilkinson, M. Ziatdinov, and R. da Silva\n19th IEEE International Conference on e-Science, 2023.\n```\n\n**Bibtex:**\n\n```latex\n@inproceedings{souza2023towards,  \n  author = {Souza, Renan and Skluzacek, Tyler J and Wilkinson, Sean R and Ziatdinov, Maxim and da Silva, Rafael Ferreira},\n  booktitle = {IEEE International Conference on e-Science},\n  doi = {10.1109/e-Science58273.2023.10254822},\n  link = {https://doi.org/10.1109/e-Science58273.2023.10254822},\n  pdf = {https://arxiv.org/pdf/2308.09004.pdf},\n  title = {Towards Lightweight Data Integration using Multi-workflow Provenance and Data Observability},\n  year = {2023}\n}\n```\n\n## Disclaimer \u0026 Get in Touch\n\nRefer to [Contributing](CONTRIBUTING.md) for adding new adapters or contributing with the codebase.\n\nPlease note that this a research software. We encourage you to give it a try and use it with your own stack.\nWe are continuously working on improving documentation and adding more examples and notebooks, but we are continuously improving documentation and examples. If you are interested in working with Flowcept in your own scientific project, we can give you a jump start if you reach out to us. Feel free to [create an issue](https://github.com/ORNL/flowcept/issues/new), [create a new discussion thread](https://github.com/ORNL/flowcept/discussions/new/choose) or drop us an email (we trust you'll find a way to reach out to us :wink:).\n\n## Acknowledgement\n\nThis research uses resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fornl%2Fflowcept","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fornl%2Fflowcept","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fornl%2Fflowcept/lists"}