https://github.com/observicia/observicia

Cloud Native Observability and Policy Engine for LLM Applications

Host: GitHub
URL: https://github.com/observicia/observicia
Owner: Observicia
License: apache-2.0
Created: 2024-12-02T17:29:02.000Z (10 months ago)
Default Branch: main
Last Pushed: 2025-02-10T19:30:32.000Z (8 months ago)
Last Synced: 2025-07-30T20:40:28.137Z (2 months ago)
Topics: agentic-ai, chatbot, cloud-native, jaeger, kubernetes, llm, microservice, observability, open-policy-agent, openai-api, opentelemetry, policy-engine, python, retrieval-augmented-generation, watsonx-ai
Language: Python
Homepage:
Size: 1.28 MB
Stars: 7
Watchers: 1
Forks: 1
Open Issues: 19
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE


# Observicia SDK

Observicia is a cloud-native observability and policy control SDK for LLM applications. It integrates with the CNCF observability stack and provides token tracking, policy enforcement, and PII protection.

[![Documentation](https://img.shields.io/badge/docs-latest-brightgreen.svg?style=flat)](https://observicia.readthedocs.io/en/latest/)

[![License](https://img.shields.io/badge/license-Apache%202.0-blue.svg)](LICENSE)

[![OpenTelemetry](https://img.shields.io/badge/OpenTelemetry-enabled-blue)](https://opentelemetry.io/)

[![OPA](https://img.shields.io/badge/OPA-integrated-blue)](https://www.openpolicyagent.org/)

[![PyPI version](https://badge.fury.io/py/observicia.svg)](https://badge.fury.io/py/observicia)

[![PyPI Status](https://img.shields.io/pypi/status/observicia.svg)](https://pypi.python.org/pypi/observicia/)

[![OpenAI](https://img.shields.io/badge/OpenAI-Supported-74aa9c)](https://openai.com)

[![WatsonX](https://img.shields.io/badge/WatsonX-Supported-be95ff)](https://www.ibm.com/watsonx)

[![LangChain](https://img.shields.io/badge/LangChain-Supported-2dd4bf)](https://langchain.com)

[![Redis](https://img.shields.io/badge/Redis-Supported-dc382d)](https://redis.io)

[![SQLite](https://img.shields.io/badge/SQLite-Supported-003b57)](https://www.sqlite.org)

[![Grafana](https://img.shields.io/badge/Grafana-Dashboard-f46800)](https://grafana.com)

## Features

- **Token Tracking and Management**

  - Real-time token usage monitoring across providers

  - Stream-aware token counting

  - Token usage retention and cleanup

  - Per-session token tracking

  - Configurable data retention policies

- **LLM Backend Support**

  - OpenAI

    - Chat completions (sync/async)

    - Text completions (sync/async)

    - Embeddings

    - Image generation

    - File operations

    - Streaming support

  - Ollama

    - Local model deployment

    - Chat completions

    - Text generation

    - Embeddings

    - Streaming support

  - WatsonX

    - Foundation models integration

    - Text generation

    - Chat completions

    - Parameter controls

  - Basic scaffolding for:

    - Anthropic

    - LiteLLM

- **Transaction Tracking**

  - Multi-round conversation tracking

  - Transaction lifecycle management

  - Metadata and state tracking

  - Parent-child transaction relationships

  - Transaction performance metrics

- **Chat Logging and Analytics**

  - Structured chat history logging

  - Conversation flow analysis

  - Interaction metrics

  - Policy compliance logging

  - Chat completion tracking

- **Telemetry Storage and Export**

  - SQLite exporter for persistent telemetry storage

    - Structured schema for token usage and metrics

    - Transaction and trace correlation

    - Query-friendly format for analytics

  - Redis exporter with configurable retention

    - Time-based data retention policies

    - Real-time metrics access

    - Distributed telemetry storage

  - OpenTelemetry integration

    - Standard OTLP export support

    - Custom attribute mapping

    - Span context preservation

- **Policy Enforcement**

  - Integration with Open Policy Agent (OPA)

  - Support for multiple policy evaluation levels

  - Risk level assessment (low, medium, high, critical)

  - Custom policy definition support

  - Synchronous and asynchronous policy evaluation

- **Framework Integration**

  - LangChain support

    - Conversation chain monitoring

    - Chain metrics

    - Token usage across abstractions

- **Observability Features**

  - OpenTelemetry integration

  - Span-based tracing for all LLM operations

  - Configurable logging (console, file, OTLP)

  - Mermaid diagram generation from telemetry data

  - Detailed request/response tracing

  - Custom attribute tracking
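
The "query-friendly format" of the SQLite exporter lends itself to ad hoc analytics. The following is a minimal sketch of what such a query might look like. The table name `token_usage` and all of its columns are hypothetical, invented for illustration; they are not the SDK's actual schema.

```python
import sqlite3

# Hypothetical schema for illustration only; the real exporter's
# table and column names may differ.
SCHEMA = """
CREATE TABLE token_usage (
    user_id           TEXT NOT NULL,
    transaction_id    TEXT NOT NULL,
    provider          TEXT NOT NULL,
    prompt_tokens     INTEGER NOT NULL,
    completion_tokens INTEGER NOT NULL
)
"""


def total_tokens_by_user(conn: sqlite3.Connection) -> dict[str, int]:
    """Sum prompt and completion tokens per user."""
    rows = conn.execute(
        "SELECT user_id, SUM(prompt_tokens + completion_tokens) "
        "FROM token_usage GROUP BY user_id ORDER BY user_id"
    )
    return {user_id: total for user_id, total in rows}


# In-memory database with made-up sample rows.
conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
conn.executemany(
    "INSERT INTO token_usage VALUES (?, ?, ?, ?, ?)",
    [
        ("user123", "tx-1", "openai", 12, 30),
        ("user123", "tx-2", "ollama", 8, 20),
        ("user456", "tx-3", "openai", 5, 5),
    ],
)
totals = total_tokens_by_user(conn)
```

Against a real telemetry database, the same aggregation pattern would apply once the query is adapted to the schema the exporter actually creates.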

## Quick Start

1. Install the SDK:

```bash
pip install observicia
```

2. Create a configuration file (`observicia_config.yaml`):

```yaml
service_name: my-service
otel_endpoint: http://localhost:4317
opa_endpoint: http://localhost:8181/
policies:
  - name: pii_check
    path: policies/pii
    description: Check for PII in responses
    required_trace_level: enhanced
    risk_level: high
logging:
  file: "app.json"
  telemetry:
    enabled: true
    format: "json"
    redis:
      enabled: true
      host: "localhost"
      port: 6379
      db: 0
      key_prefix: "observicia:telemetry:"
      retention_hours: 24
  messages:
    enabled: true
    level: "INFO"
  chat:
    enabled: true
    level: "both"
    file: "chat.log"
```

3. Initialize in your code:

```python
from observicia import init
from observicia.core.context_manager import ObservabilityContext

# Required - Initialize Observicia
init()

# Optional - Set user ID for tracking
ObservabilityContext.set_user_id("user123")

# Optional - Start a conversation transaction
transaction_id = ObservabilityContext.start_transaction(
    metadata={"conversation_type": "chat"}
)

# Use with OpenAI
from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello!"}]
)

# Or use with Ollama
import ollama

response = ollama.chat(
    model="llama2",
    messages=[{"role": "user", "content": "Hello!"}]
)

# Optional - End the transaction
ObservabilityContext.end_transaction(
    transaction_id,
    metadata={"resolution": "completed"}
)
```
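
If an LLM call raises between `start_transaction` and `end_transaction`, the transaction in the example above is never closed. One way to guard against that is a small context manager, sketched below. It uses only the two call shapes shown in the Quick Start; the `transaction` helper, the `RecordingContext` stand-in, and the `"failed"` resolution value are hypothetical and not part of the SDK.

```python
from contextlib import contextmanager


@contextmanager
def transaction(ctx, **metadata):
    """Start a transaction on `ctx` and always end it, even on error."""
    transaction_id = ctx.start_transaction(metadata=metadata)
    resolution = "completed"
    try:
        yield transaction_id
    except Exception:
        resolution = "failed"  # hypothetical value, not defined by the SDK
        raise
    finally:
        ctx.end_transaction(transaction_id, metadata={"resolution": resolution})


class RecordingContext:
    """Hypothetical stand-in for ObservabilityContext so the sketch runs alone."""

    def __init__(self):
        self.events = []

    def start_transaction(self, metadata=None):
        self.events.append(("start", metadata))
        return "tx-1"

    def end_transaction(self, transaction_id, metadata=None):
        self.events.append(("end", transaction_id, metadata))


ctx = RecordingContext()
with transaction(ctx, conversation_type="chat") as tx_id:
    pass  # LLM calls would go here
```

Assuming the methods are callable on the class as in the Quick Start, passing `ObservabilityContext` itself in place of `ctx` should behave the same way.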

## Architecture

```mermaid
flowchart TB
    App[Application] --> SDK[Observicia SDK]

    subgraph LLM Backends
        OpenAI[OpenAI API]
        Ollama[Ollama Local]
        Anthropic[Anthropic API]
        LiteLLM[LiteLLM]
        WatsonX[WatsonX]
    end

    SDK --> OpenAI
    SDK --> Ollama
    SDK --> Anthropic
    SDK --> LiteLLM
    SDK --> WatsonX
    SDK --> OPA[Open Policy Agent]
    SDK --> OTEL[OpenTelemetry Collector]
    SDK --> SQLite[(SQLite)]
    SDK --> Redis[(Redis)]

    OTEL --> Jaeger[Jaeger]
    OTEL --> Prom[Prometheus]
    OPA --> PII[PII Detection Service]
    OPA --> Compliance[Prompt Compliance Service]

    subgraph Telemetry Storage
        SQLite
        Redis
    end

    style OpenAI fill:#85e,color:#fff
    style Ollama fill:#85e,color:#fff
    style WatsonX fill:#85e,color:#fff
    style Anthropic fill:#ccc,color:#666
    style LiteLLM fill:#ccc,color:#666
```

## Example Applications

The SDK includes five example applications demonstrating different use cases:

1. **Simple Chat Application** ([examples/simple-chat](examples/simple-chat))

   - Basic chat interface using OpenAI

   - Demonstrates token tracking and tracing

   - Shows streaming response handling

   - Includes transaction management

2. **RAG Application** ([examples/rag-app](examples/rag-app))

   - Retrieval-Augmented Generation example

   - Shows policy enforcement for PII protection

   - Demonstrates context tracking

   - Includes secure document retrieval

3. **LangChain Chat** ([examples/langchain-chat](examples/langchain-chat))

   - Integration with LangChain framework

   - Shows conversation chain tracking

   - Token tracking across abstractions

4. **WatsonX Generation** ([examples/watsonx-generate](examples/watsonx-generate))

   - Integration with IBM WatsonX.ai Foundation Models

   - Demonstrates model inference with parameters

   - Shows token tracking for WatsonX models

   - Includes chat and generation examples

   - Policy enforcement for enterprise use cases

5. **Ollama Generation** ([examples/ollama-generate](examples/ollama-generate))

   - Integration with local Ollama models

   - Shows local model deployment monitoring

   - Demonstrates both chat and generation modes

   - Includes embedding tracking

   - Token usage tracking for local models

   - Support for multiple model formats

## Deployment

### Prerequisites

- Kubernetes cluster with:

  - OpenTelemetry Collector

  - Open Policy Agent

  - Jaeger (optional)

  - Prometheus (optional)

### Example Kubernetes Deployment

See the [deploy/k8s](deploy/k8s) directory for complete deployment manifests.

## Core Components

- **Context Manager**: Manages trace context, transactions and session tracking

- **Policy Engine**: Handles policy evaluation and enforcement

- **Token Tracker**: Monitors token usage across providers

- **Patch Manager**: Manages LLM provider SDK instrumentation

- **Tracing Manager**: Handles OpenTelemetry integration
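
To make the Policy Engine's role more concrete, here is a rough sketch of how a policy check could be sent to OPA through its standard Data API (`POST /v1/data/<path>` with an `input` document). The endpoint and policy path are taken from the Quick Start configuration. How the SDK maps `path` to an OPA URL internally is an assumption here, and the `build_opa_request` helper and the shape of the input payload are hypothetical.

```python
import json
from urllib.parse import urljoin
from urllib.request import Request


def build_opa_request(opa_endpoint: str, policy_path: str, payload: dict) -> Request:
    """Build a POST request for OPA's Data API: /v1/data/<policy path>."""
    base = opa_endpoint.rstrip("/") + "/"
    url = urljoin(base, "v1/data/" + policy_path.strip("/"))
    body = json.dumps({"input": payload}).encode("utf-8")
    return Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_opa_request(
    "http://localhost:8181/",
    "policies/pii",
    {"text": "Contact me at jane@example.com"},  # hypothetical input shape
)

# Sending the request needs a running OPA server, so it is left commented out:
#   from urllib.request import urlopen
#   with urlopen(request) as resp:
#       decision = json.load(resp).get("result")
```

OPA returns the policy decision under the `result` key of the JSON response; what that decision contains depends entirely on the Rego policy loaded at the given path.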

## Token Usage Visualization

The SDK includes sample tools to visualize token usage metrics through Grafana dashboards.

[![Token Usage Dashboard](https://img.youtube.com/vi/IkYBNWHVIXQ/maxresdefault.jpg)](https://www.youtube.com/watch?v=IkYBNWHVIXQ)

## Development Status

- ✅ Core Framework

- ✅ OpenAI Integration

- ✅ Basic Policy Engine

- ✅ Token Tracking

- ✅ OpenTelemetry Integration

- ✅ Transaction Management

- ✅ Chat Logging

- ✅ LangChain Support

- 🚧 Additional Provider Support

- 🚧 Advanced Policy Features

- 🚧 UI Components

## License

This project is licensed under the Apache License 2.0 - see the [LICENSE](LICENSE) file for details.
