https://github.com/lydakis/sandfs
Composable virtual filesystem sandbox with a UNIX-like command surface for agents.
https://github.com/lydakis/sandfs
Last synced: 6 months ago
JSON representation
Composable virtual filesystem sandbox with a UNIX-like command surface for agents.
- Host: GitHub
- URL: https://github.com/lydakis/sandfs
- Owner: lydakis
- License: mit
- Created: 2025-11-16T16:20:05.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2025-11-24T05:41:06.000Z (7 months ago)
- Last Synced: 2025-12-28T08:59:59.548Z (6 months ago)
- Language: Python
- Homepage:
- Size: 252 KB
- Stars: 3
- Watchers: 0
- Forks: 0
- Open Issues: 6
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# sandfs
`sandfs` is an experimental Python package that implements an entirely virtual filesystem (VFS) that can be embedded inside AI agent tooling. Each sandbox keeps its own private directory tree, supports dynamic nodes that are hydrated on demand, and exposes a small UNIX-like command surface (`cd`, `ls`, `cat`, `grep`, `rg`, etc.) so agents can reuse familiar workflows.
> **Status**: design prototype. APIs are subject to change.
## Why?
Agent builders frequently need a scratch filesystem for planning, iterating on generated code, and testing hypotheses. Shipping a simple `exec` tool that proxies every bash command to the host disk is unsafe. `sandfs` keeps those operations inside a controlled, in-memory sandbox while still feeling like a mini Linux environment.
## Features
- In-memory directories and files that never touch the host disk unless exported.
- Dynamic nodes backed by callables (e.g., query a DB, fetch from an API, or generate text on the fly).
- Pure-Python shell/executor that understands a handful of GNU-style commands and can be extended with your own.
- Optional Python execution helper that evaluates snippets against the sandbox state only.
- Bridge to real GNU utilities via `host -p ` which materializes the subtree and runs the host binary against it.
- Node policies (read-only, append-only, visibility labels) plus shell views so different agents see only the nodes they are allowed to.
- Write hooks and optimistic versions so hosts can flush files to external stores with conflict detection.
- Storage adapters and snapshots so you can mount external state and roll forward/back within a turn.
- Serialization helpers to snapshot or hydrate sandboxes (planned).
See [`docs/roadmap.md`](docs/roadmap.md) for the up-to-date view of supported commands and the upcoming backlog.
## Installation
```bash
pip install git+https://github.com/lydakis/sandfs.git
# Once a PyPI release is available: pip install sandfs
```
## Quickstart
```python
from sandfs import VirtualFileSystem, SandboxShell
vfs = VirtualFileSystem()
vfs.write_file("/notes/todo.txt", "- build VFS\n- add rg support\n")
shell = SandboxShell(vfs)
print(shell.exec("ls /notes").stdout)
print(shell.exec("rg 'VFS' /notes").stdout)
```
### Agent shell usage
```python
from sandfs import ProvidedNode, SandboxShell, VirtualFileSystem
vfs = VirtualFileSystem()
vfs.write_file("/workspace/app.py", "print('hello')\n")
def logs_provider(ctx):
return {"latest.log": ProvidedNode.file(content="generated on demand")}
vfs.mount_directory("/workspace/logs", logs_provider)
shell = SandboxShell(vfs)
shell.exec("cd /workspace")
print(shell.exec("ls").stdout)
print(shell.exec("tree").stdout)
```
### Running host GNU tools
`SandboxShell` exposes `host` to materialize a subtree onto the host disk and invoke any installed command:
```
host -p /workspace grep -n TODO app.py
```
The example above exports `/workspace` into a temporary directory, runs the system `grep` inside it, then discards the files.
> Tip: unknown commands automatically fall back to the host runner, so `echo`, `python3`, or `bash -lc '…'` work even if they are not native sandfs commands.
### Policies & views
```python
vfs.set_policy(
"/blue/identity/persona.md",
NodePolicy(writable=False, classification="private", principals={"alice", "bob"}),
)
shell = SandboxShell(vfs, view=VisibilityView(classifications={"public"}, principals={"alice"}))
```
### Agent shell mode
```python
shell = SandboxShell(
vfs,
allowed_commands={"ls", "cat", "rg"},
max_output_bytes=8_192,
)
```
Commands outside `allowed_commands` now return an error immediately, and outputs larger than `max_output_bytes` are rejected so partner-facing agents cannot dump oversized payloads.
### Storage adapters
```python
from sandfs import MemoryStorageAdapter, VirtualFileSystem
adapter = MemoryStorageAdapter(initial={"a.txt": "seed"})
vfs = VirtualFileSystem()
vfs.mount_storage("/data", adapter)
print(vfs.read_file("/data/a.txt"))
vfs.write_file("/data/a.txt", "update")
vfs.sync_storage("/data") # refresh from adapter if it changed externally
```
`mount_storage` keeps the virtual tree synchronized with the adapter, while `sync_storage` refreshes the VFS from the latest adapter contents.
### Snapshots
```python
snap = vfs.snapshot()
# run risky operations ...
vfs.restore(snap)
```
Snapshots capture the entire tree (including storage-backed nodes) so you can checkpoint before an agent action and roll back on failure.
## Repository layout
```
.
├── docs/ # design/vision notes
├── sandfs/ # library sources
└── tests/ # runtime smoke tests
```
## Local development
```
python -m venv .venv
source .venv/bin/activate
uv pip install -e .[dev]
pre-commit install
uv run pytest
uv run ruff check .
uv run mypy sandfs
```
### Persistence hooks
Register a write hook to flush files into your own store and use optimistic versions to avoid clobbering concurrent updates:
```python
from sandfs import VirtualFileSystem
from sandfs.hooks import WriteEvent
vfs = VirtualFileSystem()
def flush(event: WriteEvent) -> None:
save_to_db(event.path, event.content, event.version)
vfs.register_write_hook("/blue/work", flush)
vfs.write_file("/blue/work/note.md", "draft")
vfs.write_file("/blue/work/note.md", "final", expected_version=1)
```
### Integration hooks
```python
from sandfs.integrations import InboxRecorder
recorder = InboxRecorder()
recorder.attach(vfs, "/blue/inbox")
```
`PathEvent` (used under the hood) carries the path, event type (`create`, `update`, `delete`), and latest content so you can fan writes into staging queues or audit logs.
## License
MIT
## Further reading
- [Using sandfs with Blue](docs/blue.md)
- [Publishing sandfs](docs/publishing.md)