Run the Demo
Run HELM's governed company demo — 15 receipts, 7 phases, full EvidencePack in 2 minutes.
Run the Demo: technical reference for this page.
Run the Demo
See HELM in action with a realistic multi-phase scenario: an AI agent manages a company, hits policy boundaries, and produces a verifiable proof trail.
What you'll see
The demo runs 7 phases that exercise different enforcement properties:
| Phase | What happens | Receipt type |
|---|---|---|
| 1. Approval | Agent requests budget approval → timelock ceremony | ALLOW |
| 2. Tool execution | Agent calls approved tools | ALLOW |
| 3. Policy deny | Agent tries a forbidden action | DENY |
| 4. Sandbox | WASI-sandboxed compute with gas limits | ALLOW |
| 5. Skill gap | Agent encounters an unregistered tool | DENY |
| 6. Incident | Agent triggers incident response flow | ALLOW |
| 7. Export | EvidencePack exported and verified | — |
Result: 15 receipts forming a complete ProofGraph, plus a shareable HTML proof report.
Run it
# 1. Onboard (if you haven't already)
helm onboard --yes
# 2. Run the company demo with mock provider (no API key needed)
helm demo company --template starter --provider mock
# 3. Export the EvidencePack
helm export --evidence ./data/evidence --out evidence.tar
# 4. Verify offline
helm verify --bundle evidence.tar
Expected output:
verification: PASS
Explore the results
# List all receipts from the demo
helm pack list
# View incident history
helm incident list
# Get a daily brief
helm brief daily
Open the Proof Report
After running the demo, open the generated HTML report:
open data/evidence/run-report.html
The report shows:
- Causal chain visualization (which receipts link to which)
- Receipt details (tool name, args hash, verdict, reason code)
- Verification status per receipt
- One-click sharing
Run the full use-case suite
For a comprehensive test of all 12 enforcement properties:
make crucible
This runs UC-001 through UC-012:
| UC | Tests |
|---|---|
| UC-001 | PEP allows valid tool call |
| UC-002 | PEP denies schema mismatch |
| UC-003 | Approval ceremony (timelock + challenge) |
| UC-004 | WASI sandbox executes transform |
| UC-005 | WASI traps on gas/time/memory exhaustion |
| UC-006 | Idempotency (receipt-based dedup) |
| UC-007 | EvidencePack export |
| UC-008 | Replay verify |
| UC-009 | Output schema drift → hard error |
| UC-010 | Trust key rotation + replay |
| UC-011 | Island mode (build offline) |
| UC-012 | Conformance L1 + L2 gates |
Next steps
- Quickstart — the minimal 5-minute path
- MCP Integration — add HELM to Claude, Cursor, VS Code
- Conformance Levels — what L1/L2/L3 guarantee