Step 3 broke. You'll find out tomorrow.
A 4-agent pipeline fails at step 3. You need the exact handoff that broke — not a 24-hour grep across four services.
One import. The entire platform — stdlib, isolated runtime, full observability, and security.
One import drops in the full platform — stdlib primitives, isolated process execution, span capture, and security scanning. No new infrastructure. Works with your current stack.
Typed memory and tool primitives. Bounded process isolation. Full span trees with cost per call. Eval scores against your goldens. PII and prompt-injection scanning — all wired together.
Set SLOs on eval scores and latency. Runtime policy enforces tool allowlists. Cryptographic trace seals for audit. Replay any failure exactly. Ship knowing the whole stack is solid.
from agentos import trace, memory, tools # One import. Your existing code, unchanged.@trace(project="support-agent", env="production")async def run_agent(query: str) -> str: ctx = await memory.recall(query) # stdlib primitive response = await llm.complete(system_prompt, query, ctx) return response # AgentOS wires the full platform on every run:# ✓ Stdlib — memory, tools, retries as typed primitives# ✓ Isolated process — bounded memory, mediated syscalls# ✓ Full span tree — every tool call, every LLM hop, cost# ✓ Security — PII scan, prompt injection, policy enforcement# ✓ Eval score vs. golden dataset — graded on every deploy
Every build artifact, runtime event, trace, eval score, and security event lands in one PostgreSQL table — with the same identity columns. Cross-cutting questions become one SQL query.