Monitor every run. Host agents without infrastructure. Get alerted when things break. Ship AI agents with the confidence of a production system.
Works with any language or runtime — Python, Node.js, Go, or plain curl
How it works
Host a new agent entirely in AgentOS, or drop the SDK into your existing code. Both give you full observability.
No agent yet
Define your system prompt, tools, and model in the dashboard. AgentOS handles the LLM calls, tool execution, and the full run loop. You invoke it with a single HTTP call and get back the output.
// Your entire agent is one HTTP call fetch(`…/agents/${AGENT_ID}/invoke`, { method: "POST", headers: { "Authorization": `Bearer ${API_KEY}` }, body: JSON.stringify({ input }), });
Already have an agent
Your agent code stays exactly where it is — your server, Lambda, Colab, anywhere. Add 3 SDK lines to stream every run, event, and output to AgentOS. We record it, you watch it.
// Your existing agent — 3 lines added const run = await aos.startRun({ input }); /* ... your existing agent logic ... */ await run.complete({ output }); // streamed to AgentOS
Your code → AgentOS → full observability
Features
Every event, tool call, and LLM response captured with millisecond precision. Live run timeline. Never debug blind again.
Define system prompts, tools, and models. Run agents directly from AgentOS — no infra, no cold starts, no ops.
Gate any tool call behind a human approval. The agent pauses, notifies your team, and waits — no run gets through without sign-off.
Set conditions on error rate, duration, silence, or custom events. Get notified via in-app, email, or webhook before your users notice.
Daily and weekly AI-generated summaries of agent performance. Understand patterns across thousands of runs at a glance.
Connect Claude Desktop, Cursor, or any MCP client to your workspace. Manage agents, runs, and reports from inside your IDE.
Instrument any agent in two lines. startRun(), emit(), endRun() — full observability from your existing code, any runtime.
POST events from any language or runtime via the ingest API. Receive webhook notifications when runs complete, fail, or need attention.
Install pre-built agents in one click. Share your own templates with the community. Skip the blank page.
Install in seconds. Works with TypeScript, Python, or any HTTP client.
Pricing
No credit card required to get started.
For solo developers and side projects.
For companies running agents together.
or $200/seat/year billed annually — 2 months free
Start with TeamFor companies running agents at scale.
FAQ
No. If you have an existing agent, add our SDK (npm or pip) and wrap your code with startRun/complete. Takes about 10 minutes. Your agent keeps running wherever it is — AgentOS just observes it.
Non-hosted: your code runs the agent, AgentOS observes it. Hosted: you define the prompt and tools inside AgentOS and we run it for you. Both give you the same dashboard, alerts, and reports.
Yes. Run data is scoped to your workspace and only accessible by workspace members. We do not use your run data to train models.
For hosted agents, AgentOS supports Anthropic and OpenAI models. For non-hosted agents, your code picks the model — AgentOS doesn't care, it just observes the run.
Runs that exceed the monthly limit are dropped. You'll get a warning notification before you hit the cap. Upgrading to Team removes the limit.
Free forever for solo developers. No credit card, no lock-in.