BLOG

Writing about agent systems

Deep dives on building reliable tool calling: just-in-time auth, sandboxes, observability, and product decisions that make agents feel trustworthy.

Engineering•Feb 12, 2026•11 min read

Featured

Agent tools that scale: a practical checklist

A field guide to shipping dependable tool calling in production, from scoping and observability to rollback paths and user trust.

Read post →

Security•Feb 02, 2026

10 min read

Just-in-time auth: reducing friction without losing control

Why deferred authorization can make agent experiences feel smoother while still preserving clear approvals, short-lived access, and policy control.

Read post →

Product•Jan 22, 2026

11 min read

Designing tool interfaces your agent can’t misuse

Tool schemas shape behavior more than prompt wording does. Here is how to design interfaces that are narrow, legible, and resilient under real usage.

Read post →

Engineering•Jan 10, 2026

10 min read

Observability for agents: what to log (and what not to)

A practical logging model for tracing agent behavior end to end without turning your observability stack into a liability.

Read post →

Security•Dec 18, 2025

11 min read

Sandboxed execution: why it matters for “run code” tools

If your agent can execute code, isolation is not optional. Here is what a practical sandbox needs before it is ready for real users.

Read post →

Product•Dec 05, 2025

10 min read

Prompting vs product: where reliability really comes from

Prompts matter, but durable reliability comes from contracts, workflow design, and product choices users can actually feel.

Read post →

Security•Nov 22, 2025

10 min read

Building user trust in agent actions

Trust in agent products comes from previews, receipts, and reversible workflows—not from asking users to accept mysterious automation on faith.

Read post →

Engineering•Nov 05, 2025

11 min read

How to evaluate an agent: metrics that actually predict success

A practical framework for measuring agent quality with metrics that correlate to user outcomes, workflow reliability, and operational cost.

Read post →

Engineering•Oct 18, 2025

10 min read

From prototype to production: hardening your first agent workflow

A practical path from a promising demo to a dependable workflow, with better validation, timeouts, state handling, and user-visible recovery.

Read post →

Platform•Sep 29, 2025

10 min read

Connecting 1,000+ apps: the hidden costs

At integration scale, the hard part is not OAuth. It is lifecycle management, schema drift, support load, and the long tail of provider behavior.

Read post →

Security•Sep 12, 2025

10 min read

Scoping permissions: least privilege for agent actions

A pragmatic guide to least privilege when an agent acts on a user’s behalf across tools, workflows, and shared business systems.

Read post →

Platform•Aug 25, 2025

10 min read

Building a tool registry: taxonomy, ownership, and change control

A tool registry is not just a catalog. It is the governance layer that keeps a growing tool surface understandable, owned, and safe to evolve.

Read post →