Briefing
Getting an AI agent to run on your laptop is the easy part. The hard part starts the moment you want it to run somewhere other people depend on. That is when reliability, security, observability and plain old maintenance stop being abstract and start being the thing that wakes you at 3am.
There is a quiet shift happening in how teams ship agents. A year ago the question was "can we build one". In June 2026 the question is "where does it live, who can call it, and what happens when it falls over". The tooling has caught up enough that you now have real choices, from an app that never leaves your machine to a multi-region setup with failover. Each choice trades cost against control against effort, and picking the wrong one is expensive in a way that is hard to undo later.
This piece walks through six deployment patterns that hold up in practice, what each is good for, and where each one bites. Then it covers the checklist and the CI/CD wiring that separate a demo from something you can put your name on.
Pattern 1: Local-First (OpenHuman Model)
The agent runs entirely on the developer's machine. No server, no cloud, no deployment to speak of. OpenHuman from TinyHumans AI is the clearest example: an open-source desktop agent packaged as a native Tauri app, where everything happens locally by default.
Best for: personal productivity, privacy-sensitive work, individual developers. Setup: install the Tauri app (macOS DMG or Windows EXE) and configure your integrations. Pros: maximum privacy, no network latency, full control. Cons: no team sharing, it lives and dies with the machine, and there is no high availability.
Pattern 2: VPS Self-Hosted (Hermes Model)
The agent runs on a virtual private server, so you can reach it from anywhere. Hermes Agent installs with pip and pairs with Honcho for persistent memory across sessions.
# VPS setup (Hetzner, DigitalOcean, etc.)
# 2 vCPU, 4GB RAM, ~$5/month
sudo apt update && sudo apt install -y python3.11 python3-pip docker
pip install hermes-agent
hermes init --with-honcho
hermes start --daemonOne caveat on the price. That ~$5/month figure holds for Hetzner-class providers (a Hetzner CX23 with 2 vCPU and 4GB RAM was about $4.59/month as of June 2026, per Hetzner Cloud pricing). The same 2 vCPU / 4GB spec on DigitalOcean runs closer to $24/month, so do not assume the cheap number applies everywhere. The exact --with-honcho flag is consistent with how the tools fit together but is not confirmed verbatim in the docs, so treat the command above as a working sketch rather than gospel. The Hermes plus Honcho integration docs are the source to check.
Best for: small teams, cost-sensitive organisations, privacy-conscious deployments. Pros: cheap, controllable, works with any VPS provider. Cons: you manage it yourself, it is a single point of failure, and updates are manual.
Pattern 3: Managed Service (OpenClaw Model)
DigitalOcean's managed OpenClaw service handles deployment, updates and scaling for you.
# DigitalOcean managed OpenClaw
doctl apps create --spec openclaw.yaml
# $24/month, includes automatic updates and monitoringThe price is in the right neighbourhood but worth pinning down: the minimum for stable single-agent operation on DigitalOcean's App Platform is the apps-s-1vcpu-2gb tier at $25/month, while a 1-Click Droplet starts at $12/month. The doctl apps create --spec pattern is the standard App Platform deploy command.
Best for: teams that want managed infrastructure without vendor lock-in. Pros: no maintenance, automatic updates, monitoring built in. Cons: higher cost, less room to customise, and you depend on the provider.
Pattern 4: Cloud-Native (Google Agents CLI Model)
Deploy agents to managed cloud infrastructure that scales on its own. Google's Agents CLI, announced in April 2026, deploys to managed environments such as Cloud Run with automatic scaling, IAM integration and observability.
# Google's Agents CLI
gcloud agents init billing-agent --template=python
gcloud agents deploy billing-agent --region=us-central1
# Pay per invocation, automatic scalingA correction on that snippet. The capability is real, but the command syntax above is not. Google's tool is the standalone agents-cli, used like agents-cli deploy --project ... --region us-east1, rather than a gcloud agents subcommand. If you are wiring this up for real, follow the official CLI, not the lines shown here.
Best for: enterprise teams already on Google Cloud, variable workloads, event-driven agents. Pros: automatic scaling, IAM integration, managed security. Cons: vendor lock-in, per-invocation costs that can surprise you, and you are tied to GCP.
Pattern 5: Multi-Region Orchestrated
When you need high availability, you run agents in more than one region behind a load balancer.
# docker-compose.yml for multi-region
services:
hermes-primary:
image: hermes:latest
environment:
- HERMES_REGION=us-east
- HERMES_ROLE=primary
hermes-secondary:
image: hermes:latest
environment:
- HERMES_REGION=eu-west
- HERMES_ROLE=secondaryBest for: mission-critical agents, compliance requirements, global teams. Pros: high availability, disaster recovery, geographic distribution. Cons: complex setup, data consistency headaches, and reportedly somewhere in the order of 3-5x the cost of a single region (that multiplier is a rule of thumb rather than a published figure, so plan against your own numbers).
Pattern 6: Hybrid: Local Agent + Cloud Gateway
The agent runs locally for sensitive work but reaches out to cloud services for model inference and integrations. OpenRouter is a common choice for the inference gateway in this setup.
[Local Agent] <-- encrypted --> [Cloud Gateway] <-- --> [OpenRouter]Best for: privacy-sensitive organisations that still need team features. Pros: local data stays local, the cloud handles scale, you get some of each. Cons: a more complicated architecture, latency on the cloud calls, and a security model split across two places.
Deployment Checklist
Before you put any agent into production:
- Sandboxing configured (container or VM-based)
- Approval gates enabled for high-risk operations
- Secrets management (no API keys sitting in environment variables)
- Logging and monitoring configured
- Backup and recovery tested
- Rollback procedure documented
- Rate limiting enabled
- Health check endpoint configured
- TLS/SSL for all external communication
- Access controls (who is allowed to invoke the agent)
- Cost alerts and budgets set
- Documentation for whoever operates it
CI/CD for Agents
Agents deserve their own CI/CD pipelines, same as any other service you ship.
# .github/workflows/agent-ci.yml
name: Agent CI
on: [push]
jobs:
test:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- name: Test agent skills
run: hermes skills test --all
- name: Validate harness
run: omnigent validate
- name: Integration tests
run: pytest tests/integration/That omnigent validate step leans on Omnigent, the agent framework Databricks open-sourced in June 2026, which treats validation and test flows as first-class workflow artifacts. The exact omnigent validate and hermes skills test --all subcommands match how the tools work but were not confirmed word for word in the docs, so check before you copy.
Deploying an agent is not a single step. It is a loop: develop, test, deploy, monitor, update, repeat. The patterns above give you the infrastructure. The discipline you bring to that loop is what decides whether the thing stays up.




