agent-runner

One Rust binary agent without any user source code.

A minimal, non-interactive AI agent runner in your server or container. Give it a folder with AGENTS.md and skills, it uses tools, MCP and skills and iterates until the task is done.

Quick Start

cd agent-runner
cargo build --release
cp .env.example .env   # edit with your API key

./target/release/agent-runner --agent-dir ./my-agent --prompt "Refactor the auth module"

Docker

cd agent-runner
docker build -t agent-runner .
docker run --env-file .env agent-runner \
  --agent-dir /agents/my-agent --prompt "Fix the tests"

Agent = agent-runner + Folder

Everything an agent needs lives in one folder:

my-agent/
├── AGENTS.md              # System prompt — who the agent is and how it behaves
├── agent-runner.json      # MCP config, timeouts, permissions
└── skills/                # Optional: extra skills
    └── search/
        ├── SKILL.md       # Skill instructions injected into the system prompt
        ├── references/    # Reference documents for the skill
        └── scripts/       # Executable scripts (exposed as agent tools)

That's it. No database, no server, no setup beyond the folder.

agent-runner.json

MCP servers, timeouts, permissions, and agent behavior. LLM settings come from environment variables or .env:

{
  "mcp_servers": {},
  "timeouts": {
    "tool_timeout_secs": 120,
    "run_limit_secs": 3600
  },
  "agent": {
    "max_iterations": 50,
    "plan_required": true,
    "execute_enabled": false
  },
  "permissions": [
    { "operations": ["read"], "paths": ["./*"], "mode": "allow" }
  ]
}

Timeout Settings

Timeouts can be set in agent-runner.json and overridden by CLI flags:

Setting	Config Key	CLI Flag	Default
Per-tool timeout	`timeouts.tool_timeout_secs`	`--tool-timeout`	120s
Whole run limit	`timeouts.run_limit_secs`	`--run-limit`	3600s

MCP Servers

{
  "mcp_servers": {
    "filesystem": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "/data"],
      "env": {}
    }
  }
}

Environment Variables

Set LLM provider, model, and API key via environment variables or a .env file:

LLM_PROVIDER=anthropic
LLM_MODEL=claude-sonnet-4-20250514
LLM_BASE_URL=https://api.anthropic.com
ANTHROPIC_API_KEY=sk-ant-...

Variable	Required	Description
`LLM_PROVIDER`	yes	`anthropic` or `openai`
`LLM_MODEL`	yes	Model name (e.g. `claude-sonnet-4-20250514`, `gpt-4o`)
`LLM_BASE_URL`	no	Override base URL for OpenAI-compatible APIs
`LLM_API_KEY`	yes	API key (or use provider-specific name below)
`ANTHROPIC_API_KEY`	if provider=anthropic	Anthropic API key
`OPENAI_API_KEY`	if provider=openai	OpenAI API key

API Keys

API keys can be provided via:

.env file — place in the working directory (loaded automatically)
Environment variables — export ANTHROPIC_API_KEY=sk-ant-...

Full Configuration Reference

{
  "mcp_servers": {},
  "timeouts": {
    "tool_timeout_secs": 120,
    "run_limit_secs": 3600
  },
  "agent": {
    "max_iterations": 50,
    "plan_required": true,
    "tool_output_token_limit": 20000,
    "user_message_token_limit": 50000,
    "execute_timeout_secs": 3600,
    "execute_enabled": false
  },
  "summarization": {
    "enabled": true,
    "trigger_tokens": 80000,
    "keep_tokens": 20000,
    "trim_tokens": 4000
  },
  "permissions": [
    {
      "operations": ["read"],
      "paths": ["./*"],
      "mode": "allow"
    }
  ],
  "subagents": []
}

CLI

agent-runner --agent-dir <DIR> --prompt <TEXT|FILE> [OPTIONS]

Option	Default	Description
`--agent-dir`	(required)	Path to agent folder
`--prompt`	(required)	Task prompt or path to a text file
`--plan-only`	`false`	Generate plan and exit without executing
`--max-iterations`	`50`	Maximum agent loop iterations
`--output-dir`	`./agent-output`	Output directory for reports and traces
`--working-dir`	`.`	Working directory for filesystem/execute tools
`--tool-timeout`	`120`	Timeout in seconds for each tool call
`--run-limit`	`3600`	Maximum total run time in seconds
`--verbose`	`false`	Print iteration details to stderr
`--sandbox`	`false`	Enable shell execution regardless of config

Exit Codes

Code	Meaning
`0`	Task completed
`1`	Task failed
`2`	Max iterations or run limit exceeded
`3`	Configuration error

Built-in Tools

The agent has these tools available by default:

Tool	Description
`ls`	List directory entries
`read_file`	Read file contents with line-based pagination
`write_file`	Write content to a file (creates parent dirs)
`edit_file`	Find-and-replace strings in a file
`glob`	Find files matching a glob pattern
`grep`	Search file contents with regex
`execute`	Run a shell command (when enabled)
`task_done`	Signal task completion
`write_todos`	Update internal todo list
`compact_conversation`	Trigger conversation compaction

Permissions

Control which tools can access which paths:

{
  "permissions": [
    { "operations": ["read"], "paths": ["./*"], "mode": "allow" },
    { "operations": ["write"], "paths": ["./src/*"], "mode": "allow" },
    { "operations": ["write"], "paths": ["./secrets/*"], "mode": "deny" }
  ]
}

Operations: "read" covers ls, read_file, glob, grep. "write" covers write_file, edit_file, execute. Paths support /* for prefix matching.

Output

After execution, the output directory contains:

File	Description
`run.json`	Detailed run log with per-iteration and per-tool TAT, errors, and exceptions
`plan.md`	Generated execution plan
`report.json`	Status, token usage, iterations, duration, todos
`transcript.json`	Full message history
`trace.jsonl`	Structured event log (one JSON object per line)

run.json

Every run produces a run.json with full debugging details:

{
  "status": "completed",
  "exit_code": 0,
  "started_at": "2026-05-26T12:00:00.000Z",
  "finished_at": "2026-05-26T12:01:23.456Z",
  "duration_ms": 83456,
  "iterations": [
    {
      "iteration": 1,
      "llm_tat_ms": 3200,
      "llm_input_tokens": 1200,
      "llm_output_tokens": 340,
      "tool_calls": [
        {
          "tool": "read_file",
          "arguments": {"file_path": "src/main.rs"},
          "tat_ms": 12,
          "is_error": false,
          "timed_out": false
        }
      ]
    }
  ],
  "errors": []
}

Benchmarks

Comparison with other agent runners (approximate, based on community reports):

Metric	agent-runner	Claude Code	OpenClaw	Hermes Agent	OpenCode
Binary size	~3 MB	~80 MB	~120 MB	~15 MB	~20 MB
Runtime deps	Zero	Node.js	Python + Node	Go	Go
Mode	Batch	Interactive	Both	Batch	Interactive
Avg cost/task	$0.12	$0.18	$0.22	$0.15	$0.14

How It Works

Loads the agent folder (AGENTS.md, agent-runner.json, skills)
Optionally generates a step-by-step execution plan
Runs an autonomous loop: LLM call → tool execution → repeat
Summarizes conversation history when context gets long
Exits when the agent calls task_done or hits max iterations / run limit
Writes run.json, report, transcript, and trace log

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.github/workflows		.github/workflows
agent-runner		agent-runner
web-site		web-site
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
agent-runner-diagram-philosophy.md		agent-runner-diagram-philosophy.md
agent-runner-diagram.html		agent-runner-diagram.html
vercel.json		vercel.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

agent-runner

Quick Start

Docker

Agent = agent-runner + Folder

agent-runner.json

Timeout Settings

MCP Servers

Environment Variables

API Keys

Full Configuration Reference

CLI

Exit Codes

Built-in Tools

Permissions

Output

run.json

Benchmarks

How It Works

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

agent-runner

Quick Start

Docker

Agent = agent-runner + Folder

agent-runner.json

Timeout Settings

MCP Servers

Environment Variables

API Keys

Full Configuration Reference

CLI

Exit Codes

Built-in Tools

Permissions

Output

run.json

Benchmarks

How It Works

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages