A comprehensive CLI tool for benchmarking AI coding assistants across different scenarios and metrics.
- Two Benchmark Modes: LLM_Evaluator and PR_Recreate
- Multiple Metrics: Response time, code quality, AST similarity, and LLM-assessed metrics
- Parallel Execution: Run multiple agents simultaneously for faster benchmarks
- Visual Reports: Generate charts and comprehensive analysis
- Flexible Configuration: Support for various AI assistants and custom setups
git clone https://github.com/augment-solutions/augbench.git
cd augbench
npm install
# For LLM_Evaluator mode (prompt-based evaluation)
cp settings.json.llm.example settings.json
# For PR_Recreate mode (real PR recreation)
cp settings.json.pr.example settings.json
# Set up environment variables
cp .env.example .env
# Edit .env with your API keys
node bin/augbench.js validate
node bin/augbench.js benchmark
node bin/augbench.js report --charts
- validate – Check prerequisites and configuration
- benchmark – Execute benchmarking based on the mode set in settings.json
- report – Generate console reports and charts
- help – Show usage examples
- Installation & Usage - Complete setup and usage guide
- Benchmark Modes - LLM_Evaluator and PR_Recreate mode details
- Metrics System - Comprehensive metrics documentation
- AST Similarity Testing - Manual testing guide for AST similarity logic
- AI Assistant Configuration - How to configure different AI assistants
- Testing Guide - Testing strategy and guidelines
- Augment CLI - auggie command
- Claude Code - Anthropic's Claude CLI
- Cursor CLI - Cursor IDE command line interface
- Custom Assistants - Easy integration for any CLI-based AI tool
- Node.js ≥22.0.0
- Git ≥2.30.0 (for worktree support)
- Disk Space >10GB for repository staging
- AI Assistant CLIs installed and configured
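The version requirements above can be checked by hand before running a benchmark. The following is a minimal sketch, not part of augbench itself: the helper names `parse_version`, `meets_minimum`, and `check_prerequisites` are hypothetical, and the only external commands assumed are `node --version` and `git --version`.

```python
import re
import subprocess


def parse_version(text: str) -> tuple:
    """Extract the first dotted version number from output such as
    'v22.3.0' or 'git version 2.43.0'."""
    match = re.search(r"(\d+(?:\.\d+)*)", text)
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    return tuple(int(part) for part in match.group(1).split("."))


def meets_minimum(reported: str, minimum: str) -> bool:
    """Return True if the reported version is at least the minimum."""
    found = parse_version(reported)
    required = parse_version(minimum)
    # Pad the shorter tuple with zeros so '22' compares equal to '22.0.0'.
    width = max(len(found), len(required))
    found += (0,) * (width - len(found))
    required += (0,) * (width - len(required))
    return found >= required


def check_prerequisites() -> dict:
    """Hypothetical helper: run each tool with --version and compare
    against the minimums listed in this README."""
    minimums = {"node": "22.0.0", "git": "2.30.0"}
    results = {}
    for tool, minimum in minimums.items():
        try:
            completed = subprocess.run(
                [tool, "--version"], capture_output=True, text=True, check=True
            )
            results[tool] = meets_minimum(completed.stdout, minimum)
        except (FileNotFoundError, subprocess.CalledProcessError, ValueError):
            # Tool missing, failed to run, or printed no version number.
            results[tool] = False
    return results
```

Calling `check_prerequisites()` returns a dictionary such as `{"node": True, "git": True}` when both tools are new enough. The `validate` command remains the authoritative check; this sketch only covers the two version constraints.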
Analyze GitHub PR activity before and after an automation date. Supports analyzing all branches or a specific base branch.
- Script: roi_scripts/github_pr_metrics.py
- Configure via constants at top of the script or environment variables
Usage examples:
# Analyze ALL branches (default when BRANCH is empty)
GITHUB_TOKEN=ghp_xxx REPO_NAME=owner/repo WEEKS_BACK=2 AUTOMATED_DATE="2025-01-15" BRANCH="" \
python3 roi_scripts/github_pr_metrics.py
# Analyze a specific branch only (e.g., main)
GITHUB_TOKEN=ghp_xxx REPO_NAME=owner/repo WEEKS_BACK=4 AUTOMATED_DATE="2025-01-15T00:00:00Z" BRANCH=main \
python3 roi_scripts/github_pr_metrics.py
Tip: You can also run it via npm:
npm run pr-metrics
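To illustrate what a before/after comparison around the automation date could look like, here is a rough sketch. It is not the code in roi_scripts/github_pr_metrics.py: the function names are hypothetical, and it assumes that WEEKS_BACK defines an equally long window on each side of AUTOMATED_DATE, which the actual script may handle differently. It accepts both date formats used in the examples above.

```python
from datetime import datetime, timedelta, timezone


def parse_automation_date(value: str) -> datetime:
    """Accept 'YYYY-MM-DD' or 'YYYY-MM-DDTHH:MM:SSZ' and return a UTC datetime."""
    text = value.strip()
    if text.endswith("Z"):
        # Older Python versions do not accept a trailing 'Z' in fromisoformat.
        text = text[:-1] + "+00:00"
    parsed = datetime.fromisoformat(text)
    if parsed.tzinfo is None:
        # Assumption: a bare date means midnight UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


def comparison_windows(automated_at: datetime, weeks_back: int):
    """Return ((before_start, before_end), (after_start, after_end)).
    Assumption: both windows span weeks_back weeks."""
    span = timedelta(weeks=weeks_back)
    return (automated_at - span, automated_at), (automated_at, automated_at + span)


def split_by_window(created_times, automated_at: datetime, weeks_back: int):
    """Partition PR creation timestamps into 'before' and 'after' lists,
    dropping anything outside both windows."""
    (before_start, _), (_, after_end) = comparison_windows(automated_at, weeks_back)
    before = [t for t in created_times if before_start <= t < automated_at]
    after = [t for t in created_times if automated_at <= t < after_end]
    return before, after
```

With `AUTOMATED_DATE="2025-01-15"` and `WEEKS_BACK=2`, this sketch would compare PRs created from 2025-01-01 up to 2025-01-15 against PRs created from 2025-01-15 up to 2025-01-29.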