AugBench

A comprehensive CLI tool for benchmarking AI coding assistants across different scenarios and metrics.

Features

Two Benchmark Modes: LLM_Evaluator and PR_Recreate
Multiple Metrics: Response time, code quality, AST similarity, and LLM-assessed metrics
Parallel Execution: Run multiple agents simultaneously for faster benchmarks
Visual Reports: Generate charts and comprehensive analysis
Flexible Configuration: Support for various AI assistants and custom setups

Quick Start

1. Installation

git clone https://github.com/augment-solutions/augbench.git
cd augbench
npm install

2. Configuration

# For LLM_Evaluator mode (prompt-based evaluation)
cp settings.json.llm.example settings.json

# For PR_Recreate mode (real PR recreation)
cp settings.json.pr.example settings.json

# Set up environment variables
cp .env.example .env
# Edit .env with your API keys

3. Validate Environment Setup

node bin/augbench.js validate

4. Run Benchmark

node bin/augbench.js benchmark

5. Generate Reports

node bin/augbench.js report --charts

Commands

validate – Check prerequisites and configuration
benchmark – Execute benchmarking based on settings.json mode
report – Generate console reports and charts
help – Show usage examples

Documentation

Installation & Usage - Complete setup and usage guide
Benchmark Modes - LLM_Evaluator and PR_Recreate mode details
Metrics System - Comprehensive metrics documentation
AST Similarity Testing - Manual testing guide for AST similarity logic
AI Assistant Configuration - How to configure different AI assistants
Testing Guide - Testing strategy and guidelines

Supported AI Assistants

Augment CLI - auggie command
Claude Code - Anthropic's Claude CLI
Cursor CLI - Cursor IDE command line interface
Custom Assistants - Easy integration for any CLI-based AI tool

Requirements

Node.js ≥22.0.0
Git ≥2.30.0 (for worktree support)
Disk Space >10GB for repository staging
AI Assistant CLIs installed and configured

ROI Scripts: GitHub PR Metrics

Analyze GitHub PR activity before and after an automation date. Supports analyzing all branches or a specific base branch.

Script: roi_scripts/github_pr_metrics.py
Configure via constants at top of the script or environment variables

Usage examples:

# Analyze ALL branches (default when BRANCH is empty)
GITHUB_TOKEN=ghp_xxx REPO_NAME=owner/repo WEEKS_BACK=2 AUTOMATED_DATE="2025-01-15" BRANCH="" \
  python3 roi_scripts/github_pr_metrics.py

# Analyze a specific branch only (e.g., main)
GITHUB_TOKEN=ghp_xxx REPO_NAME=owner/repo WEEKS_BACK=4 AUTOMATED_DATE="2025-01-15T00:00:00Z" BRANCH=main \
  python3 roi_scripts/github_pr_metrics.py

Tip: You can also run it via npm:

npm run pr-metrics

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
.augment/rules		.augment/rules
bin		bin
docs		docs
grammars		grammars
roi_scripts		roi_scripts
scripts		scripts
src		src
.env.example		.env.example
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json
settings.json		settings.json
settings.json.llm.example		settings.json.llm.example
settings.json.pr.example		settings.json.pr.example

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AugBench

Features

Quick Start

1. Installation

2. Configuration

3. Validate Environment Setup

4. Run Benchmark

5. Generate Reports

Commands

Documentation

Supported AI Assistants

Requirements

ROI Scripts: GitHub PR Metrics

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AugBench

Features

Quick Start

1. Installation

2. Configuration

3. Validate Environment Setup

4. Run Benchmark

5. Generate Reports

Commands

Documentation

Supported AI Assistants

Requirements

ROI Scripts: GitHub PR Metrics

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages