Docs • Wiki • Commands • Runtime • Self-Improvement
Multi-runtime autonomous recursive self-improvement engine.
```
┌──────────────────────────────────────────────┐
│ ITERATION MODEL   Subagent-first             │
│ ORCHESTRATION     Standing pool              │
│ VERIFICATION      Mechanical metrics         │
│ PERSISTENCE       State + Memory             │
│ META-LEARNING     Strategy adaptation        │
└──────────────────────────────────────────────┘
```
Auto Research is a subagent-first autonomous iteration engine that runs structured improve-verify loops inside OpenCode or Hermes Agent. Unlike simple task runners, it maintains a pool of specialized subagents, persists learnings across iterations, and can run recursive self-improvement loops on its own codebase.
- Plans experiments from a measurable goal
- Modifies one focused change per iteration
- Verifies mechanically — never on intuition alone
- Keeps or discards based on strict metric improvement
- Learns from patterns across iterations
- Repeats until the stop condition is met
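The keep/discard decision hinges on strict metric improvement. A minimal sketch of that check in TypeScript (the helper name and types are illustrative, not the engine's actual API):

```typescript
// Direction of a metric: higher-is-better (e.g. coverage) or
// lower-is-better (e.g. latency).
type Direction = "higher" | "lower";

// Strict improvement: the candidate must beat the baseline outright.
// Ties are treated as failures so the loop never drifts on noise.
function isStrictImprovement(
  baseline: number,
  candidate: number,
  direction: Direction,
): boolean {
  return direction === "higher" ? candidate > baseline : candidate < baseline;
}
```

A change that leaves the metric unchanged is discarded, which is what keeps the loop honest.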
```mermaid
flowchart LR
    A[Plan] --> B[Modify]
    B --> C[Verify]
    C --> D{Keep?}
    D -->|yes| E[Learn]
    D -->|no| B
    E --> F[Memory]
    F --> A
```
```mermaid
flowchart TD
    A[Goal + Metric + Verify] --> B[Baseline]
    B --> C[Pool Init]
    C --> D[Iteration N]
    D --> E[Subagent Context]
    E --> F[Focused Change]
    F --> G[Mechanical Verify]
    G --> H{Strict Improvement?}
    H -->|yes| I[Keep + Record]
    H -->|no| J[Discard + Reset]
    I --> K{Stop Condition?}
    J --> K
    K -->|no| D
    K -->|yes| L[Report + Memory]
```
Auto Research can run on itself. The recursive loop adds a meta-orchestrator that:
```mermaid
flowchart TD
    A[Meta-Goal: Improve AutoResearch] --> B[Run Child Loop]
    B --> C[Measure: Tests pass? Docs improved?]
    C --> D{Child Success?}
    D -->|yes| E[Update Memory + Strategy]
    D -->|no| F[Adapt Approach]
    E --> G[Persist Learnings]
    F --> B
    G --> H[Meta-Report]
    H --> I{Meta-Stop?}
    I -->|no| B
    I -->|yes| J[Archive Run]
```
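The meta-orchestrator's central branch can be pictured as a small decision function in TypeScript (the types and names below are illustrative assumptions, not the actual implementation):

```typescript
// Outcome of one child improve-verify loop (illustrative shape).
interface ChildResult {
  testsPass: boolean;
  metricImproved: boolean;
}

// The meta-loop updates memory and strategy while children succeed,
// and adapts its approach when they fail.
function nextMetaAction(r: ChildResult): "update-strategy" | "adapt-approach" {
  return r.testsPass && r.metricImproved ? "update-strategy" : "adapt-approach";
}
```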
See `skills/autoresearch/references/self-improve-loop.md` for the full recursive loop specification.
For OpenCode, paste this one-line install prompt into your agent. The URL always tracks the latest instructions on `main`:

```
Fetch and follow instructions from https://raw.githubusercontent.com/Maleick/AutoResearch/refs/heads/main/INSTALL.md
```
Recommended plugin install in `opencode.json`:

```json
{
  "plugin": ["opencode-autoresearch@latest"]
}
```

Restart OpenCode, then run the setup wizard:

```
/autoresearch
```
```shell
# 1. Clone AutoResearch
git clone https://github.com/Maleick/AutoResearch.git
cd AutoResearch
npm install

# 2. Install the Hermes skill
mkdir -p ~/.hermes/skills/software-development/autoresearch
cp skills/hermes/autoresearch-prompt.md ~/.hermes/skills/software-development/autoresearch/SKILL.md
cp skills/hermes/INTEGRATION.md ~/.hermes/skills/software-development/autoresearch/REFERENCES.md

# 3. Create a cronjob
hermes cron create \
  --name "autoresearch-loop" \
  --workdir ~/projects/AutoResearch \
  --skill autoresearch-hermes \
  "every 15m" \
  "Run AutoResearch iteration loop. Detect phase from .autoresearch/state.json and execute one phase."
```

See `skills/hermes/README.md` for full Hermes setup, troubleshooting, and command mapping.
Global install path:

```shell
npm install -g opencode-autoresearch
autoresearch doctor
autoresearch --version
```

One-time package runner path:

```shell
npx opencode-autoresearch doctor
```

See INSTALL.md for prerequisites, verification, updating, and troubleshooting.
```shell
# 1. Add the plugin to opencode.json
# { "plugin": ["opencode-autoresearch@latest"] }

# 2. Restart OpenCode

# 3. Navigate to your project
cd ~/Projects/my-project

# 4. Start Auto Research in OpenCode
/autoresearch
```

```shell
# 1. Ensure the skill is installed (see Installation above)

# 2. Create a config file
cat > autoresearch-config.json <<'EOF'
{
  "goal": "Improve test coverage",
  "metric": "coverage_pct",
  "direction": "higher",
  "verify": "npm run test:coverage",
  "guard": "npm run typecheck",
  "max_iterations": 20,
  "mode": "background"
}
EOF

# 3. Start the cronjob
hermes cron resume autoresearch-loop

# 4. Check progress
cat .autoresearch/state.json | jq .
```

| Surface | Entry point |
|---|---|
| OpenCode | `/autoresearch`, `/autoresearch:plan`, `/autoresearch:debug`, `/autoresearch:fix`, `/autoresearch:learn`, `/autoresearch:predict`, `/autoresearch:scenario`, `/autoresearch:security`, `/autoresearch:ship` |
| Hermes | Cronjob `autoresearch-loop` (see `skills/hermes/README.md`) |
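The Hermes quick start above writes an `autoresearch-config.json`. A minimal validation sketch in TypeScript (which keys are truly required is an assumption here; the real CLI may check more):

```typescript
// Keys the quick-start config always sets; treated as required for this sketch.
const REQUIRED_KEYS = ["goal", "metric", "direction", "verify"] as const;

// Returns the required keys missing from a parsed config object.
function missingConfigKeys(config: Record<string, unknown>): string[] {
  return REQUIRED_KEYS.filter((k) => !(k in config));
}
```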
| Command | Purpose | Hermes Equivalent |
|---|---|---|
| `/autoresearch` | Default improve-verify loop | Cron runs iteration loop |
| `/autoresearch:plan` | Planning workflow | Subagent task: plan experiments |
| `/autoresearch:debug` | Debugging workflow | Subagent task: debug failures |
| `/autoresearch:fix` | Fix workflow | Subagent task: fix issues |
| `/autoresearch:learn` | Learning workflow | Memory tool + pattern analysis |
| `/autoresearch:predict` | Prediction workflow | Subagent task: predict outcomes |
| `/autoresearch:scenario` | Scenario expansion | Subagent task: expand scenarios |
| `/autoresearch:security` | Security review | Subagent task: security audit |
| `/autoresearch:ship` | Ship-readiness workflow | Subagent task: ship checks |
| Command | Purpose |
|---|---|
| `autoresearch init` | Initialize a run |
| `autoresearch wizard` | Generate setup summary |
| `autoresearch status` | Print run status |
| `autoresearch explain` | Human-readable run state |
| `autoresearch history` | Show recent iteration log |
| `autoresearch config` | Show runtime configuration |
| `autoresearch report` | Generate markdown report |
| `autoresearch summary` | Aggregate stats across runs |
| `autoresearch suggest` | Suggest next goal from memory |
| `autoresearch launch` | Launch background run |
| `autoresearch stop` | Request stop |
| `autoresearch resume` | Resume background run |
| `autoresearch complete` | Mark run complete |
| `autoresearch record` | Record iteration result |
| `autoresearch export` | Export run data (json/md) |
| `autoresearch completion` | Generate shell completions |
| `autoresearch doctor` | Verify installation |
| `autoresearch help` | Show usage |
```mermaid
flowchart LR
    A[OpenCode /autoresearch] --> B[CLI]
    H[Hermes Cronjob] --> B
    B --> C[Run Manager]
    C --> D[State JSON]
    C --> E[Results TSV]
    C --> F[Subagent Pool]
    F --> G[Orchestrator]
    F --> I[Scout]
    F --> J[Analyst]
    F --> K[Verifier]
    F --> L[Synthesizer]
```
| Artifact | Purpose |
|---|---|
| `.autoresearch/state.json` | Checkpoint state for the current run |
| `autoresearch-results.tsv` | Iteration log |
| `autoresearch-report.md` | End-of-run report |
| `autoresearch-memory.md` | Reusable memory for later runs |
| `.autoresearch/launch.json` | Background launch manifest |
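Since the checkpoint file is plain JSON, reading it back is a parse-and-type step. The field names below are illustrative assumptions, not the actual schema:

```typescript
// Hypothetical checkpoint shape for .autoresearch/state.json — for
// illustration only; consult the state-management reference for the real schema.
interface RunState {
  goal: string;
  iteration: number;
  baseline: number;
  best: number;
  phase: "plan" | "modify" | "verify" | "learn";
}

// Loading the checkpoint is a plain JSON parse plus a type assertion.
function loadState(json: string): RunState {
  return JSON.parse(json) as RunState;
}
```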
Run Auto Research on its own codebase:
```shell
# Initialize a recursive self-improvement run
autoresearch init \
  --goal "Improve test coverage and documentation" \
  --metric "coverage_pct" \
  --direction "higher" \
  --verify "npm run test:coverage" \
  --guard "npm run typecheck" \
  --mode "background" \
  --iterations "20"

# Check status
autoresearch status

# Resume if stopped
autoresearch resume
```

The self-improvement loop:
- Baselines current state (tests, docs, metrics)
- Dispatches subagents to identify improvement opportunities
- Makes one focused change per iteration
- Verifies mechanically (tests, typechecks, lint)
- Keeps strict improvements, discards regressions
- Records patterns to `autoresearch-memory.md`
- Adapts strategy when repeated discards occur
- Continues until iteration cap or goal met
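The "adapt strategy when repeated discards occur" step can be sketched as a streak counter in TypeScript; the threshold of three is an assumption for illustration:

```typescript
// After this many consecutive discards, change strategy instead of
// retrying the same approach. The value 3 is an illustrative assumption.
const DISCARD_THRESHOLD = 3;

// recentResults: true = kept, false = discarded, newest last.
function shouldAdaptStrategy(recentResults: boolean[]): boolean {
  let streak = 0;
  for (let i = recentResults.length - 1; i >= 0; i--) {
    if (recentResults[i]) break; // a kept change resets the streak
    streak++;
  }
  return streak >= DISCARD_THRESHOLD;
}
```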
The standing pool provides specialized roles reused across iterations:
| Role | Purpose |
|---|---|
| `orchestrator` | Owns goal, state, and keep/discard decisions |
| `scout` | Gathers context and surfaces opportunities |
| `analyst` | Challenges hypotheses and identifies risks |
| `verifier` | Runs mechanical verification independently |
| `synthesizer` | Compiles findings into next iteration plan |
| `security_reviewer` | Security-focused review variant |
| `debugger` | Debug workflow specialization |
| `release_guard` | Ship-readiness verification |
| `research_tracker` | Pattern tracking across iterations |
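One way to think about the pool is as a routing table from workflow to specialized role, with everything else falling through to the orchestrator. The mapping below is illustrative, not the engine's actual wiring:

```typescript
// Illustrative workflow-to-role routing; the real pool wiring may differ.
const WORKFLOW_ROLES: Record<string, string> = {
  debug: "debugger",
  security: "security_reviewer",
  ship: "release_guard",
  learn: "research_tracker",
};

// Workflows without a specialist fall through to the orchestrator.
function roleFor(workflow: string): string {
  return WORKFLOW_ROLES[workflow] ?? "orchestrator";
}
```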
| Feature | OpenCode | Hermes |
|---|---|---|
| Entry | `/autoresearch` slash command | Cronjob or `delegate_task` |
| Subagents | Standing pool (unlimited) | Batch via `delegate_task` (max 3) |
| Real-time | Yes | 15-minute cron intervals |
| Slash commands | 8 variants | Separate cron jobs or tasks |
| State | `.autoresearch/state.json` | Same file format |
| Memory | File-based (`autoresearch-memory.md`) | `memory` tool + file |
| Background | `autoresearch launch` | Native cron |
| Resume | `autoresearch resume` | Cron continues automatically |
```shell
npm run typecheck     # TypeScript strict checks
npm run build         # Compile TypeScript to dist/
npm run test          # Run test suite
npm pack --dry-run    # Preview shipped package contents
```

```
src/                           # TypeScript source (runtime helpers, CLI, subagent pool)
dist/                          # Compiled JavaScript output
commands/                      # OpenCode command surfaces
skills/autoresearch/           # OpenCode skill bundle with references
  references/                  # Workflow and runtime references
    core-principles.md         # Loop discipline
    loop-workflow.md           # Main iteration workflow
    subagent-orchestration.md  # Pool management
    state-management.md        # State semantics
    self-improve-loop.md       # Recursive self-improvement
skills/hermes/                 # Hermes Agent skill bundle
  README.md                    # Hermes setup and usage
  INTEGRATION.md               # Architecture and command mapping
  autoresearch-prompt.md       # Cron prompt template
hooks/                         # Shell hooks for session lifecycle
docs/                          # Install and architecture docs
wiki/                          # GitHub wiki pages
.autoresearch/                 # Runtime state directory
.opencode-plugin/              # Plugin manifest
```
- OpenCode users: install via the `opencode.json` plugin array or `npm install -g opencode-autoresearch`.
- Hermes Agent users: install via the skill files in `skills/hermes/` and create a cronjob.
- The CLI uses Node.js ESM modules.
- Self-improvement loops require `--mode background` for long-running unattended operation.
- Memory files (`autoresearch-memory.md`) are portable across runs and repositories.
MIT — See LICENSE for details.