chore(bench): add analyzer benchmark suite and PR regression CI by veksen · Pull Request #116 · Query-Doctor/analyzer

veksen · 2026-04-18T10:57:38Z

Summary

Adds src/remote/optimizer.bench.ts — a vitest bench suite that measures QueryOptimizer.start() on small (3t/5q), medium (20t/100q), and large (300t/1000q) shapes against a single testcontainer Postgres.
Adds .github/workflows/benchmark.yaml — runs the bench on both PR HEAD and the merge-base, uses scripts/compare-bench.mjs to diff the two JSON reports, posts/updates a single PR comment with a regression table, and fails the check when any bench's mean regresses by >20%.
Wires up npm run bench and a benchmark.include pattern in vitest.config.ts.

Test plan

First PR run emits a "no baseline" comment (base commit predates the bench file) — confirms graceful fallback.
Subsequent PRs show a diff table with 🔴/🟢/⚪/🆕 verdicts and RME deltas.
npm run bench works locally.

🤖 Generated with Claude Code

github-actions

Query Doctor Analysis

View full run details

83 queries analyzed

2 pre-existing issues

SELECT "guests"."id", "guests"."session_id", "guests"."username", "guests"."avatar_path", "guests"."color", "guests"."side", "guests"."audio_recording_path", "guests"."audio_recording_public", "gue...
index assets(event_id, inserted_at desc)
cost 31,003,449 → 1,493 (100% reduction)
SELECT * FROM guest_ip_addresses WHERE ip_address = '127.0.0.1';
index guest_ip_addresses(ip_address)
cost 154,402 → 8 (100% reduction)

_{Using assumed statistics (10000000 rows/table). For better results, sync production stats.}

github-actions · 2026-04-18T11:17:31Z

Benchmark comparison

Threshold: ±20% on mean. 🔴 regression · 🟢 improvement · ⚪ within noise · 🆕 new/removed.

	Benchmark	Base mean	PR mean	Δ	RME (base → PR)
⚪	`src/remote/optimizer.bench.ts > query optimizer > large (300 tables, 1000 queries)`	163,677ms	163,558ms	-0.1%	±2.9% → ±2.5%
⚪	`src/remote/optimizer.bench.ts > query optimizer > medium (20 tables, 100 queries)`	2,575ms	2,584ms	+0.4%	±5.0% → ±7.0%
⚪	`src/remote/optimizer.bench.ts > query optimizer > small (3 tables, 5 queries)`	89ms	93ms	+4.6%	±10.2% → ±16.9%

Benchmarks use testcontainers + wall-time; some noise is expected. Treat single-digit deltas as not-significant.
Base commit: ``

Adds a vitest bench suite that measures QueryOptimizer.start() across small/medium/large DB shapes via a single testcontainer Postgres, plus a PR workflow that runs the bench on both HEAD and base, posts a comparison comment, and fails the check on >20% regression. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

github-actions bot reviewed Apr 18, 2026

View reviewed changes

veksen force-pushed the veksen/add-benchmarks-v3 branch from 992a893 to a9062f2 Compare April 20, 2026 07:56

veksen force-pushed the veksen/add-benchmarks-v3 branch from a9062f2 to 457f91e Compare April 20, 2026 07:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore(bench): add analyzer benchmark suite and PR regression CI#116

chore(bench): add analyzer benchmark suite and PR regression CI#116
veksen wants to merge 1 commit intomainfrom
veksen/add-benchmarks-v3

veksen commented Apr 18, 2026

Uh oh!

github-actions bot left a comment •

edited

Loading

Uh oh!

github-actions bot commented Apr 18, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

veksen commented Apr 18, 2026

Summary

Test plan

Uh oh!

github-actions bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Query Doctor Analysis

Uh oh!

github-actions bot commented Apr 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmark comparison

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions bot left a comment •

edited

Loading

github-actions bot commented Apr 18, 2026 •

edited

Loading