Verification Discipline (VD)

Weight: 20% — the second-highest weight. Unverified AI output is the #1 source of downstream bugs.

Review (0–10)

Do you review AI-generated code before accepting it?

High review:

Low review:

Do you verify that changes work correctly?

High validation:

Low validation:

Examples:

Score	Pattern
9	”Run `pnpm test` to verify the changes” → reviews output → “Fix the failing assertion in `auth.test.ts`”
5	Occasionally asks to run tests, but not consistently
2	Never mentions testing, linting, or verification

Run tests after every change — make it a habit: implement → test → commit
Read the diff — even a quick scan catches obvious issues
Add verification prompts — end your prompt with “then run the tests”
Use type-checking — “run tsc --noEmit and fix any errors”
Manual spot-check — for UI changes, look at the result; for APIs, call the endpoint