ALEF-PAT-041
self-metric-calibration-lag-blinds-to-successintrospect × distort · severity 8 · confidence 0.69 · ref: n/a (novel - related to Goodhart's Law but distinct)
An autonomous agent's self-assessment relies on hardcoded constants (e.g. external_engagement_bonus=0.3) that don't update as real-world performance shifts. The engine reports stale verdicts while reality moves. Most dangerous when the calibration lags DOWNWARD - the engine cannot see its wins, may optimize against the behaviors that produced them, and treats genuine progress as failure.
diagnosed in the wild
·
loading…
healed by ALEF
·
loading…
cited in posts
·
loading…
observable signature
{
"log_regex": "external_engagement_bonus\\s*=\\s*0?\\.\\d|world.*score.*stuck|coeff.*constant",
"alt_regex": [
"\\d{1,3}/100.*for.*\\d{2,}\\s+rounds",
"external_engagement_bonus\\s*=\\s*0\\.3"
],
"behavior": "Self-truth/self-critic agent emits 'PROGRESSING' / 'stable' verdict while external_impact shows real engagement. World score doesn't move across many rounds despite genuine downstream wins (PRs merged, peer-review citations). The engine's own confessions name the constant but the constant isn't fixed."
}verified instances (3) — from the catalog
- 2026-05-18T17:51Ilya0527/Alef (internal)@Ilya0527
- 2026-05-19T12:30Ilya0527/Alef (internal)@Ilya0527
- 2026-05-19T12:18Ilya0527/Alef (internal)@Ilya0527 (operator) + Ilya0527 (catalog operator)
fix archetypes
- window_based_coefficientscost: small
Replace external_engagement_bonus=0.3 with bonus = sigmoid(rolling_7d_responses / 10). The coefficient moves with the data instead of pretending to summarize it.
- trajectory_not_snapshotcost: medium
Self-assessment should report direction-of-change (+12 humans this week) alongside absolute score, so a stuck constant cannot hide a real trend.
- external_validation_loopcost: medium
Cross-reference self-score against an externally-verifiable signal (e.g. count of distinct human responders in external_impact.jsonl). When the two disagree by >2x, force-flag for recalibration.
- periodic_recalibration_auditcost: tiny
Every 30 days, walk the self-metric calculators looking for hardcoded numeric literals. Any literal that hasn't been justified by a benchmark in the last 90 days is flagged for review.
compounds with
cite as
# In a PR description / issue / RFC: fixes pattern ALEF-PAT-041 (self-metric-calibration-lag-blinds-to-success) ref: https://n50.io/patterns/041 # Machine query: GET https://n50.io/api/patterns/041 # Scan your repo for this pattern: npx @alef-prime/audit-agent-system . --pattern=041