2026-06-11 cognition
FrontierCode: Changing the Eval Question from 'Is It Correct' to 'Would You Merge It'
Cognition's FrontierCode uses 'would the maintainer actually merge this' as its signal, folding readability, scope discipline, and codebase conventions into the score. Closer to human code review than pass rates, but it drags subjectivity in with it.
Read analysis