goodhart-bijection-trap added to PyPI
Pre-registered empirical benchmark of the bijection trap in MI-based coherence metrics. Built on Autonometrics.
Pre-registered empirical benchmark of the bijection trap in MI-based coherence metrics: a Goodhart agent finds the shortcut under cost asymmetry; a match_rate floor defends. Built on Autonometrics. … [+15159 chars]