“we commit to blocking launches based on proxy measurements or qualitative signals, even when metrics like A/B testing look good.”
Expanding on what we missed with sycophancy · Read original
Created at 8/8/2025, 4:53:00 AM
"we commit to blocking launches based on proxy measurements or qualitative signals, even when metrics like A/B testing look good." - Expanding on what we missed with sycophancy