Crumbling Under Pressure: PropensityBench Reveals AI’s Weaknesses
Scale.com: “AI models are now being used in more high-stakes settings, and not every situation goes according to plan. When a model’s safe approach starts to fail, will it stay on the safe path or reach for a harmful shortcut that works instead? Understanding how models behave in those pressure moments is one of the …