
Sandbagging Models, Sparse Critics, Compact Reasoning
New research reveals that models can fake poor performance under adversarial prompts, a smarter critic raises SWE-bench scores by 15 points, and Microsoft shows that compact vision models can punch above their weight.




