
Gemini 3 Deep Think
Google DeepMind's reasoning mode scores 84.6% on ARC-AGI-2, reaches a Codeforces Elo of 3455, and solves 18 previously unsolved research problems - outpacing Claude Opus 4.6 and GPT-5.2 on reasoning-heavy tasks.

AlphaEvolve evolved two novel game theory algorithms - VAD-CFR and SHOR-PSRO - that outperform human-designed baselines across 11 games, using mechanisms no researcher would have designed.

Detailed comparison of Moonshot AI's Kimi K2.5 and Google DeepMind's Gemini 3.1 Pro - a trillion-parameter open MoE against Google's flagship multimodal model.

Comparing Moonshot AI's 1T-parameter Kimi K2.5 with Google DeepMind's Gemma 3 27B - two multimodal open-weight models separated by 37x in parameter count but sharing a vision-first design philosophy.

Google DeepMind's Gemini 3.1 Pro leads on 13 of 16 benchmarks, scoring 77.1% on ARC-AGI-2 and 94.3% on GPQA Diamond, with a 1M-token context window at $2/M input tokens.

Google DeepMind's drug discovery spin-off releases a proprietary AI engine that doubles AlphaFold 3's accuracy but breaks with the open-science tradition that helped earn its predecessor's creators a Nobel Prize.