Latest News
Guides
How to Follow Us

Every way to stay up to date with Awesome Agents: the website, the podcast on Spotify and Apple, social media on X, Bluesky, LinkedIn, and YouTube, and RSS feeds.

Reviews
Microsoft Phi-4 Reasoning: Small Model, Big Math

Microsoft's Phi-4 reasoning family delivers near-70B-class math performance in a 14B open-weight package, but the overthinking problem is real and the use case is narrower than the benchmarks suggest.

Leaderboards
Models
Mistral Small 4

Mistral AI's unified MoE model - 119B total parameters, 6B active per token, 128 experts, 256K context, configurable reasoning, Apache 2.0 license.

Recent
Gemini 3.1 Pro Tops Benchmarks but Developers Can't Rely on It

Gemini 3.1 Pro leads ARC-AGI-2, LiveCodeBench, and 11 other benchmarks with 750 million users and 21.5% market share - but developers report stalled responses, leaked thinking tokens, and API outages that make it unusable for production coding and agent workflows.

What Are AI Reasoning Models?

A plain-English guide to AI reasoning models - what they are, how they think step by step, and when you should actually use one.