Latest News
View All News →
Guides View All →
How to Follow Us

How to Follow Us

Every way to stay up to date with Awesome Agents - website, podcast on Spotify and Apple, social media on X, Bluesky, LinkedIn, YouTube, and RSS feeds.

Reviews View All →
Microsoft Phi-4 Reasoning: Small Model, Big Math

Microsoft Phi-4 Reasoning: Small Model, Big Math

Microsoft's Phi-4 reasoning family delivers near-70B-class math performance in a 14B open-weight package, but the overthinking problem is real and the use case is narrower than the benchmarks suggest.

Leaderboards View All →
Models View All →
Mistral Small 4

Mistral Small 4

Mistral AI's unified MoE model - 119B total parameters, 6B active per token, 128 experts, 256K context, configurable reasoning, Apache 2.0 license.

Recent
LLMs Can Unmask Online Users for $4, Study Finds

LLMs Can Unmask Online Users for $4, Study Finds

Researchers from ETH Zurich and Anthropic show that LLM agents can strip pseudonymity from forum posts at scale for as little as $1.41 per target - matching what human investigators could do in hours.

Nvidia Rules Out $100B OpenAI Bet as IPO Nears

Nvidia Rules Out $100B OpenAI Bet as IPO Nears

Jensen Huang confirmed Nvidia's $30B OpenAI investment will likely be its last direct equity stake, killing the $100B pledge as OpenAI races toward a public listing at a $730B valuation.

Meta's $100B AMD Bet Is a Direct Shot at Nvidia

Meta's $100B AMD Bet Is a Direct Shot at Nvidia

Meta and AMD signed a 6-gigawatt, multi-year GPU pact worth up to $100B - announced days after a separate Nvidia expansion, signaling Meta's deliberate strategy to break single-vendor dependence in AI compute.

Best LLM Observability Tools in 2026

Best LLM Observability Tools in 2026

A data-driven comparison of Langfuse, LangSmith, Helicone, Braintrust, and Phoenix - the top LLM observability platforms for teams building AI in production.

Best GEO Tools in 2026 - Top 5 Platforms Ranked

Best GEO Tools in 2026 - Top 5 Platforms Ranked

A ranked review of the five best Generative Engine Optimization platforms in 2026 - from full-stack content generation to enterprise monitoring, with pricing, benchmarks, and honest trade-offs.