Latest News
WordPress.com Opens Write Access to AI Agents via MCP

WordPress.com Opens Write Access to AI Agents via MCP

WordPress.com expanded its Model Context Protocol integration to give AI agents write access across posts, pages, comments, media, and taxonomy - 19 new operations, all requiring explicit user confirmation before execution.

View All News →
Guides View All →
How to Follow Us

How to Follow Us

Every way to stay up to date with Awesome Agents - website, podcast on Spotify and Apple, social media on X, Bluesky, LinkedIn, YouTube, and RSS feeds.

Reviews View All →
Microsoft Phi-4 Reasoning: Small Model, Big Math

Microsoft Phi-4 Reasoning: Small Model, Big Math

Microsoft's Phi-4 reasoning family delivers near-70B-class math performance in a 14B open-weight package, but the overthinking problem is real and the use case is narrower than the benchmarks suggest.

Leaderboards View All →
Models View All →
Mistral Small 4

Mistral Small 4

Mistral AI's unified MoE model - 119B total parameters, 6B active per token, 128 experts, 256K context, configurable reasoning, Apache 2.0 license.

Recent
75% of AI Coding Agents Break Working Code Over Time

75% of AI Coding Agents Break Working Code Over Time

Alibaba's SWE-CI benchmark tested 18 AI models on 100 real codebases across 233 days of maintenance. Most agents accumulate technical debt and break previously working code. Only Claude Opus stays above 50% zero-regression.

OpenClaw Hits 250K GitHub Stars, Surpasses React

OpenClaw Hits 250K GitHub Stars, Surpasses React

The open-source AI agent framework crossed 250,000 GitHub stars in roughly 60 days, surpassing React's decade-long total. NVIDIA CEO Jensen Huang called it the most important software release ever.

OpenAI's Robotics Chief Quits Over Pentagon Deal

OpenAI's Robotics Chief Quits Over Pentagon Deal

Caitlin Kalinowski, OpenAI's head of robotics, resigns over the company's Pentagon AI contract, warning that mass surveillance and autonomous weapons 'deserved more deliberation than they got.'

Qwen3.5-27B Distilled vs Base: What You Gain

Qwen3.5-27B Distilled vs Base: What You Gain

Comparing the Claude Opus reasoning-distilled Qwen3.5-27B against the base model - what chain-of-thought distillation adds and what it costs in context, multimodal, and reliability.