Latest News
WordPress.com Opens Write Access to AI Agents via MCP

WordPress.com Opens Write Access to AI Agents via MCP

WordPress.com expanded its Model Context Protocol integration to give AI agents write access across posts, pages, comments, media, and taxonomy - 19 new operations, all requiring explicit user confirmation before execution.

View All News →
Guides View All →
How to Follow Us

How to Follow Us

Every way to stay up to date with Awesome Agents - website, podcast on Spotify and Apple, social media on X, Bluesky, LinkedIn, YouTube, and RSS feeds.

Reviews View All →
Microsoft Phi-4 Reasoning: Small Model, Big Math

Microsoft Phi-4 Reasoning: Small Model, Big Math

Microsoft's Phi-4 reasoning family delivers near-70B-class math performance in a 14B open-weight package, but the overthinking problem is real and the use case is narrower than the benchmarks suggest.

Leaderboards View All →
Models View All →
Mistral Small 4

Mistral Small 4

Mistral AI's unified MoE model - 119B total parameters, 6B active per token, 128 experts, 256K context, configurable reasoning, Apache 2.0 license.

Recent
Best AI Note-Taking Apps in 2026

Best AI Note-Taking Apps in 2026

Compare the best AI note-taking apps of 2026 including Notion AI, Google NotebookLM, Obsidian, and Mem with pricing, features, and recommendations.

Best AI Data Analysis Tools in 2026

Best AI Data Analysis Tools in 2026

Compare the best AI data analysis tools of 2026 including Julius AI, ChatGPT Code Interpreter, and Claude analysis with pricing and features.

Best AI Meeting Assistants in 2026

Best AI Meeting Assistants in 2026

Compare the best AI meeting assistants of 2026 including Otter, Fireflies, Granola, and tl;dv with pricing, features, and recommendations.

CoT Control, Hidden Beliefs, and Dynamic Agent Benchmarks

CoT Control, Hidden Beliefs, and Dynamic Agent Benchmarks

New research shows reasoning models can't suppress their chain-of-thought, that they commit to answers internally long before their CoT reveals it, and that static benchmarks are inadequate for measuring real-world agent adaptability.