API

Claude's 1M Context Window Now GA - No Premium Pricing

Anthropic made the 1M-token context window generally available for Claude Opus 4.6 and Sonnet 4.6, dropping the long-context pricing premium entirely - a 900K-token request now costs the same per token as a 9K one.

Perplexity CTO Moves Away from MCP Toward APIs and CLIs

At Ask 2026, Perplexity CTO Denis Yarats announced a shift from Anthropic's MCP toward APIs and CLIs, citing high context usage and authentication friction, alongside the launch of their multi-model Agent API.

Grok 4 - xAI's Flagship Reasoning Model

Grok 4 is xAI's frontier reasoning model, the first to break 50% on Humanity's Last Exam, with a 256K context window, $3/M input pricing, and a Heavy multi-agent variant built on 200,000 GPUs.

Meta Opens WhatsApp AI API Under EU Pressure - For a Price

Meta reversed its WhatsApp ban on rival AI chatbots in Europe and Brazil, but now charges rivals up to €0.13 per message - pricing critics call a disguised ban.

Gemini 3.1 Pro Tops Benchmarks but Developers Can't Rely on It

Gemini 3.1 Pro leads ARC-AGI-2, LiveCodeBench, and 11 other benchmarks with 750 million users and 21.5% market share - but developers report stalled responses, leaked thinking tokens, and API outages that make it unusable for production coding and agent workflows.

Google Launches Gemini 3.1 Flash-Lite as Gemini 3 Pro Dies

Google quietly launched Gemini 3.1 Flash-Lite Preview - the first Flash-Lite in the Gemini 3 series - while announcing Gemini 3 Pro will shut down March 9. Developers have six days to migrate.

← Previous

API