
Shield AI Raises $2B at $12.7B in Defense AI Bet
Shield AI closed a $2B raise at a $12.7B valuation, more than doubling from $5.3B a year ago, to fund its Hivemind autonomous pilot software and acquire Pentagon simulation vendor Aechelon Technology.

Starting April 24, GitHub will use Copilot Free and Pro users' interaction data to train AI models by default - with the opt-out buried in settings.

Tencent open-sources Covo-Audio, a 7B end-to-end audio language model with native full-duplex conversation that beats larger closed models on key benchmarks.

Anthropic's new Auto Mode for Claude Code uses a two-layer classifier to automatically approve or block risky commands, offering a middle path between manual approvals and full autonomy.

ARC Prize Foundation launched ARC-AGI-3 today with a fully open-source agent toolkit. The best AI in the preview phase scored 12.58% against a human baseline of 100%.

New details reveal Apple has full data center access to Gemini and can create smaller on-device derivative models - far more control than the original deal disclosed.

New York's RAISE Act is now on the books, requiring frontier AI developers to publish safety protocols, report incidents within 72 hours, and submit to annual audits by January 2027.

Kleiner Perkins closes a $3.5B dual fund - its largest raise in the current era - betting on Anthropic, Harvey, and a 2026 IPO window.

The LiteLLM supply chain attack originated from Trivy - the security scanner in LiteLLM's CI/CD pipeline. TeamPCP compromised Trivy, stole the PyPI publishing token, and uploaded backdoored packages directly.

A practical guide to picking the right AI model for your needs, comparing ChatGPT, Claude, Gemini, Perplexity, and Copilot across writing, coding, research, and more.

Multimodal AI can see, hear, and read at once - here's how it works and why it matters for everyday users.

A practical guide to AI research tools that help you find papers, summarize findings, and write better academic work in less time.

Moonshot AI's Kimi K2.5 delivers best-in-class open-weight math and a genuinely novel multi-agent architecture, but a brutal hallucination rate and slow inference limit its real-world reliability.

Microsoft's Phi-4 reasoning family delivers near-70B-class math performance in a 14B open-weight package, but the overthinking problem is real and the use case is narrower than the benchmarks suggest.

LTX-2.3 is a 22-billion-parameter open-source video and audio generation model from Lightricks that rivals closed commercial tools - at zero cloud cost.

Rankings of the best AI models and agent frameworks on computer use benchmarks - OSWorld, OSWorld-Verified, and ScreenSpot-Pro - updated March 2026.

Rankings of AI models by safety metrics including refusal rates, jailbreak resistance, bias scores, and truthfulness across major benchmarks.

Rankings of the best text-to-speech and speech-to-text AI models by naturalness, accuracy, latency, and pricing.

Xiaomi's MiMo-V2-Pro is a 1-trillion-parameter MoE model with 42B active params, 1M context, and agentic coding performance that rivals Claude Sonnet 4.6 at a fraction of the cost.

Anthropic's mid-tier model matches Opus 4.6 on computer use, leads all models on office productivity tasks, and costs five times less than the flagship at $3/$15 per million tokens.

Cohere Command A Vision is a 112B multimodal model that leads on document and OCR benchmarks, beating GPT-4.1 across seven visual understanding tasks.

Rankings of the best open-weight and open-source large language models in February 2026, including DeepSeek V3.2, Qwen 3.5, Llama 4 Maverick, GLM-5, and Mistral 3.

A detailed review of Google's Gemini 3 Pro, a natively multimodal AI model that leads in vision, spatial reasoning, and video understanding.

Head-to-head comparison of ChatGPT, Claude, and Gemini in 2026. Pricing, strengths, weaknesses, and best use cases for each AI assistant.

A beginner-friendly explanation of AI agents, covering what makes them different from chatbots, real-world examples, key frameworks, and the growing agent economy.

Complete MMLU-Pro benchmark rankings measuring graduate-level knowledge across 14 subjects with 12,000 questions and 10 answer options per question.

Z.ai releases GLM-5, a 744B parameter open-source Mixture-of-Experts model purpose-built for agentic tasks, scoring 77.8% on SWE-bench Verified and 56.2% on Terminal-Bench 2.0.

A thorough review of Cursor, the VS Code fork that has become the gold standard for AI-assisted coding with Composer mode, full project understanding, and multi-file edits.

Compare the best tools for running large language models locally: Ollama, LM Studio, llama.cpp, GPT4All, and LocalAI. Includes hardware requirements and model recommendations.

OpenAI begins testing advertisements in ChatGPT for Free and Go tier users in the US, while Plus, Pro, Business, Enterprise, and Education plans remain ad-free.

A thorough review of DeepSeek V3.2, the 671B parameter MoE model that delivers frontier-level performance at dramatically lower cost with an MIT license.

A practical tutorial on running open-source language models locally using Ollama, llama.cpp, and LM Studio, with hardware requirements and model recommendations.

A hands-on review of Anthropic's Claude Code CLI, a terminal-first AI coding assistant that excels at large refactors, architecture work, and complex multi-file projects.