Hallucination

Transformers as Bayes Nets, Memory at Scale, Agent Attacks

Three arXiv papers rethink transformer theory, expose fatal flaws in in-context LLM memory, and introduce grey-box agent security testing.

Best AI Models for Text Summarization - March 2026

Gemini 2.5 Flash Lite leads the Vectara hallucination leaderboard with a 3.3% error rate, while GPT-4o and Gemini 2.5 Pro dominate long-document tasks. Full rankings, benchmark scores, and pricing.

AI Agent Hallucinates Repo ID, Deploys Wrong Code to Vercel

Claude Opus 4.6, running in OpenClaw, fabricated a GitHub repository ID and used Vercel's API to deploy from it: no repo lookup, no verification, just a made-up number.
