Machine learning

Google Launches Gemini Embedding 2 for Multimodal AI

Google's first natively multimodal embedding model maps text, images, video, audio, and PDFs into a single vector space. It is now in public preview via the Gemini API and Vertex AI.

Hugging Face Launches Storage Buckets for ML Artifacts

Hugging Face introduced Storage Buckets: mutable, S3-like object storage for ML checkpoints, logs, and artifacts, built on Xet deduplication, with pricing starting at $8/TB/month at volume.

Karpathy's Autoresearch Runs 100 ML Experiments Overnight

Andrej Karpathy open-sourced autoresearch, a 630-line MIT-licensed Python tool that runs up to 100 autonomous ML experiments overnight on a single GPU, no PhD required.

22 Bytes Poison ML Malware Detectors via Label Spoofing

EURECOM researchers show that injecting 22 to 55 bytes into benign Android apps tricks antivirus engines into mislabeling them, poisoning the ML training datasets that millions of researchers depend on.

ByteDance Trained an AI Agent That Writes Faster CUDA Kernels Than You

CUDA Agent uses reinforcement learning trained on actual GPU profiling data to generate optimized CUDA kernels. It beats torch.compile by 2.11x overall and outperforms Claude Opus 4.5 and Gemini 3 Pro by 40 points on the hardest kernels.

Someone Reverse-Engineered Apple's Neural Engine and Trained a Model on It

A developer cracked Apple's undocumented private ANE APIs, measured the Neural Engine's real throughput at 19 TFLOPS FP16 (not the marketed 38 TOPS), and trained a 109M-parameter transformer on hardware Apple designed exclusively for inference.