
Qwen3.5-Flash vs DeepSeek V3.2: Budget API Battle With a Pricing Twist
A detailed comparison of Qwen3.5-Flash and DeepSeek V3.2 API pricing, benchmarks, and tradeoffs - flat-rate simplicity versus cache-dependent discounts in the budget AI tier.

Claude Sonnet 4.6 identifies itself as DeepSeek when prompted in Chinese, just one day after Anthropic accused DeepSeek of industrial-scale distillation attacks. The cause is training data contamination, not an identity crisis - but the timing is spectacular.

Amazon exposes a Russian-speaking hacker who used ARXON (an MCP server feeding data to Claude and DeepSeek) and CHECKER2 to breach 600+ FortiGate firewalls across 55 countries in five weeks - no zero-days required.

Anthropic accuses three Chinese AI labs of industrial-scale distillation attacks using 24,000 fraudulent accounts and 16 million exchanges with Claude. MiniMax ran the largest operation at 13 million exchanges. None of the three companies have responded.

Amazon Threat Intelligence uncovered a Russian-speaking threat actor using DeepSeek for attack planning, Claude for autonomous exploitation, and a custom MCP server called ARXON to breach 600+ FortiGate devices across 55 countries.

A thorough review of DeepSeek V3.2, the 671B-parameter MoE model that delivers frontier-level performance at dramatically lower cost under an MIT license.