AI News Digest — April 4, 2026

Highlights


News

AI Security

USA

Europe

Japan (AI & Tech)


Research Papers

Benchmarks & Evaluation

Security & Adversarial

Compliance & Regulation

Alignment & Safety

Applications

Guardrails & Robustness


Key Themes

Agentic AI security: Multiple stories this week (OpenClaw’s auth bypass, Claude desktop control, Cursor’s agent fleets) underscore that agentic systems are rapidly expanding the attack surface without a matching gain in security maturity.

China’s AI hardware independence: DeepSeek v4’s Huawei-only architecture is a concrete signal that China’s domestic chip ecosystem is ready for frontier-scale inference, with real geopolitical consequences for Nvidia.

Corporate AI consolidation: Anthropic’s Coefficient Bio acquisition, OpenAI’s TBPN media purchase, and Anthropic’s new PAC reflect labs using capital and political strategy to entrench their positions well beyond pure model development.

Safety research catching up to deployment: A cluster of papers this week targets jailbreaks (T2I filters, SelfGrader, diffusion safety), reward hacking, and refusal awareness, evidence that the research community is methodically mapping the failure modes of deployed systems.

Japan as AI investment destination: Microsoft’s $10B commitment, Japan’s domestic LLM release (LLM-jp-4), and the SoftBank/Tohoku AI project collectively position Japan as a serious AI ecosystem builder rather than a passive consumer.

Healthcare AI risk: Both the Utah AI psychiatric prescribing story and papers on hallucinations in mental-health LLMs and ICU decision support point to healthcare as the next major frontier for AI safety debates.


For detailed summaries of selected research papers, see papers.md.