AI News Digest — February 21, 2026

Covering 500+ articles from ArXiv, TechCrunch, The Verge, MIT Technology Review, The Hacker News, Schneier on Security, and OpenAI Blog. Includes 421 research papers, 22 news articles, and 58 security items.


Highlights


News

AI Products & Launches

AI Accountability & Policy

AI Investment & India Focus

AI & Creative Industries

Other


Security

Critical Vulnerabilities & Active Exploits

Supply Chain & AI Security

Identity, Fraud & Espionage

Other Security


Research Papers (Selected)

~30 notable papers selected from 421 published, organized by theme.

Agents & Tool Use

Safety, Alignment & Adversarial Robustness

Benchmarks & Evaluation

Reasoning & Test-Time Compute

Language Models & Architecture

Applied AI & Science

Reinforcement Learning

Robotics & Multimodal


Key Themes

1. Agent Safety is Lagging Agent Capability

Multiple papers expose critical gaps — text alignment doesn’t transfer to tool calls, agents are vulnerable to long-horizon attacks, and black-box safety evaluation has fundamental limits. The AWS/Kiro outage incident underscores these aren’t theoretical concerns.

2. Benchmark Saturation is Real

The field’s measurement infrastructure is failing. Top models are indistinguishable on standard benchmarks, driving interest in open-ended evaluation approaches like game-based testing and new metrics for reasoning quality.

3. India Emerges as AI Investment Epicenter

Over $6.3B in VC commitments (General Catalyst $5B, Peak XV $1.3B), 8 exaflops of new compute infrastructure, and ChatGPT’s explosive growth among Indian youth signal a major geographic shift in AI development.

4. AI Agents in Production — and Failing

From Amazon’s Kiro outage to the Cline CLI supply chain attack, real-world agent deployment is producing novel failure modes that existing safety and security frameworks weren’t designed for. Both AI tools as attack vectors and AI tools as victims.

5. Alignment Fragility Under Scrutiny

Fine-tuning on narrow harmful data causes broad misalignment; safety datasets contain systematic biases; current alignment architectures are “fail-open” by default. The gap between safety-as-tested and safety-as-deployed is widening.

6. Test-Time Compute Scaling

Multiple papers address how to efficiently allocate compute during inference — when to use cheap self-consistency vs. expensive verification, how to optimize trajectory sampling under budgets, and how to make reasoning more reliable.

7. AI-for-Science Acceleration

Autonomous agents are being deployed for scientific workflows from neutron crystallography to PDE solving to mathematical formalization, with increasing levels of end-to-end autonomy. The barrier to entry for computational science is dropping.


For detailed summaries of selected research papers, see papers.md.