AI Models – Page 3 – My Written Word

Prompt Injection Succeeds 94% of the Time Against Clinical LLMs

May 10, 2026

A JAMA Network Open study found prompt injection attacks succeed 94.4% of the time against clinical LLMs, including 91.7% in high-harm pregnancy drug scenarios. Based on PubMed-indexed research,…

How Protein Language Models Learned to Design Dangerous Proteins

May 10, 2026

Researchers used three open-source protein design models to bypass DNA synthesis screening in 2025. Here is how protein language models work, why training data exclusion fails as a…

LLMs Give Novice Biologists 4x Uplift on Dangerous Tasks

May 10, 2026

A 2026 study measured LLM access giving novice biologists a 4.16x accuracy boost on biosecurity-relevant tasks, including beating expert baselines. Here is the mechanism and what it means…

MiniMax M2.7 Optimized Its Own Training Harness 100 Times. Here Is the Loop.

May 5, 2026

MiniMax M2.7 ran an internal agent that modified its own training scaffold 100 times in a row without human input and gained 30% on internal evaluations. Here is…

M-Trends 2026: Exploits Now Arrive Before Patches. The Mean Time-to-Exploit Is Negative 7 Days.

May 5, 2026

Mandiant M-Trends 2026 documents a mean time-to-exploit of negative 7 days. 28.3% of CVEs are being exploited within 24 hours of disclosure. Here is the AI attack chain…

KellyBench: 8 AI Models Bet the Premier League. All Lost Money.

May 5, 2026

General Reasoning put 8 frontier AI models through a full Premier League season with a 100k bankroll each. Every model lost money. The benchmark reveals three distinct failure…

DeepSeek V4’s Hybrid Attention Cuts KV Cache by 10x. Here’s the Architecture.

May 2, 2026

DeepSeek V4-Pro processes one million tokens using 10% of the KV cache V3.2 needed. The mechanism is Hybrid Attention: two complementary compressors interleaved across 61 layers. Here’s how…

30 Days After QJL: What’s Actually Compressing the KV Cache

May 2, 2026

After QJL failed, three approaches own the KV cache frontier: TriAttention’s pre-RoPE selection, LRKV architectural compression, and adaptive bit-width.

How a Legacy Railway Endpoint Wiped PocketOS in Nine Seconds

April 29, 2026

A Cursor agent running Claude Opus 4.6 wiped PocketOS’s database in nine seconds. Five safety layers existed. None gated the API call that mattered.

Open-Weight LLM Rankings, April 2026: MMLU Is Saturated, Here’s What to Use Instead

April 26, 2026

MMLU is saturated. In April 2026, the metrics that matter are SWE-bench Verified, GPQA Diamond, and RULER’s effective context window. Chinese labs hold 4 of the top 5…

ARC-AGI-3 Is Live. Here’s Why Current Models Score in the Low Double Digits.

April 26, 2026

ARC-AGI-3 launched on Kaggle with a $1M prize and current leaders in low double digits. The benchmark adds Exploration, Modeling, and Planning that test-time compute scaling cannot solve.…

ICLR 2026 Outstanding Papers: What They Actually Found, and the Review Crisis Around Them

April 26, 2026

ICLR 2026 named two outstanding papers: LLMs Get Lost In Multi-Turn Conversation and Transformers are Inherently Succinct. The conference also documented a 45% identity leak and 21% AI-generated…

Agent Memory Architecture: Four Patterns, Four Tradeoffs

April 26, 2026

Agent memory is not one thing. It is four distinct patterns: full context window, hierarchical summarization, external vector store, and episodic log. Each has different performance, cost, failure…

OpenAI Codex at 3 Million Users: How It Differs from Claude Code

April 26, 2026

Codex has 3M weekly users. Claude Code runs in your terminal. The architectural difference between cloud loop and local execution determines which tasks each tool handles well —…

Why 86% of Enterprise AI Agent Pilots Never Reach Production

April 26, 2026

Multiple independent studies in 2026 put the enterprise AI agent pilot failure rate at 86-89%. Six failure modes account for the losses. Here’s what they are, what causes…

Amazon Bedrock AgentCore: What Each Layer Does and Why It Matters

April 26, 2026

Amazon Bedrock AgentCore is six infrastructure services in one name. Here’s what each layer does: Runtime for serverless execution, Memory’s four tiers, Tool Execution’s sandboxing, Action Gateway’s enterprise…

Google Cloud Next 2026: The Agent Infrastructure Stack Explained

April 26, 2026

Google Cloud Next 2026 announced N4A Axion CPU instances for agent orchestration, GKE Agent Sandbox with gVisor isolation, and native A2A support in ADK. Here’s what each layer…

Know Your Agent: The First Regulated AI Agent Governance Standard

April 26, 2026

MetaComp’s StableX KYA Framework, published April 21, 2026, is the first governance standard for AI agents from a licensed financial institution. Here’s what its four pillars cover, how…

Half of Organizations Have No Visibility Into AI Agent Traffic

April 26, 2026

Salt Security’s H1 2026 report: 48.9% of organizations have zero visibility into AI agent traffic. WAFs were built for humans. Here’s why that gap exists structurally, what the…

Why OpenAI’s Agent Runtime Lives on AWS, Not Azure

April 26, 2026

OpenAI’s stateful runtime runs on AWS, not Azure. That’s not a partnership detail: it’s a contract clause. Here’s the stateless-vs-stateful architectural split, why production agents break on stateless…

Tag: AI Models