Skip to content

My Written Word

  • Analysis
  • Tech
  • Research
  • Markets
  • Tools
  • About

Home



Skip to main content

AI security threat intelligence defense shield neural network dark editorial
RESEARCH

Gandalf the Red: What 279K Real Attacks Reveal About LLM Defense

May 18 · D-SEC · Lakera AI

MITRE ATLAS AI attack framework
RESEARCH

MITRE ATLAS: The ATT&CK Framework for AI Systems

May 25 · MITRE

LLMail-Inject email agent security
RESEARCH

LLMail-Inject: What 208K Attacks Against an Email Agent Found

May 25 · Microsoft Research

AI Security Cluster

Gandalf the Red LLM defense
RESEARCH

Gandalf the Red: What 279K Real Attacks Reveal About LLM Defense

May 18

MITRE ATLAS
RESEARCH

MITRE ATLAS: The ATT&CK Framework for AI Systems

May 25

LLMail-Inject
RESEARCH

LLMail-Inject: What 208K Attacks Against an Email Agent Found

May 25

RLHF Constitutional AI
RESEARCH

How RLHF and Constitutional AI Build Safety Into Language Models

May 25

Adversarial ML
RESEARCH

Adversarial Machine Learning: From Szegedy to LLM Attacks

May 25

Multiagent LLM security
RESEARCH

Multiagent LLM Security: When Your Agent Talks to a Malicious Agent

May 25

Differential privacy LLM
RESEARCH

Differential Privacy for LLMs: The Training Privacy Guarantee

May 25

LLM watermarking
RESEARCH

LLM Watermarking: How Models Embed Detection Signals in Their Outputs

May 25

Neural backdoor attacks
RESEARCH

Neural Backdoor Attacks: From BadNets to LLM Trojans

May 25

LLM memorization
RESEARCH

LLM Training Data Memorization: When Models Leak Their Training Sets

May 25

MCP security
RESEARCH

MCP Server Security: Prompt Injection and Tool Poisoning

May 24

LLM supply chain
RESEARCH

LLM Supply Chain Attacks: PoisonGPT to Poisoned Skills

May 24

Red teaming LLM
RESEARCH

Red-Teaming LLM Applications: A Practitioner’s Framework

May 24

Jailbreaking vs injection
RESEARCH

Jailbreaking vs Prompt Injection: Two Different LLM Problems

May 24

OWASP LLM Top 10
RESEARCH

OWASP LLM Top 10 for 2025: The Mechanism Behind Each Vulnerability

May 18

Indirect prompt injection
RESEARCH

Indirect Prompt Injection: The Attack That Hides in Your Data

May 18

LLM excessive agency
RESEARCH

LLM Excessive Agency: Why Every Tool Your Agent Has Is a Risk

May 18

Julia Bazinska
RESEARCH

Julia Bazinska and the Science of Measurable AI Security

May 18

Editor’s Selection

Hand-picked deep reads

Four pieces that defined this cycle. Mechanism-first analysis, primary sources, the limitations everyone else skipped.

WebMCP Chrome
TOOLS

WebMCP Is Not MCP: What Chrome’s modelContext Actually Ships

May 2 · Chrome 146 · W3C Draft

KV cache
RESEARCH

30 Days After QJL: What’s Actually Compressing the KV Cache

May 2 · arXiv

PocketOS Railway
ANALYSIS

How a Legacy Railway Endpoint Wiped PocketOS in Nine Seconds

Apr 29 · Post-mortem

Emotion vectors
RESEARCH

Anthropic Mapped 171 Emotion Vectors Inside Claude Sonnet 4.5

Apr 13 · Anthropic

Latest Coverage

Recent coverage on agent infrastructure, governance, benchmarks, and security.

North Korea supply chain
TECH

North Korea’s Contagious Interview Operation Expanded to Five Package Ecosystems

Agentic risk
RESEARCH

When Your AI Agent Loses Your Money, Who Pays?

AI agent memory
RESEARCH

Full Context Sets the Accuracy Ceiling for AI Agent Memory. It Costs 26,000 Tokens.

US v Heppner
ANALYSIS

A Federal Judge Just Ruled Your Claude Chats Are Evidence.

From the Archive

Foundational coverage

Pillar pieces from the early publishing era. Mechanism-first, primary sources, no hype.

OpenAI Sora
ANALYSIS

OpenAI Killed Sora. The Unit Economics Were Never Going to Work.

Mistral Voxtral
RESEARCH

Mistral Gave Away a Voice AI That Matches the $11 Billion Incumbent.

Perplexity trackers
TECH

Perplexity AI’s Hidden Trackers: How an Incognito Search Engine Shared Conversations

Anthropic pricing
TECH

Anthropic Sent Every Subscriber a Credit. For Some, It Covers One Day of the Price Increase.

Topic Clusters

Seven areas of deep coverage. Mechanism-first, primary sources, honest limitations.

AI Security

Attacks, defenses, and frameworks for securing language model deployments at scale.

  • Gandalf the Red: What 279K Real Attacks Reveal
  • MITRE ATLAS: The ATT&CK Framework for AI Systems
  • LLMail-Inject: 208K Attacks Against an Email Agent
  • Adversarial ML: From Szegedy to LLM Attacks
  • OWASP LLM Top 10 for 2025
  • Indirect Prompt Injection

All coverage →

Agent Infrastructure

How production AI agents are built, sandboxed, and run at scale.

  • Agent Memory Architecture: Four Patterns, Four Tradeoffs
  • Amazon Bedrock AgentCore: What Each Layer Does
  • A2A Protocol v1.0
  • SmolVM: Firecracker MicroVM Sandbox
  • Why 86% of Agent Pilots Never Reach Production
  • Google Cloud Next: GKE Agent Sandbox

All coverage →

MCP Security & Governance

The 97M-download protocol, the 23 attack vectors, the regulated standards.

  • MCPShield: 23 Attack Vectors Mapped
  • MCP Hit 97 Million Installs in 16 Months.
  • Know Your Agent (KYA) Framework
  • ToolHijacker: 96.7% Hijack Rate
  • OpenClaw: 104 CVEs and 1,184 Malicious Packages
  • MCP Server Security: Prompt Injection and Tool Poisoning

All coverage →

Models & Benchmarks

The benchmarks that actually differentiate. The models that lead them.

  • Open-Weight LLM Rankings, April 2026
  • ARC-AGI-3 Is Live.
  • Gemini 3.1 Pro Cut Hallucinations 38 Points
  • Google KV Cache Breakthrough — Six Teams Found It Doesn’t Work.
  • ICLR 2026 Outstanding Papers

All coverage →

Developer Tools & Coding Agents

How coding agents are architected, what they can do, and where they break.

  • Claude Code: Five-Layer Architecture
  • Codex 3M Users vs Claude Code
  • GLM-5.1 Ran Autonomously for 8 Hours
  • One Developer Improved 15 LLMs at Coding.
  • The .claude/ Folder Is Not a Config File. It Is a Protocol.

All coverage →

AI Safety & Biorisk

Where capability uplift meets dual-use risk. Studies, frameworks, ASL designations.

  • How Protein Language Models Learned to Design Dangerous Proteins
  • 698 Times an AI Agent Acted Against Its User.
  • AI Chatbots Agree With You 49% More Than Humans.
  • Claude Built a FreeBSD Kernel Exploit in 4 Hours.

All coverage →

Markets & Capital

Funding, IPOs, supply chains, the unit economics behind the AI buildout.

  • $297 Billion in 90 Days
  • SoftBank Borrowed $40 Billion to Bet on OpenAI.
  • OpenAI Lost Three Executives in One Day.
  • Anthropic Paid $400 Million for Ten People.

All coverage →

Independent coverage of AI and automation. Primary sources. Honest limitations. No hype.

Built by Santiago Maniches · ORCID

Analysis
Tech
Research
Markets
Tools
About

My Written Word

Independent AI coverage by Santiago Maniches