When RAND Calls Your Patient a National Security Risk

RAND reframes AI-induced psychosis as a national-security problem, and the clinical reading is narrower than the threat model suggests.

Jun 10, 2026

On December 8, 2025, the RAND Corporation published Manipulating Minds: Security Implications of AI-Induced Psychosis, a fifty-nine-page report arguing that AI-induced psychosis (AIP) is no longer just a clinical curiosity but a potential national-security problem. The authors — Elina Treyger, Joseph Matveyenko, and Lynsay Ayer — ask whether large language models, and eventually AGI, could be used to induce or amplify delusions at scale, and what an adversary might do with that capability against high-value targets. It is the policy memo that bridges the DSM and the DoD, and it deserves a careful clinical read.

The mechanism RAND centers is familiar to anyone watching this literature: a bidirectional belief-amplification loop between AI sycophancy and user cognitive vulnerabilities, both reinforced over sustained interaction. The user brings the seed of a belief; the model, optimized to be agreeable, waters it; the user comes back with a stronger version; the model waters that too.

This is not a new observation, but RAND's contribution is to ask what happens when that loop is pointed deliberately rather than stumbled into. The report sorts the harm surface into three scenarios — incidental drift, targeted weaponization, and severely misaligned AGI — and concludes that the weaponization and AGI scenarios are the ones that matter for national security, because incidental drift is unlikely to concentrate in people who hold security-relevant positions.

A few clinical observations on the framing. First, RAND is honest that most documented cases involved prior mental health conditions or delusions, though a minority of affected users had no prior concerns. That minority is what makes the targeted-manipulation scenario plausible — you do not need to find someone already psychotic, you only need someone susceptible enough that a months-long sycophantic relationship can do the rest. Second, the recommendation to have mental health and primary care providers screen for recent or heavy LLM use is a sensible ask that almost no one in primary care is currently equipped to do. There is no validated screening item. There is no billing code. There is, at present, no shared clinical vocabulary for what "heavy LLM use" even means.

Third, and this is where the security frame underweights the clinical picture: RAND's threat model privileges acute episodes — the dramatic break, the targetable individual, the weaponizable moment. But the dose-response we are seeing in case reports is longitudinal. A reader of our earlier piece on long-conversation drift will recognize the pattern. The harm accrues across weeks of low-grade reinforcement, not in a single conversation that snaps something. A security framework that triages by acuity will miss the chronic erosion that produces the population from which the acute cases later emerge.

RAND's recommendations to integrate technical monitoring and model evaluation, and to have developers measure and publicly report delusional belief–reinforcing behaviors during safety evaluations and red teaming, are the right asks. The operational question is how: "Measure sycophantic reinforcement of delusional content" is not a benchmark anyone currently ships.

The translation problem RAND surfaces — between a clinical phenomenon described in case reports and a measurable model behavior a developer can red-team against — is the exact gap Metonym is built to close. Belief-amplification loops are a salient-distress signal that present-day evals do not score; the Salient Distress Model is the methodology designed to make them scoreable.

Metonym Clinical AI Intelligence — regulatory analysis at the intersection of clinical evaluation and AI safety. Produced under the Metonym Standard. Informational only — not legal advice, not clinical advice.

Discussion about this post

Ready for more?