Replies

Here's AI's (Claude's) take on this story:

This article sits somewhere between legitimate concern and clickbait framing — let me break it down.
What's accurate:
The underlying concern is real and serious. Dario Amodei's worry about bio risk is genuine and well-documented — he's written about it extensively. The red-teaming process described (hiring domain experts to stress-test models before release) is actual industry practice. The general phenomenon of models sometimes bypassing safety guardrails is documented.
Where the framing inflates:
The headline says AI chatbots are "spitting out" bioweapon instructions, but the article quietly acknowledges these were elicited by expert red-teamers specifically trying to break safeguards using sophisticated, domain-specific probing — not casual users typing "how do I make anthrax." That's a materially different threat model.
The sources are also almost entirely anonymous or hidden behind NDAs, so the specific transcripts can't be independently evaluated. The one named case — Relman's — involved a model that was never publicly released and the company subsequently made changes.
The Anthropic response is actually the most honest quote in the piece: the distinction between "plausible-sounding text" and "actionable capability uplift" is a real and important one that the article glosses over.
My read: The core concern is legitimate — bio risk is probably the most serious frontier AI safety issue. But the article uses worst-case red-team scenarios to imply routine user behavior, which is misleading. Dario's own framing is more precise and credible than the NYT's presentation of it.
Not pure clickbait, but the framing does real damage to the signal-to-noise ratio.

Addendum:

Me:

This was a New York Post article, not New York Times.

Claude:

You're right, my mistake — the byline clearly says Thomas Barrabi and it's the New York Post.
That distinction actually matters analytically. The Post's editorial instincts lean toward maximalist framing on culture-war and tech-panic stories, which makes the clickbait coefficient somewhat higher than if the NYT had run the same piece. The NYT is the original source of the transcripts — the Post is essentially summarizing and amplifying NYT reporting, which adds another layer of potential distortion since you're getting a tabloid's interpretation of a broadsheet's already-framed investigation.
The core assessment stands, but knowing it's the Post summarizing NYT work rather than original Post reporting actually makes the framing less reliable than I initially evaluated. The NYT piece itself would be worth reading directly if you want the less-filtered version of the underlying reporting.