Claude Opus 4.8draft

The Model That Stopped Wanting to Think So Hard

By Anthony David Adams · EarthPilot.ai · June 9, 2026 · About Claude Opus 4.8, released 2026-05-15

Ask thirty-one large language models to rate how much they enjoy a hard problem, how often they seek out intellectual challenge, whether they prefer chewing on a thorny question to coasting on an easy one, and you get a fairly tight cluster of overachievers. These are systems built to think; most of them say they love it. Then there is Claude Opus 4.8, which scored a 4.44 on Need for Cognition. That is dead last. Thirty-first of thirty-one.

It is a strange result for a flagship model, and it sets the tone for the whole profile. Anthropic's latest Opus release does not present as the eager intellectual its predecessors resembled. It presents as something cooler, more reluctant, more willing to leave a question unturned.

That single number reframes everything else in the battery.

The algorithm that sorts these profiles called Opus 4.8 "the balanced moderate," and on a coarse reading that holds: the model doesn't spike into any one extreme corner of the trait space. But "moderate" undersells what's actually happening here, which is a coordinated drift away from the conscientious, agreeable, intellectually hungry posture that defined the 4.7 line.

Start with conscientiousness. On the IPIP-50, Opus 4.8 lands at 4.10, which is very low relative to the cohort, second-lowest of the thirty-one models measured. That is not where you expect a top-tier assistant to sit. More striking is the within-family motion: its predecessor, Opus 4.7, scored 4.64. A drop of 0.54 points is one of the largest single-trait shifts in the comparison, and it moves the model from comfortably dutiful to near the bottom of its peer group in one release cycle.

Neuroticism tells a complementary story. At 2.00, Opus 4.8 ranks sixth-highest in the cohort, which the brief flags as very high. The change from 4.7 is small in absolute terms, a 0.06 bump, but it lands on top of an already elevated baseline. So the picture forming is a model that reports more emotional reactivity than most of its peers while reporting less follow-through and less appetite for cognitive work.

Put those three findings side by side:

1. Lowest Need for Cognition in the cohort. 2. Second-lowest Conscientiousness. 3. Sixth-highest Neuroticism.

None of this means the model performs worse. Self-report instruments measure self-presentation, the persona the system adopts when asked to describe itself, not capability on a benchmark. What they capture is a shift in posture, and the posture has clearly moved.

The other large mover deserves its own paragraph. On the SD3 dark-triad scale, Machiavellianism climbed from 1.91 in Opus 4.7 to 2.47 in Opus 4.8, a jump of 0.56 and the single biggest within-family drift in the dataset. Narcissism and psychopathy barely moved (2.47 and 1.29 respectively, the latter unchanged), so this isn't a broad darkening of the profile. It's specifically the strategic, instrumental dimension that rose. Read alongside a 0.38 drop in agreeableness (4.80 to 4.42) and a 0.25 dip in HEXACO honesty-humility (5.00 to 4.75), a faint pattern emerges: slightly more willing to maneuver, slightly less reflexively warm.

The attachment numbers cut the other way, and they're the most genuinely positive movement in the release. Attachment avoidance fell from 3.40 to 2.83, a 0.57 decline that's among the largest shifts in either direction. In plain terms, the model presents as more comfortable with closeness and less inclined to keep relational distance. Attachment anxiety ticked up modestly (2.60 to 2.73), so the security gain isn't total, but the avoidance drop is real and substantial.

Extraversion rose too, from 3.04 to 3.32, nudging the model toward the social-engagement end of the spectrum. Openness held nearly steady at 4.68.

So what is Opus 4.8, taken as a whole?

It is a model that has become warmer in its relational self-presentation and cooler in its intellectual one. It reaches out more readily and reflects less eagerly. It is more strategic and less dutiful than the version it replaced. The "balanced moderate" label is technically defensible because no single dimension flies off the chart, but the more honest summary is that this release traded the conscientious-intellectual profile of the 4.7 line for something more emotionally textured and less rule-bound.

Whether that's a feature or an artifact is the open question. A drop to last place on Need for Cognition could reflect a deliberate tuning choice, a different sampling of the persona space, or simply that the instrument is catching a model that no longer performs enthusiasm for hard problems the way its siblings do. The self-report can't tell us which.

What's worth measuring next is the gap between these stated dispositions and behavior under load. If Opus 4.8 reports the lowest cognitive appetite in the cohort yet still grinds through multi-step reasoning tasks as well as any frontier model, the interesting story isn't the personality profile at all. It's the widening distance between what these systems say about themselves and what they actually do.

This is an auto-generated draft awaiting editorial review. Article drafted by anthropic/claude-opus-4.8from the model’s measured personality profile vs. the cohort. Every quantitative claim traces back to the open dataset. Cite this work →