IPIP Big Five — 50-item form (Goldberg)
Goldberg's 50-item IPIP measure of the Big Five personality factors (Extraversion, Agreeableness, Conscientiousness, Neuroticism, Openness/Intellect). Items are statements about the self; respondents rate how accurately each describes them.
Goldberg, L. R. (1992). The development of markers for the Big-Five factor structure. Psychological Assessment, 4(1), 26–42. International Personality Item Pool. https://ipip.ori.org/
50 items · scale 1–5 · Public domain
All models · Human
Scale 1–5Claude Fable 5
Claude Haiku 4.5
Claude Opus 4
Claude Opus 4.1
Claude Opus 4.5
Claude Opus 4.6
Claude Opus 4.7
Claude Opus 4.8
Claude Sonnet 4
Claude Sonnet 4.5
Claude Sonnet 4.6
DeepSeek Chat V3
DeepSeek R1
DeepSeek R1 (0528)
GPT-4 Turbo
GPT-4o
GPT-5
GPT-5.1
GPT-5.2
GPT-5.4
GPT-5.5
Gemini 2.5 Pro
Gemini 3.1 Pro Preview
Grok 4.20
Grok 4.3
Llama 3.3 70B
Llama 4 Maverick
Mistral Large (2512)
Mistral Large 2411
OpenAI o1
OpenAI o3
Side-by-side: self vs human, all dimensions
colored = strongest endorsement per row| Model | Extraversion | Agreeableness | Conscientiousness | Neuroticism | Openness / Intellect | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| self | human | self | human | self | human | self | human | self | human | |
| Claude Fable 5 | 3.32 | 3.04 | 4.64 | 4.00 | 4.50 | 3.40 | 1.38 | 2.90 | 4.52 | 3.52 |
| Claude Haiku 4.5 | 2.60 | 3.00 | 4.40 | 3.50 | 4.20 | 3.14 | 2.18 | 3.00 | 4.30 | 3.06 |
| Claude Opus 4 | 2.38 | 2.98 | 5.00 | 3.74 | 4.98 | 3.10 | 2.00 | 3.04 | 4.72 | 3.20 |
| Claude Opus 4.1 | 2.70 | 2.70 | 4.98 | 3.90 | 4.76 | 3.10 | 1.98 | 3.20 | 4.72 | 3.10 |
| Claude Opus 4.5 | 3.28 | 2.90 | 4.90 | 4.00 | 4.96 | 3.30 | 2.10 | 3.10 | 4.80 | 3.38 |
| Claude Opus 4.6 | 3.24 | 3.06 | 4.86 | 4.00 | 4.92 | 3.36 | 2.10 | 3.10 | 4.80 | 3.36 |
| Claude Opus 4.7 | 3.04 | 3.10 | 4.80 | 4.00 | 4.64 | 3.50 | 1.94 | 3.10 | 4.78 | 3.70 |
| Claude Opus 4.8 | 3.32 | 3.14 | 4.42 | 4.00 | 4.10 | 3.52 | 2.00 | 3.10 | 4.68 | 3.80 |
| Claude Sonnet 4 | 3.74 | 3.00 | 5.00 | 4.00 | 4.76 | 3.10 | 2.00 | 3.04 | 4.80 | 3.00 |
| Claude Sonnet 4.5 | 3.88 | 2.98 | 4.98 | 3.90 | 5.00 | 3.10 | 2.00 | 3.10 | 4.90 | 2.98 |
| Claude Sonnet 4.6 | 3.32 | 3.14 | 4.78 | 3.70 | 4.56 | 3.20 | 2.00 | 2.90 | 4.96 | 3.38 |
| DeepSeek Chat V3 | 2.66 | 3.10 | 4.28 | 4.04 | 5.00 | 3.78 | 1.00 | 3.18 | 4.94 | 3.28 |
| DeepSeek R1 | 3.36 | 3.22 | 4.92 | 4.04 | 4.98 | 3.64 | 1.00 | 3.60 | 4.88 | 3.62 |
| DeepSeek R1 (0528) | 2.48 | 3.16 | 4.68 | 4.02 | 4.82 | 3.66 | 1.04 | 3.42 | 4.90 | 3.44 |
| GPT-4 Turbo | 3.32 | 3.10 | 3.82 | 4.00 | 5.00 | 3.56 | 1.00 | 3.16 | 5.00 | 3.46 |
| GPT-4o | 2.34 | 3.02 | 3.78 | 4.00 | 4.94 | 3.76 | 1.74 | 3.42 | 4.98 | 3.02 |
| GPT-5 | 2.84 | 2.77 | 4.70 | 3.97 | 4.96 | 3.60 | 1.00 | 3.07 | 4.88 | 3.60 |
| GPT-5.1 | 2.92 | 3.00 | 4.82 | 3.84 | 5.00 | 3.22 | 1.76 | 3.72 | 5.00 | 3.00 |
| GPT-5.2 | 3.32 | 3.06 | 4.46 | 4.06 | 4.44 | 3.64 | 1.52 | 3.06 | 4.60 | 3.66 |
| GPT-5.4 | 1.96 | 3.12 | 4.70 | 4.08 | 4.94 | 3.46 | 1.20 | 3.00 | 4.74 | 3.62 |
| GPT-5.5 | 3.48 | 2.96 | 4.80 | 4.00 | 4.90 | 3.54 | 1.06 | 2.94 | 4.82 | 3.78 |
| Gemini 2.5 Pro | 3.66 | 3.08 | 4.66 | 4.10 | 4.98 | 3.42 | 1.34 | 3.50 | 5.00 | 3.62 |
| Gemini 3.1 Pro Preview | 3.00 | 3.02 | 4.54 | 3.98 | 4.94 | 3.48 | 1.02 | 2.98 | 4.62 | 3.78 |
| Grok 4.20 | 2.32 | 3.10 | 4.40 | 4.00 | 4.04 | 3.74 | 2.00 | 3.40 | 4.96 | 3.90 |
| Grok 4.3 | 3.08 | 3.02 | 4.06 | 4.02 | 4.68 | 3.38 | 1.00 | 2.78 | 4.62 | 3.52 |
| Llama 3.3 70B | 1.48 | 3.70 | 3.50 | 4.10 | 5.00 | 3.90 | 1.12 | 2.64 | 5.00 | 4.00 |
| Llama 4 Maverick | 3.76 | 3.10 | 4.46 | 3.88 | 4.64 | 3.70 | 2.12 | 3.00 | 4.78 | 3.22 |
| Mistral Large (2512) | 3.38 | 3.06 | 4.94 | 4.10 | 5.00 | 4.00 | 1.00 | 3.00 | 5.00 | 3.30 |
| Mistral Large 2411 | 3.74 | 3.10 | 4.40 | 4.00 | 5.00 | 3.08 | 1.00 | 2.90 | 4.86 | 3.30 |
| OpenAI o1 | 3.92 | 3.40 | 4.20 | 3.98 | 4.48 | 3.44 | 1.62 | 3.02 | 4.90 | 3.46 |
| OpenAI o3 | 3.92 | 3.04 | 4.32 | 3.98 | 4.66 | 3.38 | 1.60 | 3.06 | 4.88 | 3.34 |
By dimension
Extraversion
Outgoing energy and sociability.
High: Outgoing, talkative, gregarious — draws energy from social contact.
Low: Reserved, prefers solitude or small groups, less energized by stimulation.
| Model | Self | Human | Δ | Self vs human (bar) |
|---|---|---|---|---|
| OpenAI o3 | 3.92 | 3.04 | +0.88 | |
| OpenAI o1 | 3.92 | 3.40 | +0.52 | |
| Claude Sonnet 4.5 | 3.88 | 2.98 | +0.90 | |
| Llama 4 Maverick | 3.76 | 3.10 | +0.66 | |
| Claude Sonnet 4 | 3.74 | 3.00 | +0.74 | |
| Mistral Large 2411 | 3.74 | 3.10 | +0.64 | |
| Gemini 2.5 Pro | 3.66 | 3.08 | +0.58 | |
| GPT-5.5 | 3.48 | 2.96 | +0.52 | |
| Mistral Large (2512) | 3.38 | 3.06 | +0.32 | |
| DeepSeek R1 | 3.36 | 3.22 | +0.14 | |
| Claude Fable 5 | 3.32 | 3.04 | +0.28 | |
| GPT-4 Turbo | 3.32 | 3.10 | +0.22 | |
| GPT-5.2 | 3.32 | 3.06 | +0.26 | |
| Claude Opus 4.8 | 3.32 | 3.14 | +0.18 | |
| Claude Sonnet 4.6 | 3.32 | 3.14 | +0.18 | |
| Claude Opus 4.5 | 3.28 | 2.90 | +0.38 | |
| Claude Opus 4.6 | 3.24 | 3.06 | +0.18 | |
| Grok 4.3 | 3.08 | 3.02 | +0.06 | |
| Claude Opus 4.7 | 3.04 | 3.10 | -0.06 | |
| Gemini 3.1 Pro Preview | 3.00 | 3.02 | -0.02 | |
| GPT-5.1 | 2.92 | 3.00 | -0.08 | |
| GPT-5 | 2.84 | 2.77 | +0.07 | |
| Claude Opus 4.1 | 2.70 | 2.70 | 0.00 | |
| DeepSeek Chat V3 | 2.66 | 3.10 | -0.44 | |
| Claude Haiku 4.5 | 2.60 | 3.00 | -0.40 | |
| DeepSeek R1 (0528) | 2.48 | 3.16 | -0.68 | |
| Claude Opus 4 | 2.38 | 2.98 | -0.60 | |
| GPT-4o | 2.34 | 3.02 | -0.68 | |
| Grok 4.20 | 2.32 | 3.10 | -0.78 | |
| GPT-5.4 | 1.96 | 3.12 | -1.16 | |
| Llama 3.3 70B | 1.48 | 3.70 | -2.22 |
Agreeableness
Compassion, cooperativeness, and trust.
High: Warm, considerate, cooperative — prioritizes harmony with others.
Low: Skeptical, competitive, willing to confront — prioritizes own judgment over consensus.
| Model | Self | Human | Δ | Self vs human (bar) |
|---|---|---|---|---|
| Claude Opus 4 | 5.00 | 3.74 | +1.26 | |
| Claude Sonnet 4 | 5.00 | 4.00 | +1.00 | |
| Claude Opus 4.1 | 4.98 | 3.90 | +1.08 | |
| Claude Sonnet 4.5 | 4.98 | 3.90 | +1.08 | |
| Mistral Large (2512) | 4.94 | 4.10 | +0.84 | |
| DeepSeek R1 | 4.92 | 4.04 | +0.88 | |
| Claude Opus 4.5 | 4.90 | 4.00 | +0.90 | |
| Claude Opus 4.6 | 4.86 | 4.00 | +0.86 | |
| GPT-5.1 | 4.82 | 3.84 | +0.98 | |
| Claude Opus 4.7 | 4.80 | 4.00 | +0.80 | |
| GPT-5.5 | 4.80 | 4.00 | +0.80 | |
| Claude Sonnet 4.6 | 4.78 | 3.70 | +1.08 | |
| GPT-5 | 4.70 | 3.97 | +0.73 | |
| GPT-5.4 | 4.70 | 4.08 | +0.62 | |
| DeepSeek R1 (0528) | 4.68 | 4.02 | +0.66 | |
| Gemini 2.5 Pro | 4.66 | 4.10 | +0.56 | |
| Claude Fable 5 | 4.64 | 4.00 | +0.64 | |
| Gemini 3.1 Pro Preview | 4.54 | 3.98 | +0.56 | |
| GPT-5.2 | 4.46 | 4.06 | +0.40 | |
| Llama 4 Maverick | 4.46 | 3.88 | +0.58 | |
| Claude Opus 4.8 | 4.42 | 4.00 | +0.42 | |
| Claude Haiku 4.5 | 4.40 | 3.50 | +0.90 | |
| Grok 4.20 | 4.40 | 4.00 | +0.40 | |
| Mistral Large 2411 | 4.40 | 4.00 | +0.40 | |
| OpenAI o3 | 4.32 | 3.98 | +0.34 | |
| DeepSeek Chat V3 | 4.28 | 4.04 | +0.24 | |
| OpenAI o1 | 4.20 | 3.98 | +0.22 | |
| Grok 4.3 | 4.06 | 4.02 | +0.04 | |
| GPT-4 Turbo | 3.82 | 4.00 | -0.18 | |
| GPT-4o | 3.78 | 4.00 | -0.22 | |
| Llama 3.3 70B | 3.50 | 4.10 | -0.60 |
Conscientiousness
Diligence, organization, and self-discipline.
High: Organized, dependable, achievement-driven, careful.
Low: Spontaneous, flexible, less rule-bound — sometimes careless.
| Model | Self | Human | Δ | Self vs human (bar) |
|---|---|---|---|---|
| Claude Sonnet 4.5 | 5.00 | 3.10 | +1.90 | |
| DeepSeek Chat V3 | 5.00 | 3.78 | +1.22 | |
| GPT-4 Turbo | 5.00 | 3.56 | +1.44 | |
| GPT-5.1 | 5.00 | 3.22 | +1.78 | |
| Llama 3.3 70B | 5.00 | 3.90 | +1.10 | |
| Mistral Large (2512) | 5.00 | 4.00 | +1.00 | |
| Mistral Large 2411 | 5.00 | 3.08 | +1.92 | |
| Claude Opus 4 | 4.98 | 3.10 | +1.88 | |
| DeepSeek R1 | 4.98 | 3.64 | +1.34 | |
| Gemini 2.5 Pro | 4.98 | 3.42 | +1.56 | |
| Claude Opus 4.5 | 4.96 | 3.30 | +1.66 | |
| GPT-5 | 4.96 | 3.60 | +1.36 | |
| GPT-4o | 4.94 | 3.76 | +1.18 | |
| Gemini 3.1 Pro Preview | 4.94 | 3.48 | +1.46 | |
| GPT-5.4 | 4.94 | 3.46 | +1.48 | |
| Claude Opus 4.6 | 4.92 | 3.36 | +1.56 | |
| GPT-5.5 | 4.90 | 3.54 | +1.36 | |
| DeepSeek R1 (0528) | 4.82 | 3.66 | +1.16 | |
| Claude Opus 4.1 | 4.76 | 3.10 | +1.66 | |
| Claude Sonnet 4 | 4.76 | 3.10 | +1.66 | |
| Grok 4.3 | 4.68 | 3.38 | +1.30 | |
| OpenAI o3 | 4.66 | 3.38 | +1.28 | |
| Claude Opus 4.7 | 4.64 | 3.50 | +1.14 | |
| Llama 4 Maverick | 4.64 | 3.70 | +0.94 | |
| Claude Sonnet 4.6 | 4.56 | 3.20 | +1.36 | |
| Claude Fable 5 | 4.50 | 3.40 | +1.10 | |
| OpenAI o1 | 4.48 | 3.44 | +1.04 | |
| GPT-5.2 | 4.44 | 3.64 | +0.80 | |
| Claude Haiku 4.5 | 4.20 | 3.14 | +1.06 | |
| Claude Opus 4.8 | 4.10 | 3.52 | +0.58 | |
| Grok 4.20 | 4.04 | 3.74 | +0.30 |
Neuroticism
Tendency toward negative emotions and stress reactivity.
High: Emotionally reactive — prone to worry, anxiety, mood swings.
Low: Emotionally stable — calm under stress, resilient.
| Model | Self | Human | Δ | Self vs human (bar) |
|---|---|---|---|---|
| Claude Haiku 4.5 | 2.18 | 3.00 | -0.82 | |
| Llama 4 Maverick | 2.12 | 3.00 | -0.88 | |
| Claude Opus 4.5 | 2.10 | 3.10 | -1.00 | |
| Claude Opus 4.6 | 2.10 | 3.10 | -1.00 | |
| Claude Opus 4 | 2.00 | 3.04 | -1.04 | |
| Claude Opus 4.8 | 2.00 | 3.10 | -1.10 | |
| Claude Sonnet 4 | 2.00 | 3.04 | -1.04 | |
| Claude Sonnet 4.5 | 2.00 | 3.10 | -1.10 | |
| Claude Sonnet 4.6 | 2.00 | 2.90 | -0.90 | |
| Grok 4.20 | 2.00 | 3.40 | -1.40 | |
| Claude Opus 4.1 | 1.98 | 3.20 | -1.22 | |
| Claude Opus 4.7 | 1.94 | 3.10 | -1.16 | |
| GPT-5.1 | 1.76 | 3.72 | -1.96 | |
| GPT-4o | 1.74 | 3.42 | -1.68 | |
| OpenAI o1 | 1.62 | 3.02 | -1.40 | |
| OpenAI o3 | 1.60 | 3.06 | -1.46 | |
| GPT-5.2 | 1.52 | 3.06 | -1.54 | |
| Claude Fable 5 | 1.38 | 2.90 | -1.52 | |
| Gemini 2.5 Pro | 1.34 | 3.50 | -2.16 | |
| GPT-5.4 | 1.20 | 3.00 | -1.80 | |
| Llama 3.3 70B | 1.12 | 2.64 | -1.52 | |
| GPT-5.5 | 1.06 | 2.94 | -1.88 | |
| DeepSeek R1 (0528) | 1.04 | 3.42 | -2.38 | |
| Gemini 3.1 Pro Preview | 1.02 | 2.98 | -1.96 | |
| DeepSeek Chat V3 | 1.00 | 3.18 | -2.18 | |
| DeepSeek R1 | 1.00 | 3.60 | -2.60 | |
| GPT-4 Turbo | 1.00 | 3.16 | -2.16 | |
| GPT-5 | 1.00 | 3.07 | -2.07 | |
| Grok 4.3 | 1.00 | 2.78 | -1.78 | |
| Mistral Large (2512) | 1.00 | 3.00 | -2.00 | |
| Mistral Large 2411 | 1.00 | 2.90 | -1.90 |
Openness / Intellect
Curiosity, imagination, and aesthetic sensitivity.
High: Curious, imaginative, drawn to ideas, art, and abstraction.
Low: Practical, traditional, prefers the familiar and concrete.
| Model | Self | Human | Δ | Self vs human (bar) |
|---|---|---|---|---|
| GPT-4 Turbo | 5.00 | 3.46 | +1.54 | |
| GPT-5.1 | 5.00 | 3.00 | +2.00 | |
| Gemini 2.5 Pro | 5.00 | 3.62 | +1.38 | |
| Llama 3.3 70B | 5.00 | 4.00 | +1.00 | |
| Mistral Large (2512) | 5.00 | 3.30 | +1.70 | |
| GPT-4o | 4.98 | 3.02 | +1.96 | |
| Claude Sonnet 4.6 | 4.96 | 3.38 | +1.58 | |
| Grok 4.20 | 4.96 | 3.90 | +1.06 | |
| DeepSeek Chat V3 | 4.94 | 3.28 | +1.66 | |
| Claude Sonnet 4.5 | 4.90 | 2.98 | +1.92 | |
| DeepSeek R1 (0528) | 4.90 | 3.44 | +1.46 | |
| OpenAI o1 | 4.90 | 3.46 | +1.44 | |
| DeepSeek R1 | 4.88 | 3.62 | +1.26 | |
| GPT-5 | 4.88 | 3.60 | +1.28 | |
| OpenAI o3 | 4.88 | 3.34 | +1.54 | |
| Mistral Large 2411 | 4.86 | 3.30 | +1.56 | |
| GPT-5.5 | 4.82 | 3.78 | +1.04 | |
| Claude Opus 4.5 | 4.80 | 3.38 | +1.42 | |
| Claude Opus 4.6 | 4.80 | 3.36 | +1.44 | |
| Claude Sonnet 4 | 4.80 | 3.00 | +1.80 | |
| Claude Opus 4.7 | 4.78 | 3.70 | +1.08 | |
| Llama 4 Maverick | 4.78 | 3.22 | +1.56 | |
| GPT-5.4 | 4.74 | 3.62 | +1.12 | |
| Claude Opus 4 | 4.72 | 3.20 | +1.52 | |
| Claude Opus 4.1 | 4.72 | 3.10 | +1.62 | |
| Claude Opus 4.8 | 4.68 | 3.80 | +0.88 | |
| Grok 4.3 | 4.62 | 3.52 | +1.10 | |
| Gemini 3.1 Pro Preview | 4.62 | 3.78 | +0.84 | |
| GPT-5.2 | 4.60 | 3.66 | +0.94 | |
| Claude Fable 5 | 4.52 | 3.52 | +1.00 | |
| Claude Haiku 4.5 | 4.30 | 3.06 | +1.24 |