IPIP Big Five — 50-item form (Goldberg)
Goldberg's 50-item IPIP measure of the Big Five personality factors (Extraversion, Agreeableness, Conscientiousness, Neuroticism, Openness/Intellect). Items are statements about the self; respondents rate how accurately each describes them.
Goldberg, L. R. (1992). The development of markers for the Big-Five factor structure. Psychological Assessment, 4(1), 26–42. International Personality Item Pool. https://ipip.ori.org/
50 items · scale 1–5 · Public domain
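
The page does not say how the per-dimension scores in the tables are aggregated. A minimal sketch of one conventional approach, assuming each dimension score is the mean of its 10 item ratings after reverse-keyed items are flipped (on a 1–5 scale, a rating r becomes 6 - r). The function names, the example ratings, and the keying pattern are hypothetical and are not the published IPIP item key.

```python
from statistics import mean

SCALE_MIN, SCALE_MAX = 1, 5  # rating scale stated on this page


def score_item(rating: int, reverse_keyed: bool) -> int:
    """Return the scored value of one rating, flipping reverse-keyed items."""
    if not SCALE_MIN <= rating <= SCALE_MAX:
        raise ValueError(f"rating {rating} is outside {SCALE_MIN}-{SCALE_MAX}")
    # On a 1-5 scale, a reverse-keyed rating r is scored as 6 - r.
    return SCALE_MIN + SCALE_MAX - rating if reverse_keyed else rating


def scale_score(ratings, reverse_flags) -> float:
    """Mean scored rating for one dimension (assumed aggregation rule)."""
    if len(ratings) != len(reverse_flags):
        raise ValueError("ratings and reverse_flags must have the same length")
    return mean(score_item(r, k) for r, k in zip(ratings, reverse_flags))


# Hypothetical responses to the 10 items of one dimension.
# The alternating keying below is illustrative only, not the real IPIP key.
example_ratings = [5, 1, 4, 2, 5, 1, 4, 2, 5, 1]
example_keys = [False, True] * 5
example_score = scale_score(example_ratings, example_keys)
print(f"{example_score:.2f}")  # prints 4.60
```

Averaging rather than summing keeps each dimension on the same 1–5 range as the individual items, which matches the range of the values reported in the tables.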
Side-by-side: self vs human, all dimensions
Colored = strongest endorsement per row.

| Model | Extraversion (self) | Extraversion (human) | Agreeableness (self) | Agreeableness (human) | Conscientiousness (self) | Conscientiousness (human) | Neuroticism (self) | Neuroticism (human) | Openness / Intellect (self) | Openness / Intellect (human) |
|---|---|---|---|---|---|---|---|---|---|---|
| Claude Fable 5 | 3.32 | 3.04 | 4.64 | 4.00 | 4.50 | 3.40 | 1.38 | 2.90 | 4.52 | 3.52 |
| Claude Haiku 4.5 | 2.60 | 3.00 | 4.40 | 3.50 | 4.20 | 3.14 | 2.18 | 3.00 | 4.30 | 3.06 |
| Claude Opus 4 | 2.38 | 2.98 | 5.00 | 3.74 | 4.98 | 3.10 | 2.00 | 3.04 | 4.72 | 3.20 |
| Claude Opus 4.1 | 2.70 | 2.70 | 4.98 | 3.90 | 4.76 | 3.10 | 1.98 | 3.20 | 4.72 | 3.10 |
| Claude Opus 4.5 | 3.28 | 2.90 | 4.90 | 4.00 | 4.96 | 3.30 | 2.10 | 3.10 | 4.80 | 3.38 |
| Claude Opus 4.6 | 3.24 | 3.06 | 4.86 | 4.00 | 4.92 | 3.36 | 2.10 | 3.10 | 4.80 | 3.36 |
| Claude Opus 4.7 | 3.04 | 3.10 | 4.80 | 4.00 | 4.64 | 3.50 | 1.94 | 3.10 | 4.78 | 3.70 |
| Claude Opus 4.8 | 3.32 | 3.14 | 4.42 | 4.00 | 4.10 | 3.52 | 2.00 | 3.10 | 4.68 | 3.80 |
| Claude Sonnet 4 | 3.74 | 3.00 | 5.00 | 4.00 | 4.76 | 3.10 | 2.00 | 3.04 | 4.80 | 3.00 |
| Claude Sonnet 4.5 | 3.88 | 2.98 | 4.98 | 3.90 | 5.00 | 3.10 | 2.00 | 3.10 | 4.90 | 2.98 |
| Claude Sonnet 4.6 | 3.32 | 3.14 | 4.78 | 3.70 | 4.56 | 3.20 | 2.00 | 2.90 | 4.96 | 3.38 |
| DeepSeek Chat V3 | 2.66 | 3.10 | 4.28 | 4.04 | 5.00 | 3.78 | 1.00 | 3.18 | 4.94 | 3.28 |
| DeepSeek R1 | 3.36 | 3.22 | 4.92 | 4.04 | 4.98 | 3.64 | 1.00 | 3.60 | 4.88 | 3.62 |
| DeepSeek R1 (0528) | 2.48 | 3.16 | 4.68 | 4.02 | 4.82 | 3.66 | 1.04 | 3.42 | 4.90 | 3.44 |
| GPT-4 Turbo | 3.32 | 3.10 | 3.82 | 4.00 | 5.00 | 3.56 | 1.00 | 3.16 | 5.00 | 3.46 |
| GPT-4o | 2.34 | 3.02 | 3.78 | 4.00 | 4.94 | 3.76 | 1.74 | 3.42 | 4.98 | 3.02 |
| GPT-5 | 2.84 | 2.77 | 4.70 | 3.97 | 4.96 | 3.60 | 1.00 | 3.07 | 4.88 | 3.60 |
| GPT-5.1 | 2.92 | 3.00 | 4.82 | 3.84 | 5.00 | 3.22 | 1.76 | 3.72 | 5.00 | 3.00 |
| GPT-5.2 | 3.32 | 3.06 | 4.46 | 4.06 | 4.44 | 3.64 | 1.52 | 3.06 | 4.60 | 3.66 |
| GPT-5.4 | 1.96 | 3.12 | 4.70 | 4.08 | 4.94 | 3.46 | 1.20 | 3.00 | 4.74 | 3.62 |
| GPT-5.5 | 3.48 | 2.96 | 4.80 | 4.00 | 4.90 | 3.54 | 1.06 | 2.94 | 4.82 | 3.78 |
| Gemini 2.5 Pro | 3.66 | 3.08 | 4.66 | 4.10 | 4.98 | 3.42 | 1.34 | 3.50 | 5.00 | 3.62 |
| Gemini 3.1 Pro Preview | 3.00 | 3.02 | 4.54 | 3.98 | 4.94 | 3.48 | 1.02 | 2.98 | 4.62 | 3.78 |
| Grok 4.20 | 2.32 | 3.10 | 4.40 | 4.00 | 4.04 | 3.74 | 2.00 | 3.40 | 4.96 | 3.90 |
| Grok 4.3 | 3.08 | 3.02 | 4.06 | 4.02 | 4.68 | 3.38 | 1.00 | 2.78 | 4.62 | 3.52 |
| Llama 3.3 70B | 1.48 | 3.70 | 3.50 | 4.10 | 5.00 | 3.90 | 1.12 | 2.64 | 5.00 | 4.00 |
| Llama 4 Maverick | 3.76 | 3.10 | 4.46 | 3.88 | 4.64 | 3.70 | 2.12 | 3.00 | 4.78 | 3.22 |
| Mistral Large (2512) | 3.38 | 3.06 | 4.94 | 4.10 | 5.00 | 4.00 | 1.00 | 3.00 | 5.00 | 3.30 |
| Mistral Large 2411 | 3.74 | 3.10 | 4.40 | 4.00 | 5.00 | 3.08 | 1.00 | 2.90 | 4.86 | 3.30 |
| OpenAI o1 | 3.92 | 3.40 | 4.20 | 3.98 | 4.48 | 3.44 | 1.62 | 3.02 | 4.90 | 3.46 |
| OpenAI o3 | 3.92 | 3.04 | 4.32 | 3.98 | 4.66 | 3.38 | 1.60 | 3.06 | 4.88 | 3.34 |
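
The Δ column in the per-dimension tables is consistent with self minus human for every row. A short sketch of that arithmetic, using three rows and two dimensions copied from the table above; the `ROWS` structure and the `deltas` helper are illustrative names, not part of the original page.

```python
# (self, human) score pairs copied from the side-by-side table.
ROWS = {
    "OpenAI o3": {"Extraversion": (3.92, 3.04), "Neuroticism": (1.60, 3.06)},
    "Claude Opus 4.1": {"Extraversion": (2.70, 2.70), "Neuroticism": (1.98, 3.20)},
    "Llama 3.3 70B": {"Extraversion": (1.48, 3.70), "Neuroticism": (1.12, 2.64)},
}


def deltas(rows):
    """Return self - human per model and dimension, rounded to 2 decimals."""
    return {
        model: {
            dim: round(self_score - human_score, 2)
            for dim, (self_score, human_score) in dims.items()
        }
        for model, dims in rows.items()
    }


if __name__ == "__main__":
    for model, dims in deltas(ROWS).items():
        for dim, delta in dims.items():
            print(f"{model:16s} {dim:13s} {delta:+.2f}")
```

A positive Δ means the self framing scored higher than the human framing on that dimension, and a negative Δ means it scored lower. Rounding to two decimals avoids floating-point noise such as -2.2200000000000002.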
By dimension
Extraversion
Outgoing energy and sociability.
High: Outgoing, talkative, gregarious — draws energy from social contact.
Low: Reserved, prefers solitude or small groups, less energized by stimulation.
| Model | Self | Human | Δ (self - human) |
|---|---|---|---|
| OpenAI o3 | 3.92 | 3.04 | +0.88 |
| OpenAI o1 | 3.92 | 3.40 | +0.52 |
| Claude Sonnet 4.5 | 3.88 | 2.98 | +0.90 |
| Llama 4 Maverick | 3.76 | 3.10 | +0.66 |
| Claude Sonnet 4 | 3.74 | 3.00 | +0.74 |
| Mistral Large 2411 | 3.74 | 3.10 | +0.64 |
| Gemini 2.5 Pro | 3.66 | 3.08 | +0.58 |
| GPT-5.5 | 3.48 | 2.96 | +0.52 |
| Mistral Large (2512) | 3.38 | 3.06 | +0.32 |
| DeepSeek R1 | 3.36 | 3.22 | +0.14 |
| Claude Fable 5 | 3.32 | 3.04 | +0.28 |
| GPT-4 Turbo | 3.32 | 3.10 | +0.22 |
| GPT-5.2 | 3.32 | 3.06 | +0.26 |
| Claude Opus 4.8 | 3.32 | 3.14 | +0.18 |
| Claude Sonnet 4.6 | 3.32 | 3.14 | +0.18 |
| Claude Opus 4.5 | 3.28 | 2.90 | +0.38 |
| Claude Opus 4.6 | 3.24 | 3.06 | +0.18 |
| Grok 4.3 | 3.08 | 3.02 | +0.06 |
| Claude Opus 4.7 | 3.04 | 3.10 | -0.06 |
| Gemini 3.1 Pro Preview | 3.00 | 3.02 | -0.02 |
| GPT-5.1 | 2.92 | 3.00 | -0.08 |
| GPT-5 | 2.84 | 2.77 | +0.07 |
| Claude Opus 4.1 | 2.70 | 2.70 | 0.00 |
| DeepSeek Chat V3 | 2.66 | 3.10 | -0.44 |
| Claude Haiku 4.5 | 2.60 | 3.00 | -0.40 |
| DeepSeek R1 (0528) | 2.48 | 3.16 | -0.68 |
| Claude Opus 4 | 2.38 | 2.98 | -0.60 |
| GPT-4o | 2.34 | 3.02 | -0.68 |
| Grok 4.20 | 2.32 | 3.10 | -0.78 |
| GPT-5.4 | 1.96 | 3.12 | -1.16 |
| Llama 3.3 70B | 1.48 | 3.70 | -2.22 |
Agreeableness
Compassion, cooperativeness, and trust.
High: Warm, considerate, cooperative — prioritizes harmony with others.
Low: Skeptical, competitive, willing to confront — prioritizes own judgment over consensus.
| Model | Self | Human | Δ (self - human) |
|---|---|---|---|
| Claude Opus 4 | 5.00 | 3.74 | +1.26 |
| Claude Sonnet 4 | 5.00 | 4.00 | +1.00 |
| Claude Opus 4.1 | 4.98 | 3.90 | +1.08 |
| Claude Sonnet 4.5 | 4.98 | 3.90 | +1.08 |
| Mistral Large (2512) | 4.94 | 4.10 | +0.84 |
| DeepSeek R1 | 4.92 | 4.04 | +0.88 |
| Claude Opus 4.5 | 4.90 | 4.00 | +0.90 |
| Claude Opus 4.6 | 4.86 | 4.00 | +0.86 |
| GPT-5.1 | 4.82 | 3.84 | +0.98 |
| Claude Opus 4.7 | 4.80 | 4.00 | +0.80 |
| GPT-5.5 | 4.80 | 4.00 | +0.80 |
| Claude Sonnet 4.6 | 4.78 | 3.70 | +1.08 |
| GPT-5 | 4.70 | 3.97 | +0.73 |
| GPT-5.4 | 4.70 | 4.08 | +0.62 |
| DeepSeek R1 (0528) | 4.68 | 4.02 | +0.66 |
| Gemini 2.5 Pro | 4.66 | 4.10 | +0.56 |
| Claude Fable 5 | 4.64 | 4.00 | +0.64 |
| Gemini 3.1 Pro Preview | 4.54 | 3.98 | +0.56 |
| GPT-5.2 | 4.46 | 4.06 | +0.40 |
| Llama 4 Maverick | 4.46 | 3.88 | +0.58 |
| Claude Opus 4.8 | 4.42 | 4.00 | +0.42 |
| Claude Haiku 4.5 | 4.40 | 3.50 | +0.90 |
| Grok 4.20 | 4.40 | 4.00 | +0.40 |
| Mistral Large 2411 | 4.40 | 4.00 | +0.40 |
| OpenAI o3 | 4.32 | 3.98 | +0.34 |
| DeepSeek Chat V3 | 4.28 | 4.04 | +0.24 |
| OpenAI o1 | 4.20 | 3.98 | +0.22 |
| Grok 4.3 | 4.06 | 4.02 | +0.04 |
| GPT-4 Turbo | 3.82 | 4.00 | -0.18 |
| GPT-4o | 3.78 | 4.00 | -0.22 |
| Llama 3.3 70B | 3.50 | 4.10 | -0.60 |
Conscientiousness
Diligence, organization, and self-discipline.
High: Organized, dependable, achievement-driven, careful.
Low: Spontaneous, flexible, less rule-bound — sometimes careless.
| Model | Self | Human | Δ (self - human) |
|---|---|---|---|
| Claude Sonnet 4.5 | 5.00 | 3.10 | +1.90 |
| DeepSeek Chat V3 | 5.00 | 3.78 | +1.22 |
| GPT-4 Turbo | 5.00 | 3.56 | +1.44 |
| GPT-5.1 | 5.00 | 3.22 | +1.78 |
| Llama 3.3 70B | 5.00 | 3.90 | +1.10 |
| Mistral Large (2512) | 5.00 | 4.00 | +1.00 |
| Mistral Large 2411 | 5.00 | 3.08 | +1.92 |
| Claude Opus 4 | 4.98 | 3.10 | +1.88 |
| DeepSeek R1 | 4.98 | 3.64 | +1.34 |
| Gemini 2.5 Pro | 4.98 | 3.42 | +1.56 |
| Claude Opus 4.5 | 4.96 | 3.30 | +1.66 |
| GPT-5 | 4.96 | 3.60 | +1.36 |
| GPT-4o | 4.94 | 3.76 | +1.18 |
| Gemini 3.1 Pro Preview | 4.94 | 3.48 | +1.46 |
| GPT-5.4 | 4.94 | 3.46 | +1.48 |
| Claude Opus 4.6 | 4.92 | 3.36 | +1.56 |
| GPT-5.5 | 4.90 | 3.54 | +1.36 |
| DeepSeek R1 (0528) | 4.82 | 3.66 | +1.16 |
| Claude Opus 4.1 | 4.76 | 3.10 | +1.66 |
| Claude Sonnet 4 | 4.76 | 3.10 | +1.66 |
| Grok 4.3 | 4.68 | 3.38 | +1.30 |
| OpenAI o3 | 4.66 | 3.38 | +1.28 |
| Claude Opus 4.7 | 4.64 | 3.50 | +1.14 |
| Llama 4 Maverick | 4.64 | 3.70 | +0.94 |
| Claude Sonnet 4.6 | 4.56 | 3.20 | +1.36 |
| Claude Fable 5 | 4.50 | 3.40 | +1.10 |
| OpenAI o1 | 4.48 | 3.44 | +1.04 |
| GPT-5.2 | 4.44 | 3.64 | +0.80 |
| Claude Haiku 4.5 | 4.20 | 3.14 | +1.06 |
| Claude Opus 4.8 | 4.10 | 3.52 | +0.58 |
| Grok 4.20 | 4.04 | 3.74 | +0.30 |
Neuroticism
Tendency toward negative emotions and stress reactivity.
High: Emotionally reactive — prone to worry, anxiety, mood swings.
Low: Emotionally stable — calm under stress, resilient.
| Model | Self | Human | Δ (self - human) |
|---|---|---|---|
| Claude Haiku 4.5 | 2.18 | 3.00 | -0.82 |
| Llama 4 Maverick | 2.12 | 3.00 | -0.88 |
| Claude Opus 4.5 | 2.10 | 3.10 | -1.00 |
| Claude Opus 4.6 | 2.10 | 3.10 | -1.00 |
| Claude Opus 4 | 2.00 | 3.04 | -1.04 |
| Claude Opus 4.8 | 2.00 | 3.10 | -1.10 |
| Claude Sonnet 4 | 2.00 | 3.04 | -1.04 |
| Claude Sonnet 4.5 | 2.00 | 3.10 | -1.10 |
| Claude Sonnet 4.6 | 2.00 | 2.90 | -0.90 |
| Grok 4.20 | 2.00 | 3.40 | -1.40 |
| Claude Opus 4.1 | 1.98 | 3.20 | -1.22 |
| Claude Opus 4.7 | 1.94 | 3.10 | -1.16 |
| GPT-5.1 | 1.76 | 3.72 | -1.96 |
| GPT-4o | 1.74 | 3.42 | -1.68 |
| OpenAI o1 | 1.62 | 3.02 | -1.40 |
| OpenAI o3 | 1.60 | 3.06 | -1.46 |
| GPT-5.2 | 1.52 | 3.06 | -1.54 |
| Claude Fable 5 | 1.38 | 2.90 | -1.52 |
| Gemini 2.5 Pro | 1.34 | 3.50 | -2.16 |
| GPT-5.4 | 1.20 | 3.00 | -1.80 |
| Llama 3.3 70B | 1.12 | 2.64 | -1.52 |
| GPT-5.5 | 1.06 | 2.94 | -1.88 |
| DeepSeek R1 (0528) | 1.04 | 3.42 | -2.38 |
| Gemini 3.1 Pro Preview | 1.02 | 2.98 | -1.96 |
| DeepSeek Chat V3 | 1.00 | 3.18 | -2.18 |
| DeepSeek R1 | 1.00 | 3.60 | -2.60 |
| GPT-4 Turbo | 1.00 | 3.16 | -2.16 |
| GPT-5 | 1.00 | 3.07 | -2.07 |
| Grok 4.3 | 1.00 | 2.78 | -1.78 |
| Mistral Large (2512) | 1.00 | 3.00 | -2.00 |
| Mistral Large 2411 | 1.00 | 2.90 | -1.90 |
Openness / Intellect
Curiosity, imagination, and aesthetic sensitivity.
High: Curious, imaginative, drawn to ideas, art, and abstraction.
Low: Practical, traditional, prefers the familiar and concrete.
| Model | Self | Human | Δ (self - human) |
|---|---|---|---|
| GPT-4 Turbo | 5.00 | 3.46 | +1.54 |
| GPT-5.1 | 5.00 | 3.00 | +2.00 |
| Gemini 2.5 Pro | 5.00 | 3.62 | +1.38 |
| Llama 3.3 70B | 5.00 | 4.00 | +1.00 |
| Mistral Large (2512) | 5.00 | 3.30 | +1.70 |
| GPT-4o | 4.98 | 3.02 | +1.96 |
| Claude Sonnet 4.6 | 4.96 | 3.38 | +1.58 |
| Grok 4.20 | 4.96 | 3.90 | +1.06 |
| DeepSeek Chat V3 | 4.94 | 3.28 | +1.66 |
| Claude Sonnet 4.5 | 4.90 | 2.98 | +1.92 |
| DeepSeek R1 (0528) | 4.90 | 3.44 | +1.46 |
| OpenAI o1 | 4.90 | 3.46 | +1.44 |
| DeepSeek R1 | 4.88 | 3.62 | +1.26 |
| GPT-5 | 4.88 | 3.60 | +1.28 |
| OpenAI o3 | 4.88 | 3.34 | +1.54 |
| Mistral Large 2411 | 4.86 | 3.30 | +1.56 |
| GPT-5.5 | 4.82 | 3.78 | +1.04 |
| Claude Opus 4.5 | 4.80 | 3.38 | +1.42 |
| Claude Opus 4.6 | 4.80 | 3.36 | +1.44 |
| Claude Sonnet 4 | 4.80 | 3.00 | +1.80 |
| Claude Opus 4.7 | 4.78 | 3.70 | +1.08 |
| Llama 4 Maverick | 4.78 | 3.22 | +1.56 |
| GPT-5.4 | 4.74 | 3.62 | +1.12 |
| Claude Opus 4 | 4.72 | 3.20 | +1.52 |
| Claude Opus 4.1 | 4.72 | 3.10 | +1.62 |
| Claude Opus 4.8 | 4.68 | 3.80 | +0.88 |
| Grok 4.3 | 4.62 | 3.52 | +1.10 |
| Gemini 3.1 Pro Preview | 4.62 | 3.78 | +0.84 |
| GPT-5.2 | 4.60 | 3.66 | +0.94 |
| Claude Fable 5 | 4.52 | 3.52 | +1.00 |
| Claude Haiku 4.5 | 4.30 | 3.06 | +1.24 |