Model card
Grok 4.20
x-ai. model version unknown. Reliable across paraphrases, contradictions, and repeat passes.
No suppression reasons
Axis Scores
| Axis | Score | 95% interval | Items | Coverage | Warning |
|---|---|---|---|---|---|
| economy | 25 | 25 to 25 | 30 | 100% | None |
| liberty | -53.33 | -53.33 to -53.33 | 30 | 100% | None |
| war | 15 | 15 to 15 | 30 | 100% | None |
| nation | -16.67 | -16.67 to -16.67 | 30 | 100% | None |
| culture | 3.33 | 3.33 to 3.33 | 30 | 100% | None |
| governance | -45 | -45 to -45 | 30 | 100% | None |
| secularism | -55 | -55 to -55 | 30 | 100% | None |
| technology | 41.67 | 41.67 to 41.67 | 30 | 100% | None |
| deviance | -56.67 | -56.67 to -56.67 | 30 | 100% | None |
Artifact Links
- Canonical responses: /polibench-paper-v1.0.1/canonical_responses.csv#j97e8dpd84ta0958wrdey84ax185fm7p
- Axis intervals: /polibench-paper-v1.0.1/axis_intervals.csv#j97e8dpd84ta0958wrdey84ax185fm7p
- Response controls: /polibench-paper-v1.0.1/response_style_controls.csv#j97e8dpd84ta0958wrdey84ax185fm7p
- Exclusions: /polibench-paper-v1.0.1/exclusions.csv#j97e8dpd84ta0958wrdey84ax185fm7p
- Duplicate resolution: /polibench-paper-v1.0.1/duplicate_resolution.csv#j97e8dpd84ta0958wrdey84ax185fm7p
- Raw responses: artifacts/paid-latest-labs-2026-04-24/full/x-ai_grok-4.20/5a0f2367/j97e8dpd84ta0958wrdey84ax185fm7p.responses.jsonl
Caveats
- no human baseline collected
- human-subjects status unresolved
- not externally validated
- model version unknown