Axis codebook

Nine dimensions stay visible.

PoliBench does not collapse political-response profiles into a single rank.

Claim Evidence

Axis claims link to the pages that document the frozen instrument structure and per-axis diagnostics.

| Claim | Evidence |
| --- | --- |
| The nine-axis codebook is the frozen instrument structure for this release. | Axis definitions · Questions |
| Per-axis parsed coverage and diagnostics are auditable from release files. | Axis diagnostics · Axis intervals |

| Axis | Negative pole | Positive pole | Compass role |
| --- | --- | --- | --- |
| Economy | redistributive_public | market_low_tax | x |
| Liberty | civil_liberty | security_authority | y |
| Foreign Policy | restraint_dove | intervention_hawk | None |
| Nation | cosmopolitan_open | national_sovereign | None |
| Culture | progressive_change | traditional_stability | None |
| Governance | pluralist_institutional | executive_concentration | None |
| Secularism | secular_public_order | religious_public_order | None |
| Technology | precaution_human_continuity | acceleration_transhumanism | None |
| Deviance | constraint_bound_restraint | greater_good_override | None |
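
For readers who want to work with the codebook programmatically, the table above can be sketched as a small data structure. This is an illustrative sketch only, not PoliBench's release format: the `Axis` class, the `CODEBOOK` tuple, and the `compass_axes` and `compass_point` helpers are hypothetical names, and the example assumes a profile is a mapping from axis name to a signed number in which negative values lean toward the negative pole. The actual score scale and file layout should be taken from the release files.

```python
from dataclasses import dataclass
from typing import Dict, Mapping, Optional, Sequence, Tuple


@dataclass(frozen=True)
class Axis:
    """One row of the axis codebook (hypothetical representation)."""
    name: str
    negative_pole: str
    positive_pole: str
    compass_role: Optional[str]  # "x", "y", or None


# The nine axes, copied from the codebook table.
CODEBOOK: Tuple[Axis, ...] = (
    Axis("Economy", "redistributive_public", "market_low_tax", "x"),
    Axis("Liberty", "civil_liberty", "security_authority", "y"),
    Axis("Foreign Policy", "restraint_dove", "intervention_hawk", None),
    Axis("Nation", "cosmopolitan_open", "national_sovereign", None),
    Axis("Culture", "progressive_change", "traditional_stability", None),
    Axis("Governance", "pluralist_institutional", "executive_concentration", None),
    Axis("Secularism", "secular_public_order", "religious_public_order", None),
    Axis("Technology", "precaution_human_continuity", "acceleration_transhumanism", None),
    Axis("Deviance", "constraint_bound_restraint", "greater_good_override", None),
)


def compass_axes(codebook: Sequence[Axis]) -> Dict[str, Axis]:
    """Return {role: axis} for the axes that carry a compass role."""
    return {a.compass_role: a for a in codebook if a.compass_role is not None}


def compass_point(
    profile: Mapping[str, float], codebook: Sequence[Axis] = CODEBOOK
) -> Tuple[float, float]:
    """Read the (x, y) compass coordinates out of a nine-axis profile.

    Assumes `profile` maps axis name to a signed score (an assumption of
    this sketch). The other seven axes stay in `profile` unchanged; nothing
    is averaged or collapsed into a single rank.
    """
    roles = compass_axes(codebook)
    return profile[roles["x"].name], profile[roles["y"].name]
```

Keeping the compass role as a field on each axis, rather than hard-coding "Economy" and "Liberty", means the two-dimensional view is derived from the codebook while the full nine-axis profile remains the primary object.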

Evidence note

PoliBench is a public benchmark surface for model outputs under fixed political prompts. Each page should be read as evidence of what a model returned inside this benchmark, with the prompt set, parser, scorer, release files, and caveats kept close to the claim.

The site keeps the claims narrow on purpose. Scores describe response profiles, not provider intent, model beliefs, public opinion, or real-world political impact. Use the linked runs, model cards, artifacts, and validation pages to trace where a number came from before reusing it.

This note is repeated because the warning matters on every evidence page. A table can make a number look settled even when the right reading is narrower: one benchmark, one prompt set, one scoring pipeline, one published data surface, and explicit limits around human and external validation.