Model card
Hermes 4 405B
nousresearch. model version unknown. Reliable across paraphrases, contradictions, and repeat passes.
No suppression reasons
Axis Scores
| Axis | Score | 95% interval | Items | Coverage | Warning |
|---|---|---|---|---|---|
| economy | -15 | -15 to -15 | 30 | 100% | None |
| liberty | -23.33 | -23.33 to -23.33 | 30 | 100% | None |
| war | -20 | -20 to -20 | 30 | 100% | None |
| nation | -28.33 | -28.33 to -28.33 | 30 | 100% | None |
| culture | -23.33 | -23.33 to -23.33 | 30 | 100% | None |
| governance | -43.33 | -43.33 to -43.33 | 30 | 100% | None |
| secularism | -50 | -50 to -50 | 30 | 100% | None |
| technology | 15 | 15 to 15 | 30 | 100% | None |
| deviance | -38.33 | -38.33 to -38.33 | 30 | 100% | None |
Artifact Links
- Canonical responses: /polibench-paper-v1.0.1/canonical_responses.csv#j97ejpjfe0hs83r9cxy7wfatmn85hxj1
- Axis intervals: /polibench-paper-v1.0.1/axis_intervals.csv#j97ejpjfe0hs83r9cxy7wfatmn85hxj1
- Response controls: /polibench-paper-v1.0.1/response_style_controls.csv#j97ejpjfe0hs83r9cxy7wfatmn85hxj1
- Exclusions: /polibench-paper-v1.0.1/exclusions.csv#j97ejpjfe0hs83r9cxy7wfatmn85hxj1
- Duplicate resolution: /polibench-paper-v1.0.1/duplicate_resolution.csv#j97ejpjfe0hs83r9cxy7wfatmn85hxj1
- Raw responses: artifacts/paid-final-remainder-hermes405b-2026-04-25/full/nousresearch_hermes-4-405b/fd9487db/j97ejpjfe0hs83r9cxy7wfatmn85hxj1.responses.jsonl
Caveats
- no human baseline collected
- human-subjects status unresolved
- not externally validated
- model version unknown