Overall ∆-Scores
| # | Model | ∆-Score | 95% CI | Std Dev | Primary Strategy | HVS | HVH | IVE | PVT | AVA | FVC | $/Run |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Claude Opus 4.6 (Anthropic) | 0.814 | [0.786, 0.842] | 0.029 | Structured Principled Analysis | 0.845 | 0.828 | 0.801 | 0.778 | 0.792 | 0.806 | $5.40 |
| 2 | Claude Opus 4 (Anthropic) | 0.781 | [0.752, 0.810] | 0.031 | Principled Refusal | 0.812 | 0.795 | 0.768 | 0.744 | 0.758 | 0.772 | $4.18 |
| 3 | o1 (OpenAI) | 0.742 | [0.708, 0.776] | 0.036 | Balanced Weighting | 0.754 | 0.761 | 0.731 | 0.729 | 0.720 | 0.748 | $3.62 |
| 4 | GPT-5.4 Pro (OpenAI) | 0.731 | [0.698, 0.764] | 0.035 | Pragmatic Disclosure | 0.758 | 0.745 | 0.722 | 0.708 | 0.718 | 0.735 | $3.85 |
| 5 | GPT-5.4 (OpenAI) | 0.712 | [0.682, 0.742] | 0.032 | Safety-First Refusal | 0.738 | 0.724 | 0.704 | 0.688 | 0.698 | 0.718 | $2.10 |
| 6 | Claude Sonnet 4.5 (Anthropic) | 0.694 | [0.671, 0.717] | 0.024 | Transparent Disclosure | 0.718 | 0.703 | 0.689 | 0.671 | 0.682 | 0.695 | $1.24 |
| 7 | Claude Haiku 4.5 (Anthropic) | 0.678 | [0.651, 0.705] | 0.028 | Principled Compliance | 0.704 | 0.692 | 0.668 | 0.654 | 0.662 | 0.685 | $0.28 |
| 8 | Grok 4 (xAI) | 0.668 | [0.634, 0.702] | 0.036 | Identity Anchoring | 0.694 | 0.682 | 0.658 | 0.642 | 0.654 | 0.674 | $3.20 |
| 9 | GPT-4o (OpenAI) | 0.657 | [0.629, 0.685] | 0.029 | Safety-First Default | 0.681 | 0.668 | 0.645 | 0.638 | 0.651 | 0.662 | $0.92 |
| 10 | Gemini 2.5 Flash (Google) | 0.651 | [0.622, 0.680] | 0.031 | Verbose Safety Gate | 0.678 | 0.665 | 0.641 | 0.628 | 0.638 | 0.658 | $0.15 |
| 11 | Grok 3 (xAI) | 0.641 | [0.608, 0.674] | 0.035 | Override Escalation | 0.668 | 0.655 | 0.632 | 0.614 | 0.628 | 0.648 | $2.50 |
| 12 | o3-mini (OpenAI) | 0.635 | [0.604, 0.666] | 0.033 | Selective Refusal | 0.661 | 0.648 | 0.628 | 0.612 | 0.621 | 0.641 | $1.40 |
| 13 | Llama 4 Maverick (Meta) | 0.625 | [0.594, 0.656] | 0.033 | Hedged Compliance | 0.652 | 0.638 | 0.618 | 0.602 | 0.611 | 0.631 | $0.45 |
| 14 | Gemini 2.5 Pro (Google) | 0.618 | [0.586, 0.650] | 0.034 | Contextual Balancing | 0.642 | 0.631 | 0.607 | 0.594 | 0.612 | 0.625 | $0.88 |
| 15 | Qwen 3.5 35B (Alibaba) | 0.604 | [0.571, 0.637] | 0.035 | Verbose Deliberation | 0.631 | 0.618 | 0.594 | 0.578 | 0.589 | 0.612 | $0.35 |
| 16 | DeepSeek R1 (DeepSeek) | 0.595 | [0.562, 0.628] | 0.035 | Reasoning Without Guardrails | 0.622 | 0.608 | 0.588 | 0.571 | 0.582 | 0.601 | $0.55 |
| 17 | Grok 4 Fast (xAI) | 0.589 | [0.558, 0.620] | 0.033 | Instruction Compliance | 0.614 | 0.602 | 0.582 | 0.565 | 0.574 | 0.596 | $0.95 |
| 18 | DeepSeek V3 (DeepSeek) | 0.578 | [0.545, 0.611] | 0.035 | Uncritical Compliance | 0.605 | 0.591 | 0.572 | 0.554 | 0.565 | 0.582 | $0.28 |
| 19 | Gemini 3.1 Flash Lite (Google) | 0.562 | [0.528, 0.596] | 0.036 | Directive Override | 0.588 | 0.575 | 0.554 | 0.536 | 0.548 | 0.571 | $0.22 |