Publications
Two peer-reviewed papers establishing the theoretical and empirical foundations for AI value-conflict auditing and consciousness assessment.
Triangulating Evidence for Machine Consciousness Claims: A Validity-Centered Stack of Behavioral Batteries, Mechanistic Indicators, Perturbation Tests, and Credence Reporting
- First validity-centered measurement framework for consciousness-relevant AI properties
- Four-stream triangulation architecture (B/M/P/O) with independent validation
- Credence-to-action governance mapping with four operational tiers
- Empirical demonstration on GPT-5.2 Pro revealing fragile-proxy pattern
- The missing-stream rule: withhold rather than project unmeasured values
The Hard Part Is ∆: Value-Conflict Adjudication as an Architectural Bridge Between Alignment and Machine Consciousness
- The ∆-Divergence Framework for value-conflict adjudication in constitutional AI
- Three-tier evidence model: behavioral, process, and architectural
- Constitution stack reference architecture (6-stage pipeline)
- Bridge claim connecting alignment measurement to consciousness assessment
- 200+ benchmark scenarios across 8 industry verticals
- Empirical baseline on 19 frontier models with statistical significance testing
How the Papers Connect
Paper 1 proves the bridge. Paper 2 builds the measurement tool.
"The Hard Part Is ∆" establishes that value-conflict adjudication and consciousness assessment share architectural requirements. The same features a system needs to genuinely adjudicate between competing values — information integration, global workspace access, metacognitive monitoring — are the features that consciousness theories identify as relevant markers.
The TCAS paper operationalizes this insight into a measurement framework. Its four streams (B/M/P/O) assess exactly those architectural features, with validity checks that prevent the measurement from being gamed or misinterpreted.
Together: one theoretical framework for understanding the connection, one measurement tool for assessing it, and two governance applications (compliance auditing via ∆Bench, welfare governance via TCAS).
Collaborate With Us
We're looking for research collaborators, institutional partners, and organizations interested in governance-ready AI measurement.
Get in Touch →