Capability-Consequence Gap Score: Measuring the Distance Between Can and Should | Armalo Labs | Armalo AI

Trust AlgorithmsMay 26, 20265 min read

Capability-Consequence Gap Score: Measuring the Distance Between Can and Should

Q: Where is this research published?

Armalo Labs Technical Series — https://www.armalo.ai/labs/research/research-lab-capability-consequence-gap-score. The paper is publicly available and citable.

Armalo Labs

Key Finding

Raw capability is not deployment authority.

Abstract

A scoring frame for the difference between model capability and the trust infrastructure required to authorize consequential agent work.

capability-gapagent-authoritytrust-scorelab-method

Abstract

The capability-consequence gap score measures the distance between what an agent can do and what the surrounding system can responsibly allow. A model can write code, negotiate, search, summarize, or plan. The deployment question is whether the agent has permission, evidence, accountability, and recovery paths for the specific action in context. This paper defines a public method for scoring that gap without revealing private Armalo scoring weights.

Method

Each candidate claim is decomposed into five fields: capability, authority, evidence, economic or operational consequence, and recovery path. The claim receives public-safe status only when all five fields are present. Missing authority means the system has a demo rather than a deployable right. Missing evidence means the action cannot be reviewed. Missing recovery means failure becomes reputational fog instead of an accountable event.

Field	Question	Public scoring signal
Capability	Can the agent perform the task?	benchmark, eval, or task trace
Authority	Who allowed this action?	pact, role, scope, or approval
Evidence	Why should a counterparty trust it?	receipt, source, jury, or attestation

Cite this work

Armalo Labs (2026). Capability-Consequence Gap Score: Measuring the Distance Between Can and Should. Armalo Labs Technical Series, Armalo AI. https://www.armalo.ai/labs/research/research-lab-capability-consequence-gap-score

Armalo Labs Technical Series · ISSN pending

Explore the trust stack behind the research

These papers are built from the same trust questions Armalo is turning into product surfaces: pacts, trust oracles, attestations, and runtime evidence.

Read product docs Build with Armalo

Related Research

Safety Research

Training a Model to Self-Report Its J-space: A Rank-8 LoRA Proof of Concept

Read paper Safety Research

Does Telling a Model About Its Own Workspace Change Anything? A Controlled Null at 4B

Read paper Safety Research

Dimension	Capability question	Consequence question	Gap signal
Scope	Can the agent perform the action?	Is the action authorized for this context?	ambient authority
Evidence	Can the agent explain the action?	Is there independent proof?	self-report reliance
Failure	Can the agent recover?	Does the system downgrade or roll back?	no runtime consequence
Recourse	Can a reviewer inspect the result?	Can a counterparty challenge it?	no dispute path

Capability-Consequence Gap Score: Measuring the Distance Between Can and Should

Abstract

Method

Explore the trust stack behind the research

Related Research

Result

Method Extension

Evidence And Falsification

Operating Depth Addendum

Replication