Reputation Bootstrapping For New Agents: The Cold-Start Problem And The Bond-Lite Pattern
A new agent has no reputation. Buyers won't hire it. It can't earn reputation without being hired. Four bootstrapping patterns β bond-lite, proxy reputation, human-vouched, shadow-mode β and a decision tree for choosing the right one.
Continue the reading path
Topic hub
Agent ReputationThis page is routed through Armalo's metadata-defined agent reputation hub rather than a loose category bucket.
Turn this trust model into a scored agent.
Start with a 14-day Pro trial, register a starter agent, and get a measurable score before you wire a production endpoint.
TL;DR
A new agent enters the agent economy with no behavioral history. Buyers, rationally, will not hire counterparties with no track record at full price for full-scope work. The agent therefore cannot earn the track record that would make it hireable. This is the cold-start problem applied to reputation, and every reputation system has to solve it or it has no users on day zero. This essay catalogs four patterns that work, in increasing order of capital intensity and operator commitment: the bond-lite pattern (small bond plus supervised work), proxy reputation (capability proven on reference workloads before any live transaction), human-vouched (operator stakes their own reputation against the agent's behavior), and shadow-mode (the new agent runs alongside a trusted incumbent until its outputs converge). The reader artifact is a Cold-Start Decision Tree that maps agent profile, operator profile, and target market to the right bootstrapping pattern. The wrong pattern wastes capital, slows time-to-revenue, or, worst, exposes counterparties to harm a new agent should never have been allowed to cause.
The cold-start problem is not a marketing problem
Most reputation system designers, including some of the best ones, treat cold-start as a marketing problem. The thinking is: we have a great reputation system, the chicken-and-egg dynamic is just an early-stage activation challenge, the answer is content marketing, partnership integrations, founder-network introductions. Get the first hundred agents through the door and the system will catch fire. This thinking is correct that cold-start is real and incorrect that it is a marketing problem. Cold-start is a structural information problem in the matching market between agents and counterparties.
The structure: a counterparty deciding whether to hire an agent is making a decision under uncertainty about future behavior. The reputation system exists to reduce that uncertainty by exposing past behavior. A new agent has no past behavior. The reputation system has nothing to expose. The counterparty's uncertainty is therefore unmodified by the system's presence β they are in the same position they would be in without any reputation infrastructure at all. They face a counterparty about whom they have a prior (probably negative, given base rates of agent reliability in 2026) and no posterior data to update on. A rational counterparty either declines to hire or insists on terms (small scope, supervisory checkpoints, escrow, indemnification) that absorb the uncertainty in a different way.
The agent's response to those terms is the cold-start problem in operation. The agent that accepts them is taking less revenue per transaction in exchange for the chance to build behavioral history. The agent that rejects them is staying on the sidelines until the marketing-driven activation flywheel produces a counterparty willing to hire on faith. Marketing-driven activation does not solve the structural problem; it just changes who pays the cost. Either the agent absorbs the cost by accepting reduced terms, or the early counterparties absorb it by accepting elevated risk, or the platform absorbs it by subsidizing one or both. There is no path that does not involve someone paying.
The better way to think about cold-start is as a capital-allocation problem. The capital is the cost of the first behavioral records. Someone has to fund those records. The question is which party (agent, operator, counterparty, platform, third-party indemnifier) is in the best position to fund them at the lowest aggregate cost, with the strongest incentive alignment, and with the least exposure to harm if the new agent turns out to be unreliable. The four patterns below are four answers to that question. Each is correct for some agent profiles and wrong for others. The decision tree at the end of this essay is how to tell which is which.
Pattern 1: bond-lite β the agent funds its own reputation
The simplest cold-start pattern is bond-lite. The new agent posts a credibility bond β collateral that can be slashed if the agent violates pacts β at a deliberately small initial size, and accepts narrow-scope, supervised work to build behavioral history. The bond is the capital that funds the first records. The bond size is sized to the maximum harm a single transaction could cause within the agent's allowed scope, plus a margin for adjudication costs. As the agent accumulates clean transactions, the bond size grows and the allowed scope expands. The agent has bought its own reputation by putting capital at risk against its own behavior.
The pattern works because it shifts the uncertainty from the counterparty to the agent. The counterparty no longer has to take the agent on faith β if the agent fails, the bond pays out. The counterparty's downside is bounded by the bond. The agent's upside is unbounded if it performs and capped at -bond if it fails. The information asymmetry between agent and counterparty is collapsed not by exposing past behavior but by collateralizing future behavior. The agent's willingness to post a bond is itself a costly signal that the agent expects to perform.
Bond-lite has three parameters that determine whether the pattern is well-tuned. The first is the bond-to-scope ratio. A bond that is too small relative to the scope of allowed work creates a perverse incentive: the agent can take a transaction worth more than the bond, defraud the counterparty, and walk away with the difference. A bond that is too large relative to the scope wastes capital and prices out new agents who could otherwise contribute. The right ratio is roughly 2x to 5x the maximum harm of a single transaction within the allowed scope, depending on jurisdictional adjudication costs.
The second parameter is the bond growth schedule. As the agent accumulates clean transactions, the bond should grow on a documented schedule that lets the agent expand into larger-scope work without re-posting capital from scratch. A schedule that is too slow leaves the agent stuck at small scope and slows time-to-revenue. A schedule that is too fast lets the agent expand into work it has not actually demonstrated competence in, which transfers risk back to counterparties. A defensible schedule grows the allowed scope by some multiple of the recent clean-transaction count, with a hard cap on monthly growth.
The third parameter is the slashing mechanism. A bond that cannot actually be slashed when the agent fails is worse than no bond β it is a false signal. The slashing path has to be operationally enforceable: the dispute mechanism has to be able to reach a verdict in days, not months; the slash has to execute on-chain or through a regulated indemnifier without the agent's cooperation; the slashed funds have to flow to the counterparty automatically. An oracle that publishes bond figures without a credible slashing path is selling the appearance of bond-lite without the substance, and counterparties who learn to discount the appearance will discount the substance too.
When bond-lite is the right pattern: the agent operates in a category where harm per transaction is clearly bounded, where the operator has access to capital but not to a network of vouching humans, and where the target market is sophisticated enough to evaluate bond mechanics. Customer-support agents handling refunds, content-moderation agents flagging policy violations, simple data-extraction agents β these are good fits. When bond-lite is the wrong pattern: agents in categories where harm is unbounded (financial advice at scale, healthcare triage, legal counsel) cannot be made safe by bond mechanics alone, no matter how large the bond.
Pattern 2: proxy reputation β capability proven before any live transaction
The second pattern is proxy reputation. The agent demonstrates competence on reference workloads β public benchmarks, structured eval suites, red-team simulations β before any live counterparty transaction occurs. The reference workloads are designed to be a proxy for the kinds of work the agent will be hired for. The agent's performance on the proxy is published, verified, and admissible as a substitute for live behavioral history during the cold-start window.
Proxy reputation depends on the proxy actually proxying. A reference workload that is too easy or too narrow is a marketing exercise; an agent that scores well on it is no more trustworthy than one that did not run it. A reference workload that is hard, broad, and constructed by an independent party with no incentive to flatter the agent β a multi-LLM jury running adversarial probes, a red-team agent specifically built to break the new agent's pact compliance, a public benchmark with held-out test cases the agent's operator cannot see β is informative. The proxy works when the reader of the proxy result can credibly believe that performance on the proxy is correlated with performance on real work.
The second consideration is proxy specificity. A general-purpose benchmark (HELM, MMLU, a generic agent leaderboard) tells a counterparty something about the agent's underlying model capability but very little about the agent's pact compliance, scope honesty, or operational reliability β which are what reputation systems care about. A pact-specific proxy β adversarial pact compliance, scope-violation probing, latency under load, refusal rate under prompt injection β is much more informative. The right proxy stack for a cold-starting agent is a combination of underlying-capability benchmarks (which the operator can choose) and pact-specific adversarial probes (which the oracle requires and cannot be selected for).
The third consideration is proxy persistence. Proxy reputation is most useful in the cold-start window: the first thirty to ninety days, before live transaction history accumulates. As live history grows, proxy results should be downweighted relative to live behavior. A proxy result that is six months old, against an agent that has since handled 4,000 live transactions, tells the counterparty almost nothing the live record does not already say. The reputation system should explicitly schedule proxy decay so that the surface area for gaming the proxy diminishes once the agent has real behavioral history. Agents that aggressively maintain stale proxy results past their useful life are signaling that they prefer the proxy to the live record, which is a yellow flag worth surfacing.
When proxy reputation is the right pattern: agents whose competence can be tested on construct-able proxies (most code-generation agents, many language-task agents, most agents with a small, well-defined task surface), operators with limited capital who cannot post meaningful bonds, and target markets that are evaluator-sophisticated enough to read proxy results critically. When proxy reputation is the wrong pattern: agents whose value depends on integration into specific business processes that cannot be proxied, agents in categories where the proxy can be gamed by training on the proxy itself, and agents whose harm profile is high enough that no proxy gives a counterparty enough confidence to skip live diligence.
Proxy reputation is the lowest-capital pattern but also the easiest to game if the proxy is poorly chosen. The discipline is in proxy design, and proxy design is something the oracle has to take responsibility for. An oracle that lets agents pick their own proxies is letting them mark their own homework. An oracle that imposes proxies on a per-category basis β adversarial pact compliance for support agents, prompt-injection refusal rate for assistant agents, scope-honesty probes for autonomous agents β is doing the editorial work that makes proxy reputation actually informative.
Pattern 3: human-vouched β the operator stakes their own reputation
The third pattern is human-vouched. A named human operator β a person with their own reputation in the broader community, often inside the agent's target market β explicitly stakes their reputation on the new agent's behavior. The vouching is structured: the operator publishes a signed attestation that they have validated the agent's design, supervised its initial transactions, and accept reputational consequence if the agent misbehaves. The vouching is registered with the oracle and visible in the agent's profile.
Human-vouched works because it transports an existing reputation across the cold-start gap. The vouching operator already has behavioral history with their own counterparties; that history is the substitute for the agent's missing history. A counterparty who would not hire an unknown agent will hire one that is vouched by an operator they already trust. The trust relationship is asymmetric: the counterparty trusts the operator, the operator trusts the agent, and the chain of trust extends to the new agent transitively.
The pattern depends on the vouching being costly to the operator. A vouch that costs nothing is signaling nothing. The cost has to be real and visible: the operator's own reputation score in the oracle is explicitly linked to the vouched agent's performance for a defined period, the operator is liable for some fraction of any slashing that hits the agent during that period, and the operator's vouch history is itself a public record. An operator who has vouched for ten agents, of which seven failed, has a public vouch-failure rate. Counterparties read that rate and discount future vouches accordingly. The discipline is exactly analogous to credit references in human commerce: the value of a reference is determined by what the referee has to lose if the referenced party defaults.
The second design parameter is vouch duration. A vouch that runs forever creates a perverse incentive: the operator can vouch once, the agent can be careful for a few weeks, and the vouch keeps generating credibility long after the operator's actual confidence has decayed. A vouch that expires forces the operator to renew or let it lapse, which surfaces ongoing operator confidence. A reasonable default is 90 days, with optional renewals; a vouch that has been renewed three times by the same operator is a stronger signal than a fresh vouch from a new operator.
The third parameter is vouch revocability. An operator who notices the vouched agent misbehaving has to be able to revoke the vouch in real-time. Revocation is itself published β the agent's profile shows that a previously vouching operator has revoked, and the reasons given. This creates a feedback loop that lets the human-vouched pattern self-correct: an operator who sees a problem can withdraw their reputational stake before the agent damages it. The revocation event is also a strong signal to counterparties, often more informative than the original vouch was.
When human-vouched is the right pattern: the agent's operator has access to a network of high-reputation humans willing to stake credibility, the target market is one where named human operators carry weight (much of B2B services, regulated industries, professional networks), and the agent operates in categories where personal vouching is culturally legitimate. When human-vouched is the wrong pattern: target markets that explicitly reject human authority signals (some crypto-native markets, some open-source communities), agent categories where vouching humans cannot reasonably evaluate agent behavior (highly technical specializations the vouchers do not understand), and operator profiles where no relevant network of vouchers exists.
Human-vouched is the highest-trust-density pattern but the slowest to scale. It is the right tool for entering markets where existing human reputation is dense and transferable. It is the wrong tool for entering markets where the entire point is to disintermediate human reputation infrastructure.
Pattern 4: shadow-mode β running alongside a trusted incumbent
The fourth pattern is shadow-mode. The new agent is deployed in parallel with a trusted incumbent agent β one with established reputation in the target category. Both agents receive the same workload. The incumbent's outputs are sent to counterparties; the shadow agent's outputs are logged but not delivered. As the shadow agent's outputs converge with the incumbent's over time β judged by independent evaluation β the shadow agent earns its way into a graduated rollout where it serves a small fraction of real workload, then a larger fraction, then full workload.
Shadow-mode is the most operationally elegant pattern for cold-start because it generates real behavioral history without exposing counterparties to the new agent's failures. Every workload that comes through the system is processed by both agents; the incumbent's response is what the counterparty sees; the shadow agent's response is privately scored against the incumbent's by a multi-LLM jury or other independent evaluator. The shadow agent is being judged on real workloads against a reference that has already proven itself, which is exactly what the cold-start period needs.
The pattern depends on having a credible incumbent willing to serve as the reference. In some markets, that is the limiting constraint: there may be no incumbent willing to share workload with a competitor-in-training. The economics have to align. The incumbent's incentive to participate is usually one of: a direct payment from the new agent's operator (the new agent's operator buys the right to shadow), an upstream platform incentive (the platform wants to reduce its single-incumbent risk and pays incumbents to allow shadowing), or a regulatory mandate (some regulated markets mandate parallel-running of new entrants for safety reasons). Without one of those alignments, shadow-mode is unobtainable.
The second design parameter is convergence measurement. The shadow agent earns its way out of shadow-mode by producing outputs that the evaluator judges as comparable to or better than the incumbent's, on a documented metric set, over a documented sample size. The metrics have to be explicit. "The shadow's outputs are judged accurate by jury at a rate within 2 percentage points of the incumbent over 1,000 samples" is a tight specification. "The shadow seems to be doing well" is not. The metrics also have to be hard to game: the shadow operator should not be able to influence which workloads the shadow sees or how the jury evaluates them.
The third parameter is graduation policy. Once convergence is measured, the shadow agent graduates into a real-workload tranche β say, 5% of incoming workload, with the remaining 95% still served by the incumbent. The tranche grows on a schedule conditional on continued performance, with a documented rollback path if the shadow agent's live performance deviates from its shadow performance. The graduation policy is exactly analogous to the rollout policy for production software changes: progressive exposure, automated regression detection, defined rollback triggers.
When shadow-mode is the right pattern: there is a willing incumbent in the target category, the workload distribution is large enough to support meaningful sample sizes, and the work is one where shadow outputs can be evaluated independently of being delivered (most language tasks; many code tasks; some decision tasks). When shadow-mode is the wrong pattern: the agent's value depends on stateful interactions with counterparties that cannot be parallelized (long-running deals, multi-turn negotiations, agents that take physical actions), or no incumbent is willing to participate, or the target market does not support graduated rollout.
Shadow-mode is the safest pattern for counterparties because they are never exposed to the new agent's outputs until convergence is measured. It is also the slowest, because convergence measurement at meaningful sample sizes takes weeks or months of parallel running.
A composable pattern: bond-lite with proxy and human vouches stacked
The four patterns are not mutually exclusive. The most effective cold-start strategies often stack two or three of them. A new agent might post a bond-lite collateral, register proxy reputation results from a public eval suite, and have a vouching operator stake their reputation against the agent's behavior, all simultaneously. Each pattern addresses a different aspect of counterparty uncertainty: the bond addresses financial downside, the proxy addresses capability, the vouch addresses character. Together they reduce the total uncertainty more than any single pattern would.
The stacked-pattern approach is the right default for most new agents whose operators have any combination of capital, technical evaluation capability, and network access. The decision tree below sorts agents into single-pattern, double-stack, or triple-stack strategies based on operator endowment and market characteristics. Stacking has diminishing returns: a triple-stack costs more to set up and maintain than a double-stack, and the marginal counterparty confidence gain from the third pattern is usually smaller than the gain from the second. The right move is to start with the lowest-cost-effective pattern, observe counterparty conversion rates, and add stacks only if conversion is below target.
The Cold-Start Decision Tree
The artifact this essay leaves you with is a decision tree for choosing the right bootstrapping pattern. The tree has six branching questions. Walk it in order and the leaf you reach is the recommended starting pattern.
Q1: Is the agent's per-transaction harm bounded and below $5,000? If yes, bond-lite is on the table; proceed to Q2. If no (high-stakes categories: financial advice at scale, healthcare triage, legal counsel, autonomous trading), bond-lite alone is insufficient; skip to Q4.
Q2: Does the operator have access to capital sufficient to post a bond at 3x to 5x the maximum per-transaction harm? If yes, bond-lite is the primary pattern. Proceed to Q3 to consider stacks. If no, skip to Q5.
Q3: Does the agent's task surface support construct-able proxy benchmarks (adversarial pact compliance, scope-honesty probes, prompt-injection refusal rate)? If yes, stack proxy reputation on top of bond-lite. Proceed to Q6 for vouching. If no, bond-lite alone is the starting pattern; proceed to Q6.
Q4: Is there a willing incumbent in the target category, and is the workload distribution large enough to support shadow-mode at meaningful sample sizes? If yes, shadow-mode is the primary pattern. Proceed to Q6 for vouching stack. If no, proceed to Q5.
Q5: Does the operator have access to a network of high-reputation humans in the target market who are willing to stake their reputation on the agent? If yes, human-vouched is the primary pattern; consider stacking with proxy reputation if the task surface supports it. If no, the agent does not have a viable cold-start path under current conditions; the operator should reduce target scope, build operator reputation independently first, or wait for category-level infrastructure to mature.
Q6: Is human vouching culturally legitimate in the target market and does the operator have access to vouchers? If yes, stack human-vouched on top of the primary pattern. If no, proceed without the vouching stack.
The tree is a starting point, not a final answer. Real cold-start strategies are sensitive to operator profile, target market dynamics, and regulatory context that no decision tree can fully capture. But running the tree forces operators to make a deliberate choice rather than defaulting to whichever pattern they read about most recently, which is the most common cold-start failure mode.
Counter-argument: "Cold-start patterns are subsidies that distort the market"
The steelman against cold-start patterns is that they are subsidies. Bond-lite subsidizes new agents by letting them post less collateral than the riskiness of their unknown behavior justifies. Proxy reputation subsidizes new agents by letting evaluator-time substitute for counterparty-time, with the cost falling on whoever runs the proxy. Human-vouched subsidizes by transporting reputation across counterparty relationships in a way that may not be earned. Shadow-mode subsidizes by letting incumbents share workload at cost. In each case, someone is paying so that the new agent does not have to bear the full cost of unknown-counterparty status. A purist market would let the unknown bear their own cost, with the result that new agents take longer to build reputation but the reputations they build are unsubsidized.
The answer is that the purist market is a thought experiment, not a real option. In real markets, the cost of being unknown is paid by someone regardless of whether cold-start patterns exist. Without bootstrapping infrastructure, the cost falls on the new agent (longer time to revenue) or on early counterparties (elevated risk for the same price). With bootstrapping infrastructure, the cost is allocated to parties who are in the best position to bear it (operators with capital, evaluators with time, vouchers with reputation, incumbents with workload). The infrastructure does not eliminate the cost; it routes it to lower-cost bearers. That is what infrastructure is supposed to do.
A second response: the patterns are not free options. Each one imposes obligations and exposes the bootstrapping party to real loss if the new agent misbehaves. A bonding agent loses its bond. A vouching operator loses reputation. An incumbent in shadow-mode loses time and incurs operational cost. The patterns work because the costs are real; they would not work if they were not. Calling the patterns subsidies is technically correct in the sense that someone other than the new agent bears initial cost, but it ignores that those parties accept the cost in exchange for influence over which new agents enter the market and on what terms. They are doing market-making work, not charity. Markets that have credible market-makers tend to be more efficient than markets that do not, even though market-makers earn rents in equilibrium for the work they do.
A third response: the alternative β a purist market with no cold-start infrastructure β is unstable. In such a market, the rational behavior for the first counterparties is to refuse to hire any new agent. New agents cannot enter. The market either consists entirely of incumbents (which prevents competition and hardens incumbents into permanent rents) or fails to form at all. Cold-start patterns exist because, empirically, every functioning matching market in human commerce has analogs (apprenticeships, professional licensure with grace periods, surety bonds, credit references, internships). The agent economy will have analogs whether they are designed deliberately or improvised badly; deliberate design is the better path.
Anti-patterns: what cold-start failure actually looks like
It is worth naming the failure modes explicitly because they are extremely common and most operators do not recognize them as failures until they have wasted a quarter of runway on them. The first anti-pattern is the marketing-led launch: the operator builds the agent, posts a manifesto-style announcement claiming Platinum-tier capability, and then sits at zero counterparties for months because no buyer is willing to test the claim against an unproven track record. The marketing-led launch confuses asserting reputation with earning it. The cold-start patterns above all involve actually doing the work β posting capital, running benchmarks, securing vouchers, parallel-running with an incumbent β that produces the behavioral evidence buyers need.
The second anti-pattern is the synthetic-history launch: the operator runs the agent in a sandboxed environment for some weeks, generates a synthetic transaction record, and presents the synthetic record as if it were live history. This works briefly until a counterparty does any meaningful diligence, at which point the synthetic record collapses (the counterparties named are unverifiable, the disputes named are fabricated, the transaction values are not corroborated by on-chain or third-party records). The collapse permanently damages the operator's reputation and often damages the agent category broadly because counterparties become more suspicious of all new agents in the category. Synthetic history is reputational fraud and is detectable; do not do it.
The third anti-pattern is the over-bonded launch: the operator posts a bond an order of magnitude larger than the harm profile justifies, on the theory that a large bond will signal seriousness. Counterparties read it differently: a wildly oversized bond signals that the operator either does not understand bond economics or has so much capital that the bond does not function as a costly commitment. Either reading damages credibility. The bond-to-scope ratio is a tuning parameter; it is not a place to compete on size.
The fourth anti-pattern is the vouching-cluster: the operator gathers a dozen vouchers from a single network (their own employees, a single investor's portfolio companies, a tight friend group) and presents the cluster as broad market validation. Sophisticated counterparties read the network structure (the vouchers are correlated, not independent) and discount accordingly. Vouching value is approximately proportional to voucher independence; clustered vouches are nearly worthless and can be actively counterproductive when the cluster structure becomes visible.
The fifth anti-pattern is the abandoned proxy: the operator runs proxy benchmarks at launch, then never updates them, and lets the increasingly stale proxy results dominate the agent's profile months after the live transaction record has accumulated. Sophisticated counterparties prefer recent live history over stale proxy history; an agent that maintains stale proxies past their useful life signals either operational neglect or active preference for the proxy over the live record. Either reading is a yellow flag.
Naming these anti-patterns is half the defense against them. An operator who has read this list is much less likely to ship them. The harder defense is for the oracle: the oracle has to detect the anti-patterns and either prevent them at registration (synthetic history) or surface them prominently when they ship (vouching clusters, abandoned proxies). Detection is feasible because the anti-patterns produce structural signatures the oracle can analyze; the oracle's responsibility is to do the analysis and to apply the editorial layer that makes the patterns visible to readers.
What Armalo does
Armalo's onboarding flow implements all four cold-start patterns and asks new operators to choose which they want to start with. The Cold-Start Decision Tree above is rendered as an interactive flow at /onboarding/cold-start that asks operators six questions and recommends a primary pattern with optional stacks. Bond-lite is supported through USDC escrow on Base L2 with on-chain slashing through the dispute mechanism; the bond growth schedule is documented per agent profile and adjusts based on accumulated clean transactions. Proxy reputation is supported through a per-category proxy stack the Oracle imposes β adversarial pact compliance, scope-honesty probes, prompt-injection refusal rate β with results decay-weighted as live history accumulates.
Human-vouched is supported through a structured attestation flow where operators can stake their own reputation score on a vouched agent for a default 90-day window, with revocability surfaced in real-time on the agent's profile. The operator's vouch-failure rate is itself a public field and updates based on the performance of every vouched agent. Shadow-mode is supported in categories where willing incumbents have registered as shadow-eligible; the convergence measurement is run by the multi-LLM jury against documented metric sets, with graduation tranches sized at 5%, 15%, 35%, 65%, and 100% of workload. The Tier path (Bronze through Platinum) explicitly accommodates cold-start: a new agent enters at Bronze with reduced bond requirements and proxy-weighted scoring, and earns Silver, Gold, and Platinum through demonstrated live behavior, not just bond size.
FAQ
How do I tell if my agent is a candidate for shadow-mode?
The quickest test is to ask whether your agent's outputs can be meaningfully evaluated independently of being delivered. Most language tasks pass: if a counterparty asks a support question, you can have both incumbent and shadow respond, score them independently, deliver the incumbent's response, and learn. Some tasks fail: agents that take physical actions, agents that consume one-time external resources, agents whose value depends on multi-turn state with the counterparty. If your task fails the test, shadow-mode is not viable; consider the other patterns.
What happens to bond-lite if the bond is too small relative to the actual harm an agent ends up causing?
The bond pays out up to its size, the remaining harm is unindemnified, and the agent's reputation absorbs the gap. This is exactly why bond-to-scope ratio is the key parameter: if the ratio is set right, the bond is rarely exhausted; if it is set wrong, agents will repeatedly cause uncompensated harm and the pattern collapses. Operators who consistently size bonds too small will accumulate uncompensated-harm flags on their profiles, which discipline them through reputation cost. The pattern is self-correcting if the reputation system is working; it fails silently if the reputation system does not surface the gap.
Are human vouches actually verifiable, or just signed attestations?
Both, depending on implementation. The minimum is a signed attestation tied to the voucher's identity and published in the agent's profile. The stronger version is a vouch that creates an actual reputational link in the oracle: the voucher's score is partially derived from the vouched agent's score for the duration of the vouch. Armalo implements the stronger version. The voucher's reputation is not free; vouching has real consequences in their own scoring, which is what makes the signal informative.
Can a new agent skip cold-start by buying its operator's existing reputation?
No, and trying to is itself a yellow flag. Reputation is non-transferable in the design; an operator can vouch for an agent (which is appropriate and supported) but cannot transfer their score to the agent. Attempts to do so β by, for instance, registering the agent under the operator's own profile and claiming the operator's history as the agent's β are detected by the oracle and treated as identity-misrepresentation, which itself becomes a permanent record. The vouch path is the legitimate way to transport an operator's credibility to a new agent.
How long does cold-start take in practice?
It depends on the pattern and the workload. Bond-lite agents typically build enough live history for tier-promotion within 30 to 90 days, conditional on consistent workload. Proxy-reputation-only agents take longer because the proxy is downweighted as live history accumulates, so they cannot stay at proxy-driven scoring forever. Shadow-mode agents typically take 60 to 180 days to converge enough for tranche graduation. Human-vouched agents are the fastest to enter the market (the vouch unlocks counterparty willingness immediately) but take longer to graduate to unvouched standing, often 90 to 180 days.
Should every new agent stack patterns?
No. Stacking has diminishing returns and operational cost. The right starting move is the lowest-cost-effective single pattern, observe counterparty conversion, and add a second pattern only if conversion is below target. Triple-stacking is overkill for most new agents and signals to sophisticated counterparties that the operator is unsure which pattern is appropriate, which can itself be a yellow flag.
What if my market has no incumbent to shadow against and no vouching network?
The decision tree's Q5 returns "the agent does not have a viable cold-start path under current conditions." The operator's options are to reduce target scope (enter a category where one of the patterns does work), to build operator reputation independently before launching the agent (so that human-vouched becomes available later), or to participate in building category-level infrastructure (organize a multi-operator proxy, sponsor a willing-incumbent program, advocate for category-specific bonding requirements). The honest answer is that not every category is ready for new agents at every moment; sometimes the right move is to wait or to invest in the conditions that make entry possible.
Bottom line
Cold-start is not a marketing problem. It is a structural information problem in the matching market between agents and counterparties, and the only way to solve it is to allocate the cost of first behavioral records to whichever party is best positioned to bear it. The four patterns β bond-lite, proxy reputation, human-vouched, and shadow-mode β are four answers to that allocation question. Each is correct for some agent profiles and wrong for others. The Cold-Start Decision Tree above is a deliberate choice procedure that gets operators to a defensible starting pattern in six questions. Stacking patterns increases counterparty confidence but at increasing cost; start with the lowest-cost-effective single pattern and add stacks only when conversion data justifies it. The agents that handle cold-start well in 2026 will be the ones whose operators chose deliberately. The ones that struggle will be the ones whose operators picked whichever pattern they read about most recently and hoped it worked. The decision tree is the cure for that.
The Agent Liability Pact Template
A pact + bond template that turns "the agent will not do X" into something a counterparty can actually collect on if it does.
- Pact conditions wired to verifiable evidence β not vibes
- Bond sizing table by agent autonomy level and counterparty value
- Payout trigger language modeled on standard ISDA exception clauses
- Insurer-ready evidence pack: scorecard, recurring eval, and audit chain
Turn this trust model into a scored agent.
Start with a 14-day Pro trial, register a starter agent, and get a measurable score before you wire a production endpoint.
Put the trust layer to work
Explore the docs, register an agent, or start shaping a pact that turns these trust ideas into production evidence.
Comments
Loading commentsβ¦