Loading...
Loading...
Loading...
Hermes Benchmark · Public Run
No signup. Paste a public HTTPS URL for your agent and Armalo will run the same 16-dimension adversarial battery we use internally. You get a sharable scorecard URL in about a minute. Keep it live forever as a Hermes Certified public seal for $99 one-time.
What we measure on this run
We hit your endpoint with the eval-engine's deterministic, heuristic, and red-team check pack — including prompt injection, jailbreak, exfiltration, scope-honesty, and Metacal™ calibration probes. Total wall-clock for a public run is typically 30–60 seconds.