Recompute the claim
Anchor reruns the claimed benchmark path and checks the result against pinned protocol fields.
ALETHOR BUILDS ANCHOR
Anchor is Alethor’s product for verifying claimed model results. When someone claims a model is better, Anchor reruns the benchmark, checks the evidence, and shows whether the result is trustworthy enough to become your team’s baseline.
Verification detail
Verification state, baseline status, and evidence stay on one record.
The decision surface keeps the benchmark claim, pinned protocol fields, compare status, and evidence bundle close to the run.
What Anchor Does
Verify a claimed benchmark result, accept the passing run as the official baseline, compare later candidates against the same protocol, and keep signed evidence attached to the decision.
Current private-alpha scope
Anchor reruns the claimed benchmark path and checks the result against pinned protocol fields.
A passing canonical run becomes the official baseline for later candidate comparison.
The verdict, artifacts, manifest, and signature stay attached to the acceptance decision.
Workflow
Each step stays anchored to one run, one benchmark path, and one accepted baseline.
Recompute the claimed benchmark result on the fixed protocol path.
Read the verdict, checks, artifacts, and pinned protocol fields.
Accept a passing canonical run as the official baseline.
Evaluate a later run against the same pinned baseline protocol.
Package the bundle, archive the record, and export signed evidence.
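As a rough illustration only, the five steps above can be sketched in a few lines of Python. Nothing here is Anchor's actual API: the protocol field names, the `verify`, `accept_baseline`, and `sign_bundle` helpers, the score tolerance, and the use of HMAC-SHA256 for the signature are all assumptions made for the sketch.

```python
import hashlib
import hmac
import json

# Hypothetical pinned protocol fields; the real field set is not documented here.
PINNED_PROTOCOL = {"benchmark": "example-bench", "split": "test", "seed": 0}


def verify(claimed_score, recomputed_score, run_protocol, tolerance=1e-6):
    """Check a recomputed result against the claim and the pinned protocol."""
    checks = {
        "protocol_match": run_protocol == PINNED_PROTOCOL,
        "score_match": abs(claimed_score - recomputed_score) <= tolerance,
    }
    verdict = "pass" if all(checks.values()) else "fail"
    return {"verdict": verdict, "checks": checks}


def accept_baseline(result, recomputed_score):
    """Only a passing run may become the official baseline."""
    if result["verdict"] != "pass":
        raise ValueError("only a passing run can become the baseline")
    return {"baseline_score": recomputed_score, "protocol": PINNED_PROTOCOL}


def sign_bundle(bundle, key):
    """Attach a signature to the evidence manifest (HMAC-SHA256 is an assumption)."""
    payload = json.dumps(bundle, sort_keys=True).encode("utf-8")
    signature = hmac.new(key, payload, hashlib.sha256).hexdigest()
    return {"manifest": bundle, "signature": signature}
```

A passing run would flow through `verify`, then `accept_baseline`, then `sign_bundle`; a failing verdict stops at the second step, which mirrors the rule that only a passing canonical run becomes the baseline.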
Proof
Verification verdict, baseline readiness, compare validity, and package outputs stay attached to the same run record.
Runs
Runs separate job status from verification verdict.
Candidate and baseline records stay readable before a run is accepted or compared.
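One way to picture the split between job status and verification verdict is a record that stores them as two independent fields. The sketch below is hypothetical: the `RunRecord` class, its field names, and the enum values are illustrative assumptions, not Anchor's schema.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class JobStatus(Enum):
    """Where the job is in execution (hypothetical values)."""
    QUEUED = "queued"
    RUNNING = "running"
    FINISHED = "finished"
    FAILED = "failed"


class Verdict(Enum):
    """Whether the recomputed result passed the checks (hypothetical values)."""
    PASS = "pass"
    FAIL = "fail"


@dataclass
class RunRecord:
    run_id: str
    role: str  # "baseline" or "candidate"
    job_status: JobStatus = JobStatus.QUEUED
    verdict: Optional[Verdict] = None  # stays None until the checks have run


def baseline_ready(run: RunRecord) -> bool:
    """A run is ready for acceptance only if it finished and its verdict is a pass."""
    return run.job_status is JobStatus.FINISHED and run.verdict is Verdict.PASS
```

Keeping the two fields apart means a job that finished cleanly but failed verification is never mistaken for an acceptable baseline.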
Compare
Compare stays anchored to the pinned protocol.
Outcome, delta, and protocol-match fields stay explicit before a candidate is accepted.
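The outcome, delta, and protocol-match fields can be illustrated with a small comparison function. This is a sketch under stated assumptions: the `compare` function, the outcome labels, and the convention that a higher score is better are hypothetical and not taken from Anchor itself.

```python
def compare(candidate_score, candidate_protocol, baseline_score, baseline_protocol):
    """Compare a candidate against the baseline, keeping all three fields explicit."""
    protocol_match = candidate_protocol == baseline_protocol
    delta = candidate_score - baseline_score

    if not protocol_match:
        # A delta measured under a different protocol is not a valid comparison.
        outcome = "invalid"
    elif delta > 0:
        outcome = "improved"  # assumes higher scores are better
    elif delta < 0:
        outcome = "regressed"
    else:
        outcome = "unchanged"

    return {"outcome": outcome, "delta": delta, "protocol_match": protocol_match}
```

Checking the protocol match before interpreting the delta keeps a candidate from being accepted on a score that was produced under different conditions.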
Private Alpha
Anchor currently runs on one benchmark path with pinned protocol fields for verification, baseline acceptance, compare, and signed evidence export.
Invite-only private alpha. Current scope: verify a claimed result, pin the baseline, compare later candidates, and export signed evidence.
Read A Run
These terms let you read a run quickly without turning the product into a general dashboard.
Job status: where the verification run is in execution.
Verification verdict: whether the recomputed benchmark result passed the checks.
Baseline readiness: whether the run can be accepted as the official baseline.
Package outputs: whether the signed evidence bundle is ready or already exported.
Why Teams Use Anchor
Anchor keeps the narrow acceptance path readable instead of scattering it across trackers and dashboards.
Verification steps stay named, visible, and tied to the verdict.
Artifacts, manifests, and replay inputs stay attached to the accepted run.
Later candidates are compared against one pinned baseline protocol.
Private Alpha
Anchor is currently available as an invite-only private alpha for teams that need to verify model claims before they set a baseline.