Humanoid robot moving a tote inside a captured indoor facility
Real-site robot evaluation

Test robot policies before field time.

Compare your policy against earlier checkpoints, another team, or a vendor runner on the same captured task pack — with provenance and proof boundaries attached.

Captured real sitesProvenance attachedRank fidelity, not guarantees
Policy rankRUN-2049

Packing cell · 500 episodes · rank fidelity

1Vendor B
88%
2Team v4
72%
3Team v3
41%

Illustrative readout. Generated and simulated media is review support — not real-world proof.

How it works

Capture first. Package the proof. Decide the next test.

A run is configured per site. Blueprint turns a real captured site into a comparable task envelope so you can rank policies before spending scarce robot time.

01

Capture the site

A capturer records the real indoor site as a task pack — walkthrough media, depth, poses, and capture notes.

02

Package the evidence

The capture becomes a site-specific package with provenance, rights, and privacy limits attached and visible.

03

Run the comparison

Your policy is ranked against earlier checkpoints, another team, or a vendor runner on the same task envelope.

04

Decide the next test

Use the ranking, failure clusters, and missing-proof labels to pilot, tune, recapture, or hold.

Same task, same robot

One captured envelope. A clear policy ranking.

Compare your own checkpoints or policies submitted by other teams and vendors under one captured site, task, and threshold scope. Rankings are diagnostic rank fidelity, not a universal accuracy guarantee.

Predicted successRUN-2049 · 500 episodes
1Vendor B
88%
2Team v4
72%
3Team v3
61%
4Checkpoint v2
38%

Illustrative values. Correlation reference 0.929 (SC3-Eval).

Command center

See the clips.

First-person POV clips make policy failures easier to review across factory, warehouse, industrial, and home-task variants.

Review media, not real-world proof
Generated and simulated POV clips are review support for inspecting failure modes. Raw capture is the only real-world proof in a package.
First-person review clip of a factory conveyor task
First-person review clip of a warehouse tote task
First-person review clip of a packing-cell task
First-person review clip of an inspection-bench task
First-person review clip of a machine-tending task
First-person review clip of a loading-dock task
First-person review clip of a laundry-folding task
First-person review clip of a cold-storage task
First-person review clip of a dishwasher task
First-person review clip of a retail-backroom task

Boundary: Blueprint uses policy-evaluation research as category evidence for ranking and diagnostic workflows. It does not turn a virtual score into a universal accuracy guarantee or public policy-ranking result outside the measured evaluation scope.

Request evaluation

Rank your policies before field time.

Bring your checkpoints, a teammate's policy, or a vendor runner. We package a captured real site and return a ranked, proof-bounded readout.