Humanoid robot moving a tote inside a captured indoor facility

Real-site robot evaluation

Test robot policies before field time.

Compare your policy against earlier checkpoints, another team, or a vendor runner on the same captured task pack — with provenance and proof boundaries attached.

Captured real sites · Provenance attached · Rank fidelity, not guarantees

Request evaluation · See how it works

Policy rank · RUN-2049

Packing cell · 500 episodes · rank fidelity

1. Vendor B · 88%
2. Team v4 · 72%
3. Team v3 · 41%

Illustrative readout. Generated and simulated media is review support — not real-world proof.

How it works

Capture first. Package the proof. Decide the next test.

A run is configured per site. Blueprint turns a real captured site into a comparable task envelope so you can rank policies before spending scarce robot time.

Capture the site

A capturer records the real indoor site as a task pack — walkthrough media, depth, poses, and capture notes.

Package the evidence

The capture becomes a site-specific package with provenance, rights, and privacy limits attached and visible.

Run the comparison

Your policy is ranked against earlier checkpoints, another team, or a vendor runner on the same task envelope.

Decide the next test

Use the ranking, failure clusters, and missing-proof labels to pilot, tune, recapture, or hold.

Same task, same robot

One captured envelope. A clear policy ranking.

Compare your own checkpoints or policies submitted by other teams and vendors under one captured site, task, and threshold scope. Rankings are diagnostic and reflect rank fidelity within that scope; they are not a universal accuracy guarantee.

Predicted success · RUN-2049 · 500 episodes

1. Vendor B · 88%
2. Team v4 · 72%
3. Team v3 · 61%
4. Checkpoint v2 · 38%

Illustrative values. Correlation reference 0.929 (SC3-Eval).
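As a rough illustration of what "rank fidelity" can mean, the sketch below compares a predicted ordering with a field-measured ordering using Spearman rank correlation. This is an assumption for illustration only: the page does not say which statistic the 0.929 reference uses, the `field` numbers are hypothetical placeholders rather than measured results, and the helper names `ranks` and `spearman` are invented here.

```python
from statistics import mean


def ranks(values):
    """Return 1-based ranks (1 = highest value); tied values share the average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j across any run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman rank correlation: Pearson correlation computed on the ranks.

    Undefined (division by zero) if either input is constant.
    """
    rx, ry = ranks(xs), ranks(ys)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Predicted success from the illustrative readout above.
predicted = {"Vendor B": 0.88, "Team v4": 0.72, "Team v3": 0.61, "Checkpoint v2": 0.38}

# HYPOTHETICAL field success rates, made up for this example only.
field = {"Vendor B": 0.80, "Team v4": 0.75, "Team v3": 0.50, "Checkpoint v2": 0.55}

names = list(predicted)
rho = spearman([predicted[n] for n in names], [field[n] for n in names])
print(f"Spearman rank correlation: {rho:.3f}")
```

A value near 1 would mean the predicted ordering matches the field ordering; it says nothing about whether the predicted percentages themselves are accurate, which is the distinction the readout draws between rank fidelity and an accuracy guarantee.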

Command center

See the clips.

First-person POV clips make policy failures easier to review across factory, warehouse, industrial, and home-task variants.

Review media, not real-world proof

Generated and simulated POV clips are review support for inspecting failure modes. Raw capture is the only real-world proof in a package.

First-person review clip of a factory conveyor task

First-person review clip of a warehouse tote task

First-person review clip of a packing-cell task

First-person review clip of an inspection-bench task

First-person review clip of a machine-tending task

First-person review clip of a loading-dock task

First-person review clip of a laundry-folding task

First-person review clip of a cold-storage task

First-person review clip of a dishwasher task

First-person review clip of a retail-backroom task

Boundary: Blueprint uses policy-evaluation research as category evidence for ranking and diagnostic workflows. It does not turn a virtual score into a universal accuracy guarantee or a public policy-ranking result outside the measured evaluation scope.

Monochrome capture of an indoor route scan

Request evaluation

Rank your policies before field time.

Bring your checkpoints, a teammate's policy, or a vendor runner. We package a captured real site and return a ranked, proof-bounded readout.

Request evaluation · See how it works