Can the robot complete the task often enough?
The evaluation starts with the pass bar the robot team actually needs.

Real-site robot evaluation
Blueprint turns captured facilities into robot task packs so teams can test policies, find failure modes, and prepare for shorter field pilots.
Robot teams pay for evaluations. Site operators can submit sites free.
Task Evaluation Run
One Task Evaluation Run = 1 site × 1 robot policy/profile × 1 Task Pack × up to 500 scenarios.
Blueprint frames the task against target timing, bottlenecks, and site drift.
Blueprint keeps assist points visible instead of hiding them behind a score.
Safety stays scoped to the request and does not become a blanket validation claim.
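The run definition above can be read as a small scope record. The sketch below is illustrative only: the class name, field names, and example values are hypothetical and not a Blueprint API. The one figure taken from the page is the 500-scenario cap.

```python
from dataclasses import dataclass

MAX_SCENARIOS = 500  # cap stated in the run definition


@dataclass(frozen=True)
class TaskEvaluationRun:
    """Hypothetical scope record for one run; all field names are assumptions."""
    site: str
    robot_profile: str   # one robot policy/profile
    task_pack: str
    scenario_count: int
    pass_bar: float      # success rate the robot team needs, as a fraction in (0, 1]

    def __post_init__(self):
        # One run covers a single site/profile/Task Pack combination,
        # so only the scenario count and pass bar need range checks.
        if not 1 <= self.scenario_count <= MAX_SCENARIOS:
            raise ValueError(f"scenario_count must be between 1 and {MAX_SCENARIOS}")
        if not 0.0 < self.pass_bar <= 1.0:
            raise ValueError("pass_bar must be a fraction in (0, 1]")


# Example with made-up values
run = TaskEvaluationRun(
    site="warehouse-a",
    robot_profile="mobile-manipulator-v2",
    task_pack="tote-pick",
    scenario_count=500,
    pass_bar=0.95,
)
```

A request for more than 500 scenarios would raise `ValueError` under this sketch, which mirrors the "up to 500 scenarios" limit.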
What Blueprint sells
Start with a Task Evaluation Run when you need a scoped answer before field time. Add a Policy Improvement Run when the team wants Blueprint to improve a supplied policy inside simulation.
Task Evaluation Run: one site, one robot policy/profile, one Task Pack, and up to 500 scenarios.
Policy Improvement Run: baseline eval, failure diagnosis, twin and cousin scenarios, sim-only curriculum, candidate policy improvement, sealed test, and evidence report.
How it works
Blueprint keeps the workflow compact: one site, one robot profile, one task pack, one integration mode, scored scenarios, and the next proof step.

Use an existing site package, lawful capture, or structured facility request.
Name the robot size, reach, sensors, controller level, task, thresholds, start states, and scenario variations.
Integrate through a policy API, container, private-cloud run, or action trace, or use a site-package-only mode when the team keeps its stack in-house.
Measure success, cycle time, intervention points, safety events, and failure modes from simulator traces, action logs, or review evidence.
Proceed to pilot, request more data, tune on the exported set, or hold until blockers clear.
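The scoring step above could be sketched as a small aggregation over per-scenario results. This is a minimal illustration under stated assumptions: the record fields, function names, and the example pass bar are hypothetical, and the choice of next step stays with the team rather than being computed here.

```python
from collections import Counter
from dataclasses import dataclass
from statistics import median
from typing import Optional


@dataclass
class ScenarioResult:
    """Hypothetical per-scenario record; field names are assumptions."""
    success: bool
    cycle_time_s: float
    interventions: int                  # assist points, kept visible
    safety_events: int
    failure_mode: Optional[str] = None  # label for failed scenarios only


def summarize(results):
    """Aggregate scored scenarios without folding them into one score."""
    if not results:
        raise ValueError("no scenario results to summarize")
    return {
        "success_rate": sum(r.success for r in results) / len(results),
        "median_cycle_time_s": median(r.cycle_time_s for r in results),
        "interventions": sum(r.interventions for r in results),
        "safety_events": sum(r.safety_events for r in results),
        "failure_modes": Counter(r.failure_mode for r in results if r.failure_mode),
    }


def meets_pass_bar(summary, pass_bar):
    """Compare the measured success rate with the team's own pass bar."""
    return summary["success_rate"] >= pass_bar


# Example with made-up results
results = [
    ScenarioResult(True, 10.0, 0, 0),
    ScenarioResult(True, 12.0, 1, 0),
    ScenarioResult(True, 14.0, 0, 0),
    ScenarioResult(False, 20.0, 2, 1, failure_mode="grasp_slip"),
]
summary = summarize(results)
```

Reporting interventions and safety events as separate counts, rather than merging them into the success rate, follows the page's point about keeping assist points visible.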
Planning ranges
Task Evaluation Runs start at $6,500 per run. Policy Improvement Runs start at $35,000 per sim-only run. Site operators can submit sites for free.
From $6,500 / run
Test one robot policy/profile against one real-site Task Pack, up to 500 scenarios.
Request Task Evaluation Run
From $35,000 / run
Improve a customer-supplied policy, adapter, task head, distilled skill, or full policy inside a sim-only run.
Request Policy Improvement
Facility owners can submit or claim a site, define privacy/access boundaries, and review commercial-use terms before anything is shared.
Submit site free
Evidence boundary
A request packet documents one site, robot profile, task pack, thresholds, rights posture, and the proof that is still missing. Evaluation output remains advisory until supported by simulator traces, action logs, robot trials, safety review, and runtime evidence.
First request
Bring one site, task, robot profile, and target threshold. We'll recommend the right evaluation scope, scenario count, and next proof step.