
Real-site robot evaluation

Evaluate robots on real sites before deployment.

Blueprint turns captured facilities into robot task packs so teams can test policies, find failure modes, and prepare for shorter field pilots.

Robot teams pay for evaluations. Site operators can submit sites for free.

Task Evaluation Run

One Task Evaluation Run = 1 site × 1 robot policy/profile × 1 Task Pack × up to 500 scenarios.
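As a rough illustration of that scope, the sketch below models a run as a small record and counts how many runs a test matrix would need. The class name, field names, and the `plan_runs` helper are hypothetical and not part of any Blueprint API; only the one-site, one-policy, one-Task-Pack shape and the 500-scenario cap come from the definition above.

```python
from dataclasses import dataclass
from itertools import product

MAX_SCENARIOS_PER_RUN = 500  # cap taken from the run definition


@dataclass(frozen=True)
class TaskEvaluationRun:
    """Hypothetical record for one run: 1 site x 1 policy/profile x 1 Task Pack."""
    site_id: str
    policy_profile: str
    task_pack_id: str
    scenario_count: int

    def __post_init__(self):
        # Reject runs outside the stated scope instead of silently truncating.
        if not 1 <= self.scenario_count <= MAX_SCENARIOS_PER_RUN:
            raise ValueError(
                f"scenario_count must be between 1 and {MAX_SCENARIOS_PER_RUN}"
            )


def plan_runs(sites, policies, task_packs, scenario_count):
    """Expand a test matrix into one run per (site, policy, Task Pack) combination."""
    return [
        TaskEvaluationRun(site, policy, pack, scenario_count)
        for site, policy, pack in product(sites, policies, task_packs)
    ]
```

Under these assumptions, testing two policies against three Task Packs at one site expands to six separate runs.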

Success rate

Can the robot complete the task often enough?

The evaluation starts with the pass bar the robot team actually needs.

Cycle time

Can it keep up with the site rhythm?

Blueprint frames the task against target timing, bottlenecks, and site drift.

Intervention rate

Where will people still need to step in?

Blueprint keeps assist points visible instead of hiding them behind a score.

Safety threshold

What still needs review before field exposure?

Safety stays scoped to the request and does not become a blanket validation claim.
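To make the four measures concrete, here is a minimal sketch of how they might be summarized from per-scenario results. The record fields and the exact definitions are assumptions for illustration (for example, intervention rate is taken here as the share of scenarios with at least one human assist); Blueprint's actual scoring may differ.

```python
from dataclasses import dataclass


@dataclass
class ScenarioResult:
    """Hypothetical per-scenario outcome record."""
    succeeded: bool
    cycle_time_s: float
    interventions: int   # number of human assists during the scenario
    safety_events: int   # number of events flagged for safety review


def summarize(results):
    """Aggregate scenario results into the four headline measures."""
    n = len(results)
    if n == 0:
        raise ValueError("no scenario results to summarize")
    return {
        "success_rate": sum(r.succeeded for r in results) / n,
        "mean_cycle_time_s": sum(r.cycle_time_s for r in results) / n,
        # Assumed definition: fraction of scenarios needing any assist.
        "intervention_rate": sum(r.interventions > 0 for r in results) / n,
        # Reported as a raw count so flagged events stay visible for review.
        "safety_events": sum(r.safety_events for r in results),
    }
```

Keeping safety events as a count rather than folding them into a single score mirrors the point above that assist points and review items should stay visible.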

What Blueprint sells

Two robot-team products.

Start with a Task Evaluation Run when you need a scoped answer before field time. Add a Post-Training Data Package when the robot team needs curated data to improve the model.

01

Task Evaluation Run

One site, one robot policy/profile, one Task Pack, and up to 500 scenarios.

02

Post-Training Data Package

Curated robot POV clips, scenario labels, synthetic variations, failure cases, and a matched export format for model improvement.

How it works

Turn a real site into an evaluation plan.

Blueprint keeps the workflow compact: one site, one policy, one Task Pack, scored scenarios, and the next proof step.

Illustrative dashboard for site policy evaluation

1

Start with a real site

Use an existing site package, lawful capture, or structured facility request.

2

Define the Task Pack

Set the robot profile, task, thresholds, start states, and scenario variations.

3

Run the policy

Run through a policy API, vendor container, action trace, simulation workflow, or assisted review.

4

Score the scenarios

Measure success, cycle time, intervention points, safety events, and failure modes.

5

Decide the next proof step

Proceed to pilot, request more data, tune on the exported set, or hold until blockers clear.
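The final step can be pictured as a simple rule over the scored results. The sketch below is one possible ordering of the four outcomes, not Blueprint's actual decision logic; the function name, dictionary keys, and threshold names are all hypothetical.

```python
def next_proof_step(summary, thresholds, data_sufficient=True, blockers=()):
    """Map scored results to one of the four next steps (illustrative rules only)."""
    # Open blockers (for example, unresolved safety review items) come first.
    if blockers:
        return "hold until blockers clear"
    # Too little evidence to judge the policy either way.
    if not data_sufficient:
        return "request more data"
    meets_bar = (
        summary["success_rate"] >= thresholds["min_success_rate"]
        and summary["mean_cycle_time_s"] <= thresholds["max_cycle_time_s"]
        and summary["intervention_rate"] <= thresholds["max_intervention_rate"]
    )
    return "proceed to pilot" if meets_bar else "tune on the exported set"


# Example with made-up numbers: the policy misses the success bar.
summary = {"success_rate": 0.82, "mean_cycle_time_s": 41.0, "intervention_rate": 0.10}
thresholds = {"min_success_rate": 0.90, "max_cycle_time_s": 45.0, "max_intervention_rate": 0.15}
print(next_proof_step(summary, thresholds))  # tune on the exported set
```

In practice the thresholds would come from the pass bar the robot team set when defining the Task Pack, and a human reviewer would still make the final call.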

Planning ranges

Two paid robot-team products. Site operators submit free.

Task Evaluation Runs start at $6,500 per run. Post-Training Data Packages start at $25,000. Site operators can submit sites for free.

Open pricing page

Task Evaluation Run

From $6,500 / run

Test one robot policy/profile against one real-site Task Pack, up to 500 scenarios.

Request Task Evaluation Run

Post-Training Data Package

From $25,000

Curated robot POV clips, scenario labels, synthetic variations, failure cases, and matched export format.

Request Data Package

Site operators submit sites free.

Facility owners can submit or claim a site, define privacy/access boundaries, and review commercial-use terms before anything is shared.

Submit site free

Evidence boundary

Public examples show the workflow shape.

A request packet documents one site, robot profile, Task Pack, thresholds, and rights posture, and lists the proof that is still missing. Evaluation output remains advisory until supported by simulator traces, action logs, robot trials, safety review, and runtime evidence.

First request

Have one real site or task to evaluate?

Bring one site, task, robot profile, and target threshold. We'll recommend the right evaluation scope, scenario count, and next proof step.