Real-site robot evaluation dataset and workflow

Real-site robot eval cards before the pilot.

Capture a real site. Turn it into Site, Task, Scenario, and Eval Cards. Test the task before teams spend months on-site.

Success rate

Can the robot complete the task often enough?

The readiness scope starts with the pass bar the buyer actually needs.

Cycle time

Can it keep up with the site rhythm?

Blueprint frames the task against target timing, bottlenecks, and site drift.

Intervention rate

Where will people still need to step in?

The report names likely assist points instead of hiding them behind a score.

Safety threshold

What still needs review before field exposure?

Safety stays scoped to the request and does not become a blanket validation claim.

What Blueprint sells

The card family for one real robot task.

Blueprint turns capture and pipeline evidence into an inspectable dataset and workflow: what the site is, what task matters, what scenarios can break, and what proof is still missing.

01

Site Card

Site type, geometry, visual and dynamic conditions, safety constraints, robot metadata, provenance, rights, and review status.

02

Task Cards

Task statement, start state, success and failure definitions, required metrics, evidence source, and confidence.

03

Scenario Cards

Normal scenario, variation, edge case, known risk, observed-vs-inferred labels, and missing annotations.

04

Eval Cards

Robot or policy tested, engine used, predicted results, failure modes, uncertainty, validation status, and blocked upgrades.
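One way to picture the card family is as a small set of linked records. The sketch below is illustrative only: the class names, field names, and sample values are assumptions drawn from the card descriptions above, not Blueprint's actual schema, and each card carries only a subset of the listed fields.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SiteCard:
    site_type: str
    safety_constraints: List[str]
    provenance: str
    rights: str
    review_status: str


@dataclass
class TaskCard:
    task_statement: str
    start_state: str
    success_definition: str
    failure_definition: str
    required_metrics: List[str]


@dataclass
class ScenarioCard:
    name: str
    kind: str       # assumed labels: "normal", "variation", "edge_case", "known_risk"
    observed: bool  # True if observed on site, False if inferred
    missing_annotations: List[str] = field(default_factory=list)


@dataclass
class EvalCard:
    robot_or_policy: str
    engine: str
    failure_modes: List[str]
    validation_status: str
    blocked_upgrades: List[str] = field(default_factory=list)


@dataclass
class CardFamily:
    site: SiteCard
    tasks: List[TaskCard]
    scenarios: List[ScenarioCard]
    evals: List[EvalCard]

    def annotation_backlog(self) -> List[str]:
        """Collect every missing annotation across the scenario cards."""
        return [a for s in self.scenarios for a in s.missing_annotations]


# Hypothetical example values, for illustration only.
family = CardFamily(
    site=SiteCard("warehouse", ["shared pedestrian aisle"], "on-site capture",
                  "pending", "in review"),
    tasks=[TaskCard("Move tote from shelf to cart", "tote on shelf",
                    "tote placed in cart", "tote dropped or left on shelf",
                    ["success_rate", "cycle_time"])],
    scenarios=[
        ScenarioCard("clear aisle", "normal", observed=True),
        ScenarioCard("pallet blocks aisle", "edge_case", observed=False,
                     missing_annotations=["pallet pose"]),
    ],
    evals=[],
)
print(family.annotation_backlog())
```

Keeping the observed flag and the missing annotations on each scenario makes the gaps inspectable rather than folded into a single score.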

How it works

Turn pilot risk into an inspectable card workflow.

Blueprint keeps the workflow compact: real site, robot task, scenario variation, eval status, missing annotations, and the next proof step.

1

Capture a real site

Start from a lawful capture, existing site package, or structured request for the facility in question.

2

Turn it into eval cards

Build the Site Card, Task Cards, Scenario Cards, Eval Cards, annotation backlog, and proof boundaries.

3

Define the robot task and pass bar

Name the robot profile, task suite, success rate, cycle time, intervention rate, and safety threshold.

4

Test what breaks

Call out failure modes, site modifications, data needs, recapture needs, and proof that remains blocked.

5

Decide the next step

Proceed to a short pilot protocol, change the site, gather more evidence, compare vendors, or hold.
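Steps 3 and 4 can be sketched as a comparison between a pass bar and the results observed or predicted for one task. This is a minimal sketch under stated assumptions: the names `PassBar`, `Observed`, and `missed_thresholds` are hypothetical, the numbers are made up, and the safety threshold is modeled as a list of open review items rather than a single number.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PassBar:
    min_success_rate: float       # fraction of trials, 0..1
    max_cycle_time_s: float       # seconds per task cycle
    max_intervention_rate: float  # fraction of trials needing a human assist


@dataclass
class Observed:
    success_rate: float
    cycle_time_s: float
    intervention_rate: float
    open_safety_items: List[str] = field(default_factory=list)


def missed_thresholds(bar: PassBar, obs: Observed) -> List[str]:
    """Name each threshold the result misses instead of returning one score."""
    missed = []
    if obs.success_rate < bar.min_success_rate:
        missed.append("success_rate")
    if obs.cycle_time_s > bar.max_cycle_time_s:
        missed.append("cycle_time")
    if obs.intervention_rate > bar.max_intervention_rate:
        missed.append("intervention_rate")
    if obs.open_safety_items:
        missed.append("safety_review")
    return missed


# Hypothetical numbers, for illustration only.
bar = PassBar(min_success_rate=0.95, max_cycle_time_s=45.0,
              max_intervention_rate=0.05)
obs = Observed(success_rate=0.97, cycle_time_s=52.0, intervention_rate=0.03,
               open_safety_items=["pedestrian crossing not reviewed"])
print(missed_thresholds(bar, obs))  # ['cycle_time', 'safety_review']
```

An empty list would point toward a short pilot protocol. A non-empty list names what to change, gather, or recapture before deciding the next step.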

Planning ranges

Three ways to start.

Pricing is intentionally simple. Public ranges help a buyer pick a path; live availability, rights, payment, and fulfillment are confirmed per request.

Site/Task Readiness Review

$2,100 - $3,400

One site, one task suite, one robot profile, one threshold set, and a pre-pilot recommendation.

Hosted Evaluation

$16 - $29 / session-hour

Managed browser review, reruns, observations, export framing, and a direct buyer room when available.

Custom Multi-Site Benchmark

$50,000+ scoped

Private capture planning, vendor-neutral benchmark design, custom data package, and operator boundaries.

Proof boundary

Public card samples show the workflow. Request packets prove one site.

Blueprint can look ready and polished without claiming that a robot has passed deployment, safety, payment, provider, rights, or hosted-session checks. Those checks still need proof from the systems that own them.

Sample

Public card samples

Stronger proof

Request packets

Samples show the Site/Task/Scenario/Eval Card workflow. Request packets prove one site with provenance, rights, thresholds, and gaps attached.

Sample

Generated or model-derived visuals

Stronger proof

Owner-system evidence

Generated outputs can support review, but stronger claims must rest on simulator traces, action logs, robot trials, safety review, rights proof, and runtime artifacts.

Sample

Readiness advisory

Stronger proof

Operational readiness

Eval cards stay advisory until the missing proof exists for that exact site, robot, task, and threshold set.
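The advisory rule can be sketched as a set difference between the proof a stronger claim needs and the proof on file. This is a sketch under assumptions: the identifiers and the status label `eligible_for_stronger_claim` are hypothetical, and the proof set is assumed to belong to one exact site, robot, task, and threshold set.

```python
# Proof types named in the proof boundary above; the identifiers are assumed.
REQUIRED_PROOF = frozenset({
    "simulator_traces",
    "action_logs",
    "robot_trials",
    "safety_review",
    "rights_proof",
    "runtime_artifacts",
})


def readiness_claim(proof_on_file):
    """Return (status, missing proof) for one site/robot/task/threshold scope."""
    missing = sorted(REQUIRED_PROOF - set(proof_on_file))
    status = "advisory" if missing else "eligible_for_stronger_claim"
    return status, missing


status, missing = readiness_claim({"simulator_traces", "action_logs"})
print(status)   # advisory
print(missing)  # ['rights_proof', 'robot_trials', 'runtime_artifacts', 'safety_review']
```

Returning the missing items alongside the status keeps the gap visible, which matches the idea that an eval card names what is still blocked.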

First request

Ask for one real-site eval dataset.

Bring the facility, task, robot profile, target thresholds, timeline, and proof you already have. Blueprint routes the request to one of four next steps: an eval-card packet, a hosted evaluation, a capture request, or a proof blocker.

Requests do not grant package access, rights clearance, payment, fulfillment, or hosted-session availability by themselves.

Readiness remains advisory until simulator traces, action logs, robot trials, safety review, rights proof, and runtime proof support a stronger claim.

Generated imagery on the public site is illustrative, not customer or robot-trial proof.

Public Launch Ready copy is allowed. Operational Launch Ready claims still require proof from the system that owns them.