Robot-team eval

Robot-team test interface

Submit a policy endpoint, container, action trace, skill trace, teleop demo, or sim controller as artifact references against one capture-backed site package.

Proof boundary

Blueprint can evaluate submitted references against real-site packages when source evidence exists. A submission is advisory: it does not prove deployment readiness, safety validation, real robot execution, simulator run completion, or rights clearance, and it does not guarantee that any threshold is met.

Artifact refs first | Missing proof tracked | Hosted-session policy | Pipeline schema aligned

Site package

Choose the real-site package and task context

The submission is attached to a siteWorldId, taskId, scenarioId, startStateId, robotProfileId, and requested output list.
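As a rough illustration of the context a submission carries, here is a minimal sketch in Python. The field names come from the sentence above. The class name `SubmissionContext`, the helper `missing_context_fields`, and all example values are hypothetical and are not part of any documented Blueprint schema.

```python
from dataclasses import dataclass, field, fields
from typing import List


@dataclass
class SubmissionContext:
    """Hypothetical container for the identifiers a submission is attached to."""
    siteWorldId: str = ""
    taskId: str = ""
    scenarioId: str = ""
    startStateId: str = ""
    robotProfileId: str = ""
    requestedOutputs: List[str] = field(default_factory=list)


def missing_context_fields(ctx: SubmissionContext) -> List[str]:
    """Return the names of fields that are still empty, in declaration order."""
    return [f.name for f in fields(ctx) if not getattr(ctx, f.name)]


# Placeholder values for illustration only; real identifiers would come
# from the selected site package.
ctx = SubmissionContext(
    siteWorldId="site-world-example",
    taskId="task-example",
    startStateId="start-state-example",
    robotProfileId="mobile-manipulator-example",
    requestedOutputs=["example_output"],
)
print(missing_context_fields(ctx))  # ['scenarioId']
```

A check like this only confirms that each identifier is present. It says nothing about whether the referenced package or task actually exists.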

Selected robot

Mobile manipulator

Bounded robot action vector for hosted rollout execution.

Modality 1

Policy API endpoint

Reference a callable policy service without pasting secrets into Blueprint.

missing required refs: needs_policy_api_endpoint_ref

Modality 2

Docker container

Reference a reproducible policy container and its runtime contract.

not selected

Modality 3

Recorded action traces

Reference offline action traces aligned to Blueprint tasks and observations.

not selected

Modality 4

High-level skill traces

Reference ordered skill-level plans with failure and coverage labels.

not selected

Modality 5

Teleop demos

Reference operator demonstrations and controls with rights/privacy scope.

not selected

Modality 6

Sim controller plugin

Reference a simulator control plugin without claiming the sim has run.

not selected
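The per-modality statuses above can be sketched as a small function. This is a guess at the logic, not Blueprint's implementation. Only the labels "not selected" and "missing required refs" and the blocker code `needs_policy_api_endpoint_ref` appear on the page. The other modality keys, the `needs_<modality>_ref` naming pattern, and the "refs provided" label are assumptions made for illustration.

```python
from typing import Dict, Optional, Set, Tuple

# Hypothetical modality keys. Only "policy_api_endpoint" is implied by the
# blocker code shown on the page; the rest are assumed names.
MODALITIES = [
    "policy_api_endpoint",
    "docker_container",
    "recorded_action_traces",
    "skill_traces",
    "teleop_demos",
    "sim_controller_plugin",
]


def modality_status(
    modality: str,
    selected: Set[str],
    refs: Dict[str, str],
) -> Tuple[str, Optional[str]]:
    """Return (status label, blocker code or None) for one modality."""
    if modality not in selected:
        return "not selected", None
    if not refs.get(modality):
        # Assumed pattern, generalized from needs_policy_api_endpoint_ref.
        return "missing required refs", f"needs_{modality}_ref"
    # "refs provided" is a placeholder label, not taken from the page.
    return "refs provided", None


# Reproduce the state shown above: modality 1 selected, no reference supplied.
selected = {"policy_api_endpoint"}
for name in MODALITIES:
    print(name, modality_status(name, selected, refs={}))
```

Even a "refs provided" result would only mean a reference was supplied. In keeping with the proof boundary, it would not show that the referenced artifact ran or passed anything.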