Test robot policies before field time.

Use captured real-site tasks to see what works.

[Image: realistic humanoid robot moving a tote in a captured facility task]

Policy score

Winner
Policy B: 82%
Policy A: 61%
Policy C: 32%
1. Capture site

A real task pack.

2. Run policies

100 or 500 episodes.

3. Pick winner

Know what to test next.

Same task. Same robot. Clear winner.

Compare 1-3 policies on one captured task pack before using robot time.

100 episodes | 500 episodes | 1-3 policies

Failure

What broke?

Out-of-distribution (OOD)

What changed?

Next test

Where to try?

See the clips.

Generated clips help you review results. They are not proof of real-world performance.

Evaluate

Boundary: virtual results guide what to test next. They do not approve deployment, certify safety, or guarantee real-world success.