Benchmarking world models and embodied agents in closed-loop interactive environments
Static Benchmark Problems
Visual Fidelity Focus
Isolated Task Evaluation
Closed-Loop Complexity
Multi-Modal Integration
Temporal Dependencies