Multi-Agent

A Physics-Grounded Benchmark for Multi-Agent Dynamics in World Models

CrashTwin is a physics-grounded benchmark that stress-tests whether generative world models obey physical laws in safety-critical multi-agent collisions, exposing physical violations hidden behind high perceptual quality.