Physics-IQ

Creator
Seonglae Cho
Created
2026 Mar 24 13:47
Edited
2026 Mar 24 13:54
The Physics-IQ study found no statistically significant correlation between visual realism and physical understanding.
While existing benchmarks relied on synthetic data and suffered from real-vs-synthetic distribution shift, Physics-IQ uses real, high-quality filmed videos. The dataset consists of 66 scenarios, each captured from 3 camera viewpoints and filmed twice, for a total of 396 videos (66 × 3 × 2) at 3840x2160 and 30 FPS.
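Spatiotemporal IoU takes the time axis into account: it compares motion masks frame by frame, so it captures when motion happened as well as where.
Weighted Spatial IoU measures how much motion occurred at each location, weighting locations by their amount of motion rather than treating them as binary.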
The Physics-IQ score combines four metrics (Spatial IoU and MSE in addition to the two above) and is normalized so that 100% corresponds to physical variance, the level of agreement between two recordings of the same scenario.
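To make the difference between the two mask-based metrics described above concrete, here is a minimal sketch in plain Python. It is not the benchmark's reference implementation: the function names, the nested-list mask layout, and the min/max formulation of the weighted variant are assumptions chosen for illustration.

```python
from itertools import chain

# Assumed layout: video[frame][row][col], 1 where motion was detected, else 0.
Mask = list[list[list[int]]]


def spatiotemporal_iou(real: Mask, generated: Mask) -> float:
    """IoU over every (frame, row, col) cell, so mistimed motion is penalized."""
    inter = union = 0
    for real_frame, gen_frame in zip(real, generated):
        cells = zip(chain.from_iterable(real_frame), chain.from_iterable(gen_frame))
        for r, g in cells:
            inter += 1 if (r and g) else 0
            union += 1 if (r or g) else 0
    return inter / union if union else 1.0


def motion_heatmap(video: Mask) -> list[list[float]]:
    """Collapse the time axis: fraction of frames in which each pixel moved."""
    n = len(video)
    rows, cols = len(video[0]), len(video[0][0])
    return [[sum(f[y][x] for f in video) / n for x in range(cols)]
            for y in range(rows)]


def weighted_spatial_iou(real: Mask, generated: Mask) -> float:
    """Soft IoU on heatmaps (assumed form): sum of minima over sum of maxima."""
    a = list(chain.from_iterable(motion_heatmap(real)))
    b = list(chain.from_iterable(motion_heatmap(generated)))
    num = sum(min(x, y) for x, y in zip(a, b))
    den = sum(max(x, y) for x, y in zip(a, b))
    return num / den if den else 1.0


# Two frames of a 1x2 image: the same pixels move, but in swapped order.
real = [[[1, 0]], [[0, 1]]]
generated = [[[0, 1]], [[1, 0]]]
print(spatiotemporal_iou(real, generated))    # timing is wrong everywhere
print(weighted_spatial_iou(real, generated))  # amount per location matches
```

In this toy case the generated clip moves the right pixels by the right amount but at the wrong times, so the spatiotemporal score drops to zero while the weighted spatial score stays at its maximum.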
 
 
Do generative video models understand physical principles?
AI video generation is undergoing a revolution, with quality and realism advancing rapidly. These advances have led to a passionate scientific debate: Do video models learn "world models" that...
 
 
