Joint Embedding Predictive Architecture
JEPA Types
Making representations for latent planning where state-change trajectories in representation space are less curved makes planning much easier. Existing pretrained visual encoders have highly curved latent trajectories, meaning that Euclidean distance in latent space poorly reflects actual reachability difficulty or geodesic distance, and gradient-based planning also fails. Therefore, the authors propose temporal straightening: Reducing Curvature
arxiv.org
https://arxiv.org/pdf/2603.12231

Seonglae Cho