AI Spatial Memory

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2026 Jan 7 12:32
Editor
Edited
Edited
2026 Mar 23 0:53

AI 3D Memory

AI Spatial Memory Methods
 
 
 

Spatial understanding

VoxRep: Enhancing 3D Spatial Understanding in 2D Vision-Language Models | Menlo Research
VoxRep is a novel framework that enables standard Vision-Language Models to interpret structured 3D voxel data, facilitating richer spatial comprehension vital for advanced AI systems.
VoxRep: Enhancing 3D Spatial Understanding in 2D Vision-Language Models | Menlo Research
VLMs excel at semantic understanding but struggle with spatial relationships. Current VLMs have a real spatial blindspot, caused by encoder design and 1D alignment methods. To fix this, spatial grounding itself must be treated as a separate design axis, not just semantic capability.
arxiv.org
 
 

Recommendations