SAGE

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2026 Mar 8 15:51
Editor
Edited
Edited
2026 Mar 8 16:3
Refs
Refs
  • ground-event → Find specific event timestamps (approximate video search)
  • extract-video-parts → Extract frame clips from relevant segments
  • analyze → Analyze frames with VLM
  • transcribe-speech → ASR
  • web-search → External knowledge
In essence, the actual operation is largely iterative video retrieval + reasoning agent.
 
 
SAGE: Smart Any-Horizon Agents
"What are the technical challenges toward effectively training video reasoning models under the AGENT paradigm with Reinforcement Learning?"
SAGE - a allenai Collection
Smart Any-Horizon Agent for Long Video Reasoning
SAGE - a allenai Collection
 

Recommendations