molmo2
Molmo 2: State-of-the-art video understanding, pointing, and tracking | Ai2
Molmo 2, a new suite of state-of-the-art vision-language models with open weights, training data, and training code, can analyze videos and multiple images at once.
https://allenai.org/blog/molmo2


Seonglae Cho