LLaVA-NeXT: Improved reasoning, OCR, and world knowledge
LLaVA team presents LLaVA-NeXT, with improved reasoning, OCR, and world knowledge. LLaVA-NeXT even exceeds Gemini Pro on several benchmarks.
https://llava-vl.github.io/blog/2024-01-30-llava-next/