SciArena: A New Platform for Evaluating Foundation Models in Scientific Literature Tasks | Ai2
Discover how SciArena is used to evaluate foundation models’ capabilities in scientific literature tasks through community-driven, literature-grounded, and multidisciplinary reasoning.
https://allenai.org/blog/sciarena