Factuality benchmarksreference-free factuality benchmarkreference-based factuality benchmarkHallucination BenchmarksOpenAI SimpleQAPhare benchmarkFEVERFACTS GroundingLongFactFActScore several typesarxiv.orghttps://arxiv.org/pdf/2410.22071vectara/hallucination_evaluation_model · Hugging FaceWe’re on a journey to advance and democratize artificial intelligence through open source and open science.https://huggingface.co/vectara/hallucination_evaluation_modelLeaderboardhuggingface.cohttps://huggingface.co/spaces/vectara/Hallucination-evaluation-leaderboardSTS model to judgedleemiller/ModernCE-base-sts · Hugging FaceWe’re on a journey to advance and democratize artificial intelligence through open source and open science.https://huggingface.co/dleemiller/ModernCE-base-stscross-encoder/stsb-roberta-large · Hugging FaceWe’re on a journey to advance and democratize artificial intelligence through open source and open science.https://huggingface.co/cross-encoder/stsb-roberta-large