finding multiple relevant passages and step-by-step reasoning to answer complex questions.
Multi-hop QA Models
Multi-hop QA Datasets
There is moderate evidence of the second-hop reasoning, which does not become stronger with increasing model size.
Seonglae Cho
Seonglae Cho