AsCL: An Asymmetry-sensitive Contrastive Learning Method for...
The image-text retrieval task aims to retrieve relevant information from a given image or text. The main challenge is to unify multimodal representation and distinguish fine-grained differences...
https://arxiv.org/abs/2405.10029