Task Vectors are Cross-Modal

Task representations in VLMs are consistent across modality (text, image) and specification (example, instruction).

https://task-vectors-are-cross-modal.github.io/