Language Is Not All You Need

Creator: Seonglae Cho
Created: 2023 Apr 9 5:13
Editor: Seonglae Cho
Edited: 2024 May 29 3:44
Refs
KOSMOS
Uses a cross-modal embedding method that can process images and text at the same time.
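
As a rough illustration of what processing images and text in one embedding space can look like, the sketch below maps text tokens and image patch features to vectors of the same width and flattens them into a single sequence, with the image span wrapped in `<image>` and `</image>` marker tokens. This is a toy sketch under stated assumptions, not the KOSMOS implementation: the names `EMBED_DIM`, `embed_text`, `project_image`, and `build_sequence`, the random embedding table, and the linear projection are all hypothetical stand-ins for a learned text embedding and a vision encoder.

```python
import random

EMBED_DIM = 4  # hypothetical shared embedding width (assumption)


def embed_text(tokens, table):
    """Look up one vector per text token in a toy embedding table."""
    return [table[token] for token in tokens]


def project_image(patch_features, weights):
    """Linearly project each image patch feature into the shared space.

    `weights` holds EMBED_DIM columns, each as long as one patch feature.
    """
    return [
        [sum(f * w for f, w in zip(patch, column)) for column in weights]
        for patch in patch_features
    ]


def build_sequence(segments, table, weights):
    """Flatten interleaved text/image segments into one list of vectors."""
    sequence = []
    for kind, payload in segments:
        if kind == "text":
            sequence.extend(embed_text(payload, table))
        elif kind == "image":
            # Marker tokens delimit the image span inside the sequence.
            sequence.extend(embed_text(["<image>"], table))
            sequence.extend(project_image(payload, weights))
            sequence.extend(embed_text(["</image>"], table))
        else:
            raise ValueError(f"unknown segment kind: {kind}")
    return sequence


# Toy setup: random values stand in for learned parameters (assumption).
rng = random.Random(0)
vocab = ["a", "photo", "of", "cat", "<image>", "</image>"]
table = {tok: [rng.uniform(-1, 1) for _ in range(EMBED_DIM)] for tok in vocab}
weights = [[rng.uniform(-1, 1) for _ in range(3)] for _ in range(EMBED_DIM)]

segments = [
    ("text", ["a", "photo", "of"]),
    ("image", [[0.1, 0.2, 0.3], [0.4, 0.5, 0.6]]),  # two 3-dim patch features
    ("text", ["cat"]),
]
sequence = build_sequence(segments, table, weights)
```

Every element of `sequence` has the same width, so a single sequence model could consume text and image positions alike; how the real model encodes images and trains these parameters is described in the paper linked below.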
Language Is Not All You Need: Aligning Perception with Language Models
https://arxiv.org/abs/2302.14045 (PDF: https://arxiv.org/pdf/2302.14045.pdf)
A big convergence of language, multimodal perception, action, and world modeling is a key step toward artificial general intelligence. In this work, we introduce Kosmos-1, a Multimodal Large...
 
 

Copyright Seonglae Cho