Chameleon is a model based on tokens which can interpret and produce text and images in any order. Its entirely token-based structure allows for smooth data integration across different modalities. By turning images into distinct tokens and training from the ground up on mixed-modal data, Chameleon is able to collaboratively process both image and text in a completely novel manner.
Chameleon - a facebook Collection
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
https://huggingface.co/collections/facebook/chameleon-668da9663f80d483b4c61f58

Seonglae Cho