šØ šØ Excited to share our latest paper, now on #arXiv!
š¼ļø We studied how unified VLMs, trained to generate both text and images (e.g., Meta's Chameleon), exchange information between modalities, comparing them to standard VLMs.
š Paper: arxiv.org/abs/2412.06646
Deep dive: š