Visualizing Embedding Alignment: UMAP Dimensionality Reduction in MindEye2

As discussed in Scotti et al. (2023), UMAP (McInnes et al., 2020) plots of disjointed CLIP fMRI embeddings shown next to aligned CLIP fMRI embeddings visualize how the diffusion prior addresses the problem of disjointed embedding spaces. In theory, multimodal contrastive learning will always produce disjointed embeddings because of the “modality gap” phenomenon, whereby encoding modalities into a shared space restricts the effective embedding space to a narrow cone in geometric space (Liang et al., 2022).
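As a numerical complement to the UMAP plots, the gap between two embedding sets can be summarized by the distance between their centroids after L2 normalization. The sketch below is only an illustration under stated assumptions: the data are synthetic stand-ins rather than real CLIP or fMRI embeddings, the names `modality_gap`, `image_emb`, `disjoint_emb`, and `aligned_emb` are hypothetical, and the centroid distance is one simple way to quantify the gap, not necessarily the measure used by the authors.

```python
import numpy as np


def l2_normalize(x):
    """Scale each row to unit length, as is typical for CLIP-style embeddings."""
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def modality_gap(a, b):
    """Euclidean distance between the centroids of two L2-normalized embedding sets."""
    return float(np.linalg.norm(l2_normalize(a).mean(axis=0) - l2_normalize(b).mean(axis=0)))


rng = np.random.default_rng(0)
n, d = 500, 64  # hypothetical sample count and embedding size

# Synthetic stand-ins (assumption): each "modality" occupies a cone around its own axis.
image_emb = rng.normal(size=(n, d)) + 3.0 * np.eye(d)[0]
disjoint_emb = rng.normal(size=(n, d)) + 3.0 * np.eye(d)[1]

# "Aligned" embeddings are modeled here as the image embeddings plus small noise,
# standing in for the output of an alignment step such as a diffusion prior.
aligned_emb = image_emb + 0.1 * rng.normal(size=(n, d))

gap_disjoint = modality_gap(image_emb, disjoint_emb)
gap_aligned = modality_gap(image_emb, aligned_emb)

print(f"gap before alignment: {gap_disjoint:.3f}")
print(f"gap after alignment:  {gap_aligned:.3f}")
```

In this toy setup the centroid distance shrinks sharply once the second set is aligned to the first, which is the same qualitative pattern the UMAP plots are meant to show.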

This paper is available on arXiv under the CC BY 4.0 DEED license.

Authors:

(1) Paul S. Scotti, Stability AI and Medical AI Research Center (MedARC);

(2) Mihir Tripathy, Medical AI Research Center (MedARC), core contribution;

(3) Cesar Kadir Torrico Villanueva, Medical AI Research Center (MedARC), core contribution;

(4) Reese Kneeland, University of Minnesota, core contribution;

(5) Tong Chen, The University of Sydney and Medical AI Research Center (MedARC);

(6) Ashutosh Narang, Medical AI Research Center (MedARC);

(7) Charan Santhirasegaran, Medical AI Research Center (MedARC);

(8) Jonathan Xu, University of Waterloo and Medical AI Research Center (MedARC);

(9) Thomas Naselaris, University of Minnesota;

(10) Kenneth A. Norman, Princeton Neuroscience Institute;

(11) Tanishq Mathew Abraham, Stability AI and Medical AI Research Center (MedARC).
