Jonathan R. Williford, PhD

AI

CLIP, LLaVA, and the Brain

Deep Learning and the Brain Insights into Multimodal Transformers from Neuroscience Jonathan R. Williford, PhD · Follow Published in Towards Data Science · 8 min read · 5 days ago — Image generated by the author using Dall-E 3. How do recent multimodal transformer networks, like CLIP (Radford et al. 2021) and LLaVA (Liu et al. 2023), compare to the brain? Are there similarities between the attention in these networks and the brain? In this

Read More »