Notes on LLaVA-Gemma: Accelerating Multimodal Foundation Models With a Compact Language Model

Disclaimer: This is part of my notes on AI research papers. I write them to learn and to communicate what I understand. Feel free to comment if you have any suggestions; they would be very much appreciated. The following post is a commentary on the paper LLaVA-Gemma: Accelerating Multimodal Foundation Models With a Compact Language Model by Musashi Hinck, Matthew L. Olson, David Cobbley, Shao-Yen Tseng, and Vasudev Lal. Hinck et al. ...

April 11, 11113 · 3 min · Àlex Pujol Vidal