No “Zero-Shot” Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
- "Multimodal models require exponentially more data to achieve linear improvements in downstream zero-shot performance"
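
The quoted claim describes a log-linear trend: performance grows roughly linearly in the logarithm of how often a concept appears in the pretraining data. A minimal sketch of what that implies is given here. The function names, the intercept, and the slope are hypothetical values chosen only for illustration; they are not fitted numbers from the paper.

```python
import math


def accuracy_from_frequency(freq, intercept=0.0, slope=0.1):
    """Hypothetical log-linear trend: accuracy is linear in log10(freq).

    `intercept` and `slope` are illustrative assumptions, not fitted values.
    """
    return intercept + slope * math.log10(freq)


def data_multiplier_for_gain(gain, slope=0.1):
    """Factor by which concept frequency must grow to add `gain` accuracy."""
    return 10 ** (gain / slope)


# Concept frequencies spaced by a constant factor of 10 (exponential growth).
freqs = [10 ** k for k in range(2, 7)]

# Under the assumed trend, accuracy rises by a constant step per 10x of data.
accs = [accuracy_from_frequency(f) for f in freqs]

for f, a in zip(freqs, accs):
    print(f"frequency={f:>9,d}  accuracy={a:.2f}")
```

Under these assumed coefficients, each equal step in accuracy costs a constant multiplicative factor in concept frequency, which is the sense in which linear improvements require exponentially more data.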