Check out our findings on Visual Instruction Fine-tuning: COCO is “ALL” You Need for Visual Instruction Fine-tuning