Stitching Vision Encoders into LLMs: CLIP vs. I-JEPA vs. ViT Comparison | Heykuki News