Multi-Modal Retrieval: GPT text embedding and CLIP image embedding | Heykuki News