Train a multi-modal chatbot with visual and language instructions | Heykuki News