Show HN: Jlama – A fast Java inference engine for GPT and Llama models | Heykuki News