CUDA/Metal accelerated language model inference | Heykuki News