Inference at the edge: Efficient transformer model inference on-device | Heykuki News