Optimizing Retrieval Inference for scale and performance | Heykuki News