Sparse LLM Inference on CPU: 75% fewer parameters | Heykuki News