Speculative decoding of llama2 models in pure C | Heykuki News