Quantized Llama models with increased speed and a reduced memory footprint

Heykuki News

508 points

2 years ago

122 comments

Quantized Llama models with increased speed and a reduced memory footprint | Heykuki News