Code for the Byte Pair Encoding algorithm, commonly used in LLM tokenization | Heykuki News