Fast `StatsBase.countmap` for small types on the GPU via CUDA.jl | Heykuki News