Show HN: Built a 9MB GPU kernel achieving 43M ops/SEC with deterministic replay | Heykuki News