Open Source Models Score Low on ARC-AGI-2 Reasoning Benchmark | Heykuki News