Show HN: Aion-Torch – Adaptive residual scaling for deep Transformers | Heykuki News