SSE code 6 times slower without VZEROUPPER on Skylake (2016) | Heykuki News