Heykuki News

TopNewBestAskShowJobs
TopNewBestAskShowJobs
1.
Vectorization, dependencies and outer loop vectorization (johnysswlab.com)
56 points
ingve
4 years ago
6 comments
2.
Instruction-level parallelism: speeding up memory-bound programs with low ILP (johnysswlab.com)
5 points
ingve
4 years ago
discuss
3.
The price of dynamic memory: Allocation (johnysswlab.com)
3 points
signa11
5 years ago
discuss
4.
Long instruction dependency chains and performance (johnysswlab.com)
2 points
ingve
4 years ago
discuss
5.
The memory subsystem from the viewpoint of software performance (part 1/3) (johnysswlab.com)
1 point
signa11
4 years ago
discuss
6.
The memory subsystem from the viewpoint of software (johnysswlab.com)
1 point
ingve
4 years ago
discuss
7.
The price of dynamic memory: Memory Access (johnysswlab.com)
1 point
wheresvic5
4 years ago
discuss
8.
Memory consumption, dataset size and performance: how does it all relate? (johnysswlab.com)
1 point
ingve
4 years ago
discuss
9.
Make your programs run faster by better using the data cache (2020) (johnnysswlab.com)
143 points
eatonphil
3 years ago
59 comments
10.
Horrible Code, Clean Performance (johnnysswlab.com)
121 points
signa11
3 years ago
114 comments
11.
The messy reality of SIMD (vector) functions (johnnysswlab.com)
120 points
mfiguiere
a year ago
81 comments
12.
Decreasing the number of memory accesses (johnnysswlab.com)
90 points
g0xA52A2A
3 years ago
52 comments
13.
Unexpected ways memory subsystem interacts with branch prediction (johnnysswlab.com)
89 points
r4um
2 years ago
27 comments
14.
Memory Subsystem Optimizations (johnnysswlab.com)
48 points
mfiguiere
5 months ago
13 comments
15.
Growing Buffers to Avoid Copying Data (johnnysswlab.com)
42 points
ingve
a year ago
29 comments
16.
Link Time Optimizations: New Way to Do Compiler Optimizations (johnnysswlab.com)
39 points
signa11
a year ago
33 comments
17.
A story of a large loop with a long instruction dependency chain (johnnysswlab.com)
39 points
signa11
2 years ago
4 comments
18.
Avoiding register spills in vectorized code with many constants (johnnysswlab.com)
38 points
ingve
2 years ago
14 comments
19.
An optimizing compiler doesn't help much with long instruction dependencies (johnnysswlab.com)
35 points
ingve
a year ago
6 comments
20.
Performance Debugging with LLVM-mca: Simulating the CPU (johnnysswlab.com)
33 points
signa11
a year ago
14 comments
21.
The messy reality of SIMD (vector) functions (johnnysswlab.com)
30 points
ingve
a year ago
1 comment
22.
Loop Optimizations: taking matters into your hands (2021) (johnnysswlab.com)
20 points
mooreds
4 years ago
6 comments
23.
CPU Dispatching: Make your code both portable and fast (2020) (johnnysswlab.com)
11 points
lalaland1125
2 years ago
1 comment
24.
Deep Dive in Java vs. C++ Performance (johnnysswlab.com)
4 points
ingve
6 months ago
discuss
25.
Faster hash maps, binary trees etc. through data layout modification (johnnysswlab.com)
4 points
ingve
3 years ago
discuss
26.
Frugal Programming: Saving Memory Subsystem Bandwidth (johnnysswlab.com)
4 points
bubblehack3r
3 years ago
discuss
27.
Exposing More Parallelism Is the Reason Why Some Vectorized Loops Are Faster (johnnysswlab.com)
3 points
ingve
3 months ago
discuss
28.
Floating-Point Error Handling in C++: What Works (johnnysswlab.com)
3 points
ingve
4 months ago
discuss
29.
The price of dynamic memory: Memory Access (2020) (johnnysswlab.com)
3 points
signa11
7 months ago
discuss
30.
Things Every Fresh Graduate Should Know About Software Performance (johnnysswlab.com)
3 points
Bogdanp
9 months ago
discuss
More