Loop Fusion
Loop overhead reduced
Better instruction overlap
Lower cache misses
Be aware of associativity issues with array’s mapping to the same cache line.
Previous slide
Next slide
Back to first slide
View graphic version