Stride Minimization
We must think about spatial locality.
Effective usage of the cache provides us with the best possibility for a performance gain.
Recently accessed data are likely to be faster to access.
Tune your algorithm to minimize stride, innermost index changes fastest.