I saw a question on stackoverflow.
http://stackoverflow.com/questions/3804 ... rix-values
It shows that in a vector-matrix multiplication, gemv/gemm runs much faster when the vector contains a lot of 0s, and the result is correct.
Until now I can only reproduce this with the package libblas-dev on Ubuntu 14.04.
Could you help explain why?