OpenMP Performance Tuning
Fix false sharing
Multiple threads writing to the same cache line
Increase chunk size
Tune schedule
Reduce barriers
SPMD Vs. Loop Level
Previous slide
Next slide
Back to first slide
View graphic version