Co-Scheduling HPC Workloads on Cache-Partitioned CMP Platforms

Guillaume Aupy; Anne Benoit; Brice Goglin; Loïc Pottier; Yves Robert

Submitted by claxton on Wed, 05/29/2019 - 18:18

Title	Co-Scheduling HPC Workloads on Cache-Partitioned CMP Platforms
Publication Type	Journal Article
Year of Publication	2019
Authors	Aupy, G., A. Benoit, B. Goglin, L. Pottier, and Y. Robert
Journal	International Journal of High Performance Computing Applications
Volume	33
Issue	6
Pagination	1221-1239
Date Published	2019-11
Keywords	cache partitioning, chip multiprocessor, co-scheduling, HPC application
Abstract	With the recent advent of many-core architectures such as chip multiprocessors (CMPs), the number of processing units accessing a global shared memory is constantly increasing. Co-scheduling techniques are used to improve application throughput on such architectures, but sharing resources often generates critical interferences. In this article, we focus on the interferences in the last level of cache (LLC) and use the Cache Allocation Technology (CAT) recently provided by Intel to partition the LLC and give each co-scheduled application their own cache area. We consider m iterative HPC applications running concurrently and answer to the following questions: (i) How to precisely model the behavior of these applications on the cache-partitioned platform? and (ii) how many cores and cache fractions should be assigned to each application to maximize the platform efficiency? Here, platform efficiency is defined as maximizing the performance either globally, or as guaranteeing a fixed ratio of iterations per second for each application. Through extensive experiments using CAT, we demonstrate the impact of cache partitioning when multiple HPC applications are co-scheduled onto CMP platforms.
DOI	10.1177/1094342019846956

File:

icl-utk-1176.pdf

External Publication Flag: