HPC Challenge Benchmark Record

System Information
Affiliation:   Cray   URL:   www.cray.com
Location:   USA, Wisconsin   System Use:   Vendor
System Manufacturer:   Cray Inc.   System Name:   mfeg8
Interconnect Manufacturer:   Cray   Interconnect Type:   Modified 2D Torus
Operating System:   Unicos/MP   MPI:   mpt 2.4
MPI Wtick:   0.000000008849558   BLAS:   libsci 2.4
Language:   C   Compiler:   PrgEnv 5.4
Compiler Flags:   -O2 -hlist=m -DLONG_IS_64BITS -hrestrict=a   Processor Type:   Cray X1E
Processor Speed:   1.13 GHz   Total Processors:   256
Processors Entered:   248   Processors determined:   248
Cores per chip:   1   HPL Processes:   248
MPI Processes:   248   Threads Entered:   1
Threads determined:   1   FLOPs per cycle:   16
Theoretical peak:   4.464 TFlop/s   Total memory:   GiB
FFT library:    
Explain Optimizations:
STREAM: Aligned the data to cache line boundaries and added no_cache_alloc directives.
Single-CPU RandomAccess: Changed vector length to 1024 and added concurrent directive.
MPI RandomAccess: Changed to use the UPC version.
FFTs: Added directives to vectorize. Collapsed some loops.
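
The theoretical peak follows from the processor count, clock rate, and FLOPs per cycle listed above. The sketch below is a rough check rather than part of the benchmark output. It assumes that the listed 1.13 GHz is a rounding of 1.125 GHz (the record does not say so); under that assumption the product is 4464 GFlop/s, or 4.464 TFlop/s. The function and variable names are illustrative.

```python
def theoretical_peak_gflops(processors, clock_ghz, flops_per_cycle):
    """Peak rate in GFlop/s: processors x clock (GHz) x FLOPs per cycle."""
    return processors * clock_ghz * flops_per_cycle

# Values from the record: 248 processors entered, 16 FLOPs per cycle.
peak_listed_clock = theoretical_peak_gflops(248, 1.13, 16)

# Assumption: the actual clock is 1.125 GHz, shown rounded as 1.13 GHz.
peak_assumed_clock = theoretical_peak_gflops(248, 1.125, 16)

print(f"peak at 1.13 GHz:  {peak_listed_clock:.2f} GFlop/s")
print(f"peak at 1.125 GHz: {peak_assumed_clock:.2f} GFlop/s")
print(f"peak at 1.125 GHz: {peak_assumed_clock / 1000:.3f} TFlop/s")
```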

HPL:   3.38887 Tflop/s   HPL time:   1416.71
HPL eps:   1.11022e-16   HPL Rnorm1:   0.00000000168794
HPL Anorm1:   48543   HPL AnormI:   48565.9
HPL Xnorm1:   156898   HPL XnormI:   4.56798
HPL N:   193111   HPL NB:   187
HPL NProw:   31   HPL NPcol:   8
HPL depth:   1   HPL NBdiv:   2
HPL NBmin:   4   HPL CPfact:   R
HPL CRfact:   R   HPL CPtop:   1
HPL order:   R
HPL dMach EPS:     HPL sMach EPS:  
HPL dMach sfMin:     HPL sMach sfMin:  
HPL dMach Base:     HPL sMach Base:  
HPL dMach Prec:     HPL sMach Prec:  
HPL dMach mLen:     HPL sMach mLen:  
HPL dMach Rnd:     HPL sMach Rnd:  
HPL dMach eMin:     HPL sMach eMin:  
HPL dMach rMin:     HPL sMach rMin:  
HPL dMach eMax:     HPL sMach eMax:  
HPL dMach rMax:     HPL sMach rMax:  
dweps:     sweps:  
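
The HPL rate can be cross-checked from HPL N and HPL time using the operation count that HPL reports for an LU solve, 2/3 N^3 + 3/2 N^2. The sketch below is a sanity check under that formula, not the benchmark's own code; it reproduces the listed 3.38887 Tflop/s to within about 0.01 percent. The fraction of peak at the end assumes a theoretical peak of 4464 GFlop/s, which is an interpretation of the peak field rather than something the record states in those units.

```python
def hpl_flop_count(n):
    """Operation count HPL credits for solving an n x n system."""
    return (2.0 / 3.0) * n**3 + 1.5 * n**2

def hpl_tflops(n, seconds):
    """HPL rate in Tflop/s from problem size and wall time."""
    return hpl_flop_count(n) / seconds / 1e12

# Values from the record.
hpl_n = 193111
hpl_time = 1416.71

rate = hpl_tflops(hpl_n, hpl_time)
print(f"recomputed HPL rate: {rate:.5f} Tflop/s")

# Assumption: theoretical peak is 4464 GFlop/s (4.464 TFlop/s).
assumed_peak_tflops = 4.464
fraction_of_peak = 3.38887 / assumed_peak_tflops
print(f"fraction of assumed peak: {fraction_of_peak:.3f}")
```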

PTRANS:   66.0098 GB/s   PTRANS time:   1.12988 seconds
PTRANS residual:   0   PTRANS N:   96555
PTRANS NB:   51   PTRANS NProw:   31
PTRANS NPcol:   8
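
The PTRANS figure is consistent with moving one N x N matrix of 8-byte doubles in the measured time. The sketch below recomputes it from PTRANS N and PTRANS time; it assumes 8 bytes per element (matching the double size listed in this record) and decimal gigabytes. The names are illustrative.

```python
def ptrans_gb_per_s(n, seconds, bytes_per_element=8):
    """Transpose rate in GB/s: bytes in an n x n matrix divided by time."""
    return bytes_per_element * n * n / seconds / 1e9

# Values from the record.
ptrans_n = 96555
ptrans_time = 1.12988

ptrans_rate = ptrans_gb_per_s(ptrans_n, ptrans_time)
print(f"recomputed PTRANS rate: {ptrans_rate:.4f} GB/s")
```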

S-STREAM Copy:   27.4771 GB/s   S-STREAM Scale:   27.3668 GB/s
S-STREAM Add:   32.9811 GB/s   S-STREAM Triad:   32.8212 GB/s
EP-STREAM Copy:   10.7501 GB/s   EP-STREAM Scale:   10.819 GB/s
EP-STREAM Add:   13.3785 GB/s   EP-STREAM Triad:   13.2295 GB/s
STREAM Vector Size:   50123456   STREAM Threads:   1
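
For scale, the STREAM vector size implies the working set below. This is a rough estimate, assuming the three double-precision arrays that the STREAM kernels operate on and treating the vector size as a per-process element count; the record itself does not report a memory footprint.

```python
STREAM_ARRAYS = 3      # STREAM kernels use three arrays (a, b, c)
BYTES_PER_DOUBLE = 8   # matches the double size listed in this record

def stream_footprint_bytes(vector_size):
    """Total bytes held by the three STREAM arrays."""
    return STREAM_ARRAYS * BYTES_PER_DOUBLE * vector_size

footprint = stream_footprint_bytes(50123456)
footprint_gib = footprint / 2**30
print(f"STREAM working set: {footprint} bytes ({footprint_gib:.2f} GiB)")
```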

S-RandomAccess:   0.249107 Gup/s   EP-RandomAccess:   0.135601 Gup/s
G-RandomAccess:   1.85475 Gup/s   G-RandomAccess N:   34359738368
G-RandomAccess time:   74.1012 seconds   G-RandomAccess Check Time:   -1 seconds
G-RandomAccess Errors:   -1   G-RandomAccess Errors Fraction:   -1
G-RandomAccess TimeBound:     G-RandomAccess ExeUpdates:  
RandomAccess N:   1
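
The G-RandomAccess rate can be recovered from the table size and run time. The sketch below assumes the benchmark's default of four updates per table entry (the TimeBound and ExeUpdates fields are empty in this record, so the actual update count is not shown). Under that assumption the result matches the listed 1.85475 Gup/s.

```python
def gups(table_size, seconds, updates_per_entry=4):
    """Giga-updates per second, assuming updates_per_entry * table_size updates."""
    return updates_per_entry * table_size / seconds / 1e9

# Values from the record.
table_size = 34359738368   # G-RandomAccess N, equal to 2**35
run_time = 74.1012         # G-RandomAccess time in seconds

g_random_access = gups(table_size, run_time)
print(f"recomputed G-RandomAccess: {g_random_access:.5f} Gup/s")
```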

S-FFT:   2.45229 GFlop/s   EP-FFT:   1.83733 GFlop/s
MPIFFT:   -1 GFlop/s   MPIFFT N:   -1
MPIFFT Max Error:   -1   MPIFFT time0:   0 seconds
MPIFFT time1:   0 seconds   MPIFFT time2:   0 seconds
MPIFFT time3:   0 seconds   MPIFFT time4:   0 seconds
MPIFFT time5:   0 seconds   MPIFFT time6:   0 seconds
FFTEnblk:     FFTEnp:  

S-DGEMM:   14.7748 GFlop/s   EP-DGEMM:   13.564 GFlop/s
DGEMM N:   6131

RandomRing Latency/Bandwidth
RandomRing Latency:   14.576 usec   RandomRing Bandwidth:   0.298865 GB/s

NaturalRing Latency/Bandwidth
NaturalRing Latency:   16.5398 usec   NaturalRing Bandwidth:   3.15336 GB/s

PingPong Latency/Bandwidth
Maximum PingPong Latency:   11.594 usec   Maximum PingPong Bandwidth:   11.0314 GB/s
Minimum PingPong Latency:   6.42201 usec   Minimum PingPong Bandwidth:   8.19108 GB/s
Average PingPong Latency:   8.06958 usec   Average PingPong Bandwidth:   8.9025 GB/s

Size of Data Types
char:   1 byte     short:   2 bytes
int:   4 bytes   long:   8 bytes
void ptr:   8 bytes   float:   4 bytes
double:   8 bytes   size_t:   bytes
s64Int:   bytes   u64Int:   bytes

M OpenMP:     OpenMP Num Threads:  
OpenMP Num Procs:     OpenMP Max Threads:  

MemProc:     MemSpec:  


Version: 0.8..a - Run Type: opt
Created: 2005-06-15 - Exported: Wed Jul 6 15:40:34 2022
