Take a moment and ask yourself the following question:
If you can have both large scale OpenMP and MPI performance in an easy to manage single system image built from commodity hardware and price points, would you prefer that a solution?
This very question was the subject a recent white paper: Redefining Scalable OpenMP and MPI Price-to-Performance with Numascale’s NumaConnect published by Numascale.
Numascale has been perfecting the NumaConnect technology and has now published several benchmarks that show both excellent shared (OpenMP) and distributed (MPI) memory performance. A Numascale cluster uses commodity hardware and the “pug-and-play” NumaConnect interconnect to deliver the ease of shared memory programming and administration at standard HPC cluster price points. One running system currently offers users over 1,700 cores with a 4.6 TByte single memory image.
Reference implementations of NPB are available in commonly-used programming models such as MPI and OpenMP, which make them ideal for measuring the performance of both distributed memory and SMP systems. The results below were run on a NumaConnect Shared Memory benchmark system with 1TB of memory and 256 cores (using eight servers, each equipped with two AMD Opteron 2.5 GHz 6380 CPUs, each with 16 cores and 128GB of memory). Figure One shows results for the NPB-LU benchmark (Lower-Upper Gauss-Seidel solver) scaling over a range of 16 to 121 cores using OpenMP for the Class E problem size. The results show an unprecedented number of computing cores running a NAS benchmark of this size.
Figure One: OpenMP NAS Parallel results for NPB-LU (Class E) It should be noted that NASA has never seen OpenMP E Class results with such a high number of cores.
Figure Two: NPB-LU comparison of NumaConnect-MPI to FDR-MPI InfiniBand.
Additional benchmarks can be found the the white paper. There are several large scale shared memory production systems running with NumaConnect, including University of Oslo in Norway, StaOil, and Keele University.
The paper also includes an interesting cost comparison that shows a small NumaConnect cluster lands at a comparable price point to a small FDR InfiniBand cluster. There is more information in the full white paper.
Note: Douglas Eadline, author of this article is also the author of the Numascale white paper.