Reviews and Benchmarks
Here is where the rubber meets the road, the wheat is separated from the chaff, and the hard questions get answered. Clusters are all about performance and getting the best bang for your buck. Join us as we go under the hood and look at the power plants that are driving HPC today.
- Written by Douglas Eadline
Note: this paper was prepared for a conference that we decided not to attend (okay, it was not accepted). It is written in a more formal style than the normal ClusterMonkey articles and is sponsored by The Beowulf Foundation.
Abstract
Popular homogeneous clustered HPC systems (e.g., commodity x86 servers connected by a high-speed interconnect) have given way to heterogeneous clusters composed of multi-core servers, high-speed interconnects, accelerators (often GPU-based), and custom storage arrays. Cluster designers are often faced with finding a balance between purpose-built systems (tailored to specific problem domains) and general-use systems. Traditional cluster-based approaches, however, all share a hard boundary between internal server buses (mainly PCIe) and the rest of the cluster. In heterogeneous environments, the server boundary often creates inefficient resource management, limits solution flexibility, and heavily influences the design of clustered HPC applications. This paper explores the malleability of the GigaIO™ FabreX™ PCIe memory fabric in relation to HPC cluster applications. A discussion of emerging concepts (e.g., a routable PCIe bus) and hands-on benchmarks using shared GPUs will be provided. In addition, results of a simple integration with the SLURM resource scheduler will be discussed as a way to make composable/malleable computing transparently available to end users.
Keywords: Composable computing, malleable computing, PCIe, HPC cluster, SLURM, benchmark, FabreX, GigaIO, resource scheduler
Subcategories
Interconnects Article Count: 3
Getting the data where it needs to go is only half the story. Getting it there quickly and with minimal latency is the issue with clusters. Whether it is one byte or a gigabyte, interconnects are what get the work done.
Cluster Hardware Article Count: 3
Choosing cluster hardware can be difficult without some real application data and experience. Our hardware reviews will try to offer some insights into today's hardware choices.
System Software Article Count: 1
Books Article Count: 1
We list all the books on clusters we could find. We even read most of these books. Where we felt qualified, we provide a short review.
Benchmarking Methods Article Count: 5
Not only are we going to provide the benchmark numbers, we also provide the benchmark methods and techniques. How is that for service? Now you can run your own benchmarks.