First Experiences with Composable Hardware: An HPC User Perspective
- Written by Douglas Eadline
Note: this paper was prepared for a conference that we decided not to attend (okay, it was not accepted). It is written in a more formal style than the normal ClusterMonkey articles and is sponsored by the Beowulf Foundation.
Abstract
Popular homogeneous clustered HPC systems (e.g., commodity x86 servers connected by a high-speed interconnect) have given way to heterogeneous clusters composed of multi-core servers, high-speed interconnects, accelerators (often GPU based), and custom storage arrays. Cluster designers are often faced with finding a balance between purpose-built systems (tailored to specific problem domains) and general-use systems. Traditional cluster-based approaches, however, all share a hard boundary between internal server buses (mainly PCIe) and the rest of the cluster. In heterogeneous environments, the server boundary often creates inefficient resource management, limits solution flexibility, and heavily influences the design of clustered HPC applications. This paper explores the malleability of the GigaIO™ FabreX™ PCIe memory fabric in relation to HPC cluster applications. A discussion of emerging concepts (e.g., a routable PCIe bus) and hands-on benchmarks using shared GPUs will be provided. In addition, results of a simple integration with the SLURM resource scheduler will be discussed as a way to make composable/malleable computing transparently available to end users.
Keywords. Composable computing, malleable computing, PCIe, HPC cluster, SLURM, benchmark, FabreX, GigaIO, resource scheduler
FLOP GUN: Return of the Beowulf Bash
- Written by Number Six
The Bash is Back! After a two-year Covid respite, the Bash is back live in Dallas for 2022. And this year we are all about FLOP (sorry, Top) Gun: Maverick, the hugely popular sequel (remake?) with old-timey jets, a contrived mission, a Death Star-like trench, all sorts of contorted nosebleed high-G maneuvers, unseen bad guys, and a little ground fighting just for fun. We are all in. What's not to like?
IMPORTANT: We know Covid is not over (we are science types, after all) and want everyone to be safe. For that reason, we are inviting all attendees to take the HPC Community Covid Safety Pledge.
Now the fun parts. First, you can go to the Beowulf Bash Page and check out our extra snarky invite. There will be jet fighting (simulated, sorry) and a laser tag contest where you can work on becoming the Top Gun, refreshments, a band and a place you can talk with your friends (we provide a quiet room).
The live action event begins at 9pm Monday, November 14, right after SC’s Opening Gala. We'll be flying the jets at Gilley’s Dallas – 1135 Botham Jean Blvd.
Last but not least, you can also meet the new Beowulf Foundation Mascot "Potato."
Jack Dongarra Likes Julia (the language)
- Written by Douglas Eadline

From the "What We Have Been Saying Dept"
Fresh from Julia Computing is a short note about what HPC maven Jack Dongarra had to say about Julia.
2021 Turing Award Winner Jack Dongarra Says Julia Is ‘Much Better’ Than Other Languages and Should Perhaps Take Over: The winner of the 2021 Turing Award, often referred to as the ‘Nobel Prize of Computing’, is Jack Dongarra. Dongarra says Julia is ‘much better’ than other languages and should perhaps take over. According to ZDNet:
“While hardware speeds up matrix multiplication, [Jack] Dongarra is, again, mindful of the needs of the scientists and the software writer. ‘I grew up writing FORTRAN, and today we have much better mechanisms’ such as the Julia programming language and Jupyter Notebooks. What's needed now, he said, are more ways to ‘express those computations in an easy way,’ meaning linear algebra computations such as matrix multiplications. Specifically, more tools are needed to abstract the details. ‘Making the scientist more productive is the right way to go,’ he said. Asked what software programming paradigm should perhaps take over, Dongarra suggested the Julia language is one good candidate …”
Of course, some forward-looking sites (cough, cough) have been highlighting Julia for ten years.
These are Not the Watts You Are Looking For
- Written by Douglas Eadline
From the What's Watts Dept.
Some Clarity around TDP ratings
Managing power usage on multi-core processors has become an important aspect of modern computing systems. At the same time, finding an accurate specification of actual power usage has become more difficult. Knowing power usage is important in many areas, particularly when considering the efficiency of High Performance Computing (HPC) systems. In almost all modern CPUs and GPUs, the only number that seems to give a hint about power usage is Thermal Design Power, or TDP. According to Wikipedia, Thermal Design Power is defined as follows:
... is the maximum amount of heat generated by a computer chip or component (often a CPU, GPU or system on a chip) that the cooling system in a computer is designed to dissipate under any workload.
Some sources state that the peak power rating for a microprocessor is usually 1.5 times the TDP rating.
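To make the rule of thumb above concrete, here is a minimal Python sketch that scales a TDP rating by the quoted factor of 1.5. The function names and the example wattages are hypothetical and chosen only for illustration. The factor itself is the "some sources" estimate from the text, not a measured or vendor-specified value.

```python
def estimated_peak_watts(tdp_watts: float, peak_factor: float = 1.5) -> float:
    """Rough peak-power estimate: TDP scaled by a rule-of-thumb factor.

    The default factor of 1.5 is the estimate quoted in the text and is
    an assumption, not a specification for any particular chip.
    """
    if tdp_watts < 0:
        raise ValueError("TDP must be non-negative")
    return tdp_watts * peak_factor


def node_peak_watts(tdp_ratings, peak_factor: float = 1.5) -> float:
    """Sum the estimated peak power over several components in one node."""
    return sum(estimated_peak_watts(t, peak_factor) for t in tdp_ratings)


if __name__ == "__main__":
    # Hypothetical node: one 100 W CPU and one 200 W GPU (illustrative values).
    print(estimated_peak_watts(100))
    print(node_peak_watts([100, 200]))
```

An estimate like this is only useful for rough power budgeting. Actual draw depends on the workload and on how a given vendor defines TDP.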
Disrupt Forward: Announcing the Beowulf Foundation
- Written by Douglas Eadline

From the self reference department
The Beowulf Foundation was announced by Douglas Eadline and Lara Kisielewska at the SC 2021 Beowulf Bash in St. Louis. The idea for a “Foundation” emerged from the continued discussion among the Beowulf community about supporting the “Beowulf Ethos,” which began with the Beowulf Project at NASA using commodity hardware and open source software to build high-performance systems at low cost. Initially considered an anomaly, the “wrong ideas” demonstrated by the Beowulf Project changed the face of modern supercomputing.
The goal of the Beowulf Foundation is not to relive the past, but to support the “wrong” ideas of the future that may lead to further breakthroughs in high performance computing. As stated by Eadline: