Essential HPC
From Cluster Documentation Project
Practical High Performance Concepts, Techniques, and Procedures
This book will be based on CDP wiki content is is intended to help answer questions and get your HPC projects moving forward quickly.
Tentative Table of Contents
Fundamental Concepts
- What is HPC and Why Does It Matter
- Introduction To Building Blocks
- Setting Expectations
- Parallel and Distributed Computing 101
- What Do I Need to Know As An Administrator?
- What Do I need to Know As A User?
Taking The First Steps
- It all Depends
- Understanding Costs
- Success Metrics
- Choosing Hardware
- Choosing Software
- Using A Cloud
Techniques
- Using an Open Approach To HPC
- Designing Your Solution
- Cluster Provisioning and Control
- Sharing The Resource
- Parallel Programming
Procedures
- Software Management
- File Systems
- Testing and Tuning
- Monitoring
Next Steps
- HPC Applications
- Other Resources