Essential HPC
From Cluster Documentation Project
Practical High Performance Concepts, Techniques, and Procedures
This book will be based on Cluster Documentation Project (CDP) wiki content and is intended to help answer questions and get your HPC projects moving forward quickly. The book is expected to be available in the near future. A soft-copy (electronic) version will be available at no charge, and a special limited-edition hard copy will be printed "on demand." We are looking for sponsors for an SC 2017 version. (More details soon.)
Tentative Table of Contents
Fundamental Concepts
- What is HPC and Why Does It Matter
- Introduction To Building Blocks
- Setting Expectations
- Parallel and Distributed Computing 101
- What Do I Need to Know As An Administrator?
- What Do I Need to Know As A User?
Taking The First Steps
- It All Depends
- Understanding Costs
- Success Metrics
- Choosing Hardware
- Choosing Software
- Using A Cloud
Techniques
- Using an Open Approach To HPC
- Designing Your Solution
- Cluster Provisioning and Control
- Sharing The Resource
- Parallel Programming
Procedures
- Software Management
- File Systems
- Testing and Tuning
- Monitoring
Next Steps
- HPC Applications
- Other Resources