Difference between revisions of "Batch Schedulers"

From Cluster Documentation Project
(added SLURM)
Line 6: Line 6:
 
[http://www.clusterresources.com/pages/products/torque-resource-manager.php Torque] - TORQUE is an open source resource manager providing control over batch jobs and distributed compute nodes. It is a community effort based on the original *PBS project and, with more than 1,200 patches, has incorporated significant advances in the areas of scalability, fault tolerance, and feature extensions contributed by NCSA, OSC, USC, the U.S. Dept of Energy, Sandia, PNNL, U of Buffalo, TeraGrid, and many other leading-edge HPC organizations.
 
- [http://www.hpccommunity.org/index.php?pageid=lava Platform Lava] Platform Lava is an open source entry-level workload scheduler designed to meet a wide range of workload scheduling needs for clusters up to 512-nodes. Lava is available via the [http://www.hpccommunity.org/ HPCCommunity site] and is also included as a component of Platform Cluster Manager (PCM).
+ [http://www.hpccommunity.org/index.php?pageid=lava Platform Lava] is an open source entry-level workload scheduler designed to meet a wide range of workload scheduling needs for clusters of up to 512 nodes. Lava is available via the [http://www.hpccommunity.org/ HPCCommunity site] and is also included as a component of Platform Cluster Manager (PCM).
+
+ [http://slurm.schedmd.com/ SLURM] is an open-source workload manager designed for Linux clusters of all sizes. It provides three key functions. First, it allocates exclusive and/or non-exclusive access to resources (compute nodes) to users for some duration of time so they can perform work. Second, it provides a framework for starting, executing, and monitoring work (typically a parallel job) on a set of allocated nodes. Finally, it arbitrates contention for resources by managing a queue of pending work.

Revision as of 14:05, 18 July 2013

Sun Grid Engine - SGE is an openly available batch scheduler. The Grid Engine project is an open source community effort to facilitate the adoption of distributed computing solutions. Sponsored by Sun Microsystems and hosted by CollabNet, the Grid Engine project provides enabling distributed resource management software for wide-ranging requirements, from compute farms to grid computing. Current versions are 5.3 and 6.0.

Torque - TORQUE is an open source resource manager providing control over batch jobs and distributed compute nodes. It is a community effort based on the original *PBS project and, with more than 1,200 patches, has incorporated significant advances in the areas of scalability, fault tolerance, and feature extensions contributed by NCSA, OSC, USC, the U.S. Dept of Energy, Sandia, PNNL, U of Buffalo, TeraGrid, and many other leading-edge HPC organizations.

Platform Lava is an open source entry-level workload scheduler designed to meet a wide range of workload scheduling needs for clusters of up to 512 nodes. Lava is available via the HPCCommunity site and is also included as a component of Platform Cluster Manager (PCM).

SLURM is an open-source workload manager designed for Linux clusters of all sizes. It provides three key functions. First, it allocates exclusive and/or non-exclusive access to resources (compute nodes) to users for some duration of time so they can perform work. Second, it provides a framework for starting, executing, and monitoring work (typically a parallel job) on a set of allocated nodes. Finally, it arbitrates contention for resources by managing a queue of pending work.
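
The three functions described above (allocating nodes, running work on them, and queueing work that cannot start yet) can be illustrated with a toy model. The following Python sketch is not SLURM code and does not use any SLURM API; the class `ToyScheduler`, its methods, and the node names are hypothetical, and the strict first-in, first-out policy is an assumption chosen for simplicity rather than a description of SLURM's actual scheduling algorithm.

```python
from collections import deque


class ToyScheduler:
    """Toy FIFO batch scheduler; illustrative only, not how SLURM is implemented."""

    def __init__(self, node_names):
        self.free = list(node_names)   # nodes not currently allocated
        self.pending = deque()         # queue of (job_name, nodes_needed)
        self.running = {}              # job_name -> list of allocated nodes

    def submit(self, job_name, nodes_needed):
        """Queue a job, then start whatever fits at the head of the queue."""
        self.pending.append((job_name, nodes_needed))
        self._dispatch()

    def finish(self, job_name):
        """Release a finished job's nodes and try to start pending jobs."""
        self.free.extend(self.running.pop(job_name))
        self._dispatch()

    def _dispatch(self):
        # Strict FIFO: stop at the first pending job that does not fit,
        # even if a later, smaller job would fit (no backfilling).
        while self.pending and self.pending[0][1] <= len(self.free):
            job_name, nodes_needed = self.pending.popleft()
            self.running[job_name] = [self.free.pop(0) for _ in range(nodes_needed)]


# Hypothetical four-node cluster.
sched = ToyScheduler(["n1", "n2", "n3", "n4"])
sched.submit("a", 3)   # starts immediately on three nodes
sched.submit("b", 2)   # must wait: only one node is free
sched.submit("c", 1)   # waits behind "b" under strict FIFO

running_before = dict(sched.running)
pending_before = list(sched.pending)

sched.finish("a")      # frees three nodes, so "b" and then "c" start
running_after = dict(sched.running)
```

In this sketch, job "c" waits behind "b" even though a node is free for it; production schedulers typically offer more elaborate policies (priorities, backfilling), which the toy model deliberately leaves out.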