Comparison of cluster software

The following tables compare general and technical information for notable computer cluster software. This software can be grossly separated in four categories: Job scheduler, nodes management, nodes installation and integrated stack (all the above).

General information

Software Maintainer Category Development status ArchitectureOCS High-Performance/ High-Throughput Computing License Platforms supported Cost Paid support available
Alchemi
Amoeba MIT
Base One Foundation Component Library Proprietary
Boss StackIQ All in one actively developed Master deploys to members High-Performance Various licenses RHEL, CentOS, Oracle, Scientific Linux Varies Yes
HTCondor University Of Wisconsin CS dep Job/Data Scheduler actively developed Distributed master/executions/submits node HTC Apache license v2.0 Unix-like, Windows, Mac OS X Free Yes
DIET INRIA, SysFera, Open Source All in one GridRPC, SPMD, Hierarchical and distributed architecture, CORBA HTC/HPC CeCILL Unix-like, Mac OS X, AIX Free
Ganglia Monitoring actively developed BSD Unix, Linux, Windows NT/XP/2000/2003/2008, FreeBSD, NetBSD, OpenBSD, DragonflyBSD, Mac OS X, Solaris, AIX, IRIX, Tru64, HPUX. Free
GreenTea Software
Gridbus Toolkit
Globus Toolkit Globus Alliance, Argonne National Laboratory Job/Data Scheduler actively developed SOA Grid Linux Free
Grid MP Univa (formerly United Devices) Job Scheduler actively developed Distributed master/worker HTC/HPC Proprietary Windows, Linux, Mac OS X, Solaris Cost
grun Erik Aronesty (Expression Analysis) actively developed master/worker HPC/HTC GPL Linux, Mac OS X, BSD Free
JPPF Laurent Cohen (founder) actively developed distributed master/worker and P2P HPC/HTC Apache license v2.0 Windows, Linux, Mac OS X, Solaris Free
Kubernetes Google actively developed Apache license v2.0 Linux Free
LanderCluster Lander Software Technology Co. Ltd Job Scheduler/Monitoring actively developed Proprietary Windows, Linux, & UNIX platforms Cost
JSTM
Apache Mesos Apache actively developed Apache license v2.0 Linux Free Yes (Mesosphere)
Moab Cluster Suite Cluster Resources, Inc. Job Scheduler/Monitoring actively developed HPC Proprietary Linux, Mac OS X, Windows, AIX, OSF/Tru-64, Solaris, HP-UX, IRIX, FreeBSD & other UNIX platforms Cost
Moab Cluster Suite Cluster Resources, Inc. Job Scheduler actively developed HPC Proprietary Linux, Mac OS X, Windows, AIX, OSF/Tru-64, Solaris, HP-UX, IRIX, FreeBSD & other UNIX platforms Cost
‌Maui Cluster Scheduler followup? Job Scheduler HTC/HPC Proprietary Unix-like, Free
NetworkComputer ‌RTDA Runtime Design Automation actively developed HTC/HPC Proprietary Unix-like, Windows Cost
OAR INRIA and LIG Job Scheduler actively developed HPC/HTC GPL Linux/*nix Free
OpenLava Teraproc Job Scheduler actively developed Master/Worker, multiple admin/submit nodes HTC/HPC GPL Linux Free Yes
PBS Professional PBS Works (A division of Altair) Job Scheduler/Monitoring actively developed HPC Proprietary Unix, Linux, Windows Cost
Platform LSF Platform Computing Job Scheduler actively developed HPC/HTC Proprietary Unix, Linux, Windows Cost
Platform Cluster Manager Platform Computing All in one actively developed HTC/HPC OpenSource Linux Free
Rocks Cluster Distribution Open Source/NSF grant All in one actively developed HTC/HPC OpenSource CentOS Free
Popular Power
ProActive INRIA, ActiveEon, Open Source All in one actively developed Master/Worker, SPMD, Distributed Component Model, Skeletons HTC/HPC GPL Unix-like, Windows, Mac OS X Free
PRUN Andrey Budnik Job Scheduler actively developed Master node/exec clients, multiple admin/submit nodes HTC Apache license v2.0 Linux/*nix Free
RPyC Tomer Filiba actively developed MIT License *nix/Windows Free
SLURM SchedMD Job Scheduler actively developed HPC/HTC GPL Linux/*nix Free Yes
Oracle Grid Engine Univa Job Scheduler active Development moved to Univa Grid Engine Master node/exec clients, multiple admin/submit nodes HPC/HTC Proprietary *nix/Windows Cost
Son of Grid Engine Dave Love Job Scheduler actively developed Master node/exec clients, multiple admin/submit nodes HPC/HTC SISSL *nix/Windows Free
SynfiniWay Fujitsu actively developed HPC/HTC ? Unix, Linux, Windows Cost
TORQUE Resource Manager Torque Cluster Resources, Inc. Job Scheduler actively developed custom Linux, *nix Free
UniCloud Univa All in One (dynamic cluster creation/re-sizing, cloud bursting, etc.) Actively Developed Proprietary Oracle Unbreakable Linux, RHEL, and Cent Os Cost
UniCluster Univa All in One Functionality and development moved to UniCloud (see above) Free Yes
UNICORE
Univa Grid Engine Univa Job Scheduler actively developed Master node/exec clients, multiple admin/submit nodes HPC/HTC Proprietary *nix/Windows Cost
Vaakya Vaakya Technologies Pvt Ltd R&D Technology provider actively developed Cross-Platform, Distributed Computing Architecture Proprietary Windows/Linux Cost
XGE
Xgrid Apple Computer
Software Maintainer Category Development status Architecture High-Performance/ High-Throughput Computing License Platforms supported Cost Paid support available

Table explanation

Technical information

Software Implementation Language Authentification Encryption Integrity Global File System Global File System + Kerberos Heterogeneous/ Homogeneous exec node Jobs priority Group priority Queue type SMP aware Max exec node Max job submitted CPU scavenging Parallel job Job checkpointing
Torque C SSH, munge None, any Heterogeneous Yes Yes Programmable Yes tested tested Yes Yes Yes (blcr)
OAR Perl, Ocaml, Ruby SSH None, NFS Heterogeneous Yes Yes Programmable Yes tested 80k tested >20k Yes Yes Yes (blcr)
OpenLava C/C++ OS authentication None NFS Heterogeneous Linux Yes Yes Configurable Yes Yes, supports preemption based on priority Yes Yes
Platform LSF yes Yes to start jobs. Did it suspend job when the person come back? Yes
Condor C++ GSI, SSL, Kerberos, Password, File System, Remote File System, Windows, Claim To Be, Anonymous None, Triple DES, BLOWFISH None, MD5 None, NFS, AFS Not official, hack with ACL and NFS4 Heterogeneous Yes Yes Fair-share with some programmability basic (hard separation into different node) tested ~10000? tested ~100000? Yes MPI, OpenMP, PVM Yes
Slurm C Munge, None, Kerberos Heterogeneous Yes Yes Multifactor Fair-share yes tested 120k tested 100k No Yes Yes (blcr)
Univa Grid Engine C Authentification Encryption Integrity Arbitrary, e.g. NFS, Lustre, HDFS, AFS AFS Fully heterogeneous Yes; automatically policy controlled (e.g. fair-share, deadline, resource dependent) or manual Yes; can be dependent on user groups as well as projects and is governed by policies Batch, interactive, checkpointing, parallel and combinations Yes, with core binding, GPU and Intel Xeon Phi support commercial deployments with many tens of thousands hosts >300K tested in commercial deployments Yes; can suspend job on interactive usage Yes, with support of arbitrary parallel environments such as OpenMPI, MPICH 1/2, MVAPICH 1/2, LAM, etc. Yes, with support for user, kernel or library level checkpointing environments
Software programation language Authentification Encryption Integrity Global File System Global File System + Kerberos Heterogeneous/ Homogeneous exec node Jobs priority Group priority Queue type SMP aware Max exec node Max job submitted CPU scavenging Parallel job Job checkpointing

Table Explanation

See also

Notes

    This article is issued from Wikipedia - version of the Thursday, April 07, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.