Connecting a standalone login node to multiple clusters in AWS PCS
The pcs-multi-cluster-login-configure.sh script provides an automated way to
configure multiple Slurm sackd daemons on a single standalone login node. It enables the
login node to communicate with multiple clusters. The script automates the following
operations:
-
Uses AWS PCS API actions to get cluster information
-
Prompts for the base64-encoded Slurm authentication key
-
Creates a Slurm JWKS file with cluster authentication key
-
Configures the
sackdservice with cluster endpoints and ports -
Creates a
systemdservice file for a cluster-specificsackddaemon -
Generates an activation script for cluster environment setup
-
Enables and starts the
sackdservice
Note
This script requires Slurm version 25.05 or later.
Slurm must already be installed on the instance (equivalent to
step 3 in the manual process).
The instance must be able to reach the target cluster's endpoints. The script performs the
equivalent operations of step 4
and step 5 in the manual
configuration process. It automatically gets the cluster information, configures the sackd service,
creates the necessary systemd service files, and creates an activation script
that users can use to configure their shell environment for cluster interaction.