Presentation

ICE 2.0: Restructuring and Growing an Instructional HPC Cluster
Description
The Partnership for an Advanced Computing Environment (PACE) at Georgia Tech (GT) has run two campus-wide cluster resources for academic courses and workshops for five years. The initial design aimed to create a federated resource for a wide range of educational topics, based on a partnership between PACE and the College of Computing (COC). Because of how the project was funded, it took the form of two separate resources, one funded by PACE and the other by COC. These "Instructional Cluster Environments", PACE-ICE and COC-ICE, became very popular with instructors at GT, but the split nature of the environments led to a high maintenance cost. With the transition to the Slurm scheduler, PACE collaborated with COC to merge the two clusters into one, ICE. This work details the strategies used to sensibly merge the two production systems, including the storage architecture, shared system policies, and scheduler priority configurations that honor funding complexities.
Event Type
Workshop
Time
Sunday, 12 November 2023, 9:43am - 10am MST
Location
503-504
Tags
Resource Management
State of the Practice
Registration Categories
W