From 7d17e0ac42b3e0e8d97b8d575573835849781cd4 Mon Sep 17 00:00:00 2001
From: Enrico Usai
Date: Fri, 15 Dec 2023 16:30:36 +0100
Subject: [PATCH] Update 3.8.0 changelog

Signed-off-by: Enrico Usai
---
 CHANGELOG.md | 64 +++++++++++++++++++++++++++-------------------------
 1 file changed, 33 insertions(+), 31 deletions(-)

diff --git a/CHANGELOG.md b/CHANGELOG.md
index b09774ad0..db6d09ab4 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -25,20 +25,20 @@ This file is used to list changes made in each version of the AWS ParallelCluste
 ------
 
 **ENHANCEMENTS**
-- Add support for Rocky Linux 8.
 - Add support for EC2 Capacity Blocks for ML.
-- Add support for `Scheduling/SlurmSettings/Database/DatabaseName` parameter to render `StorageLoc` in the slurmdbd configuration generated by ParallelCluster.
-- Add the option to use EFS storage instead of NFS exports from the head node root volume for intra-cluster shared ParallelCluster, Intel, Slurm, login node, and /home data.
+- Add support for Rocky Linux 8.
+- Add support for `Scheduling/SlurmSettings/Database/DatabaseName` parameter to render `StorageLoc`
+  in the slurmdbd configuration generated by ParallelCluster.
+- Add the option to use EFS storage instead of NFS exports from the head node root volume
+  for intra-cluster shared file system resources: ParallelCluster, Intel, Slurm, and `/home` data.
 - Allow for mounting `home` as an EFS or FSx external shared storage via the `SharedStorage` section of the config file.
 
 **CHANGES**
 - Upgrade Slurm to 23.02.7 (from 23.02.6).
-- Do not wait for static nodes in maintenance to signal CFN that the head node initialization is complete.
-- Upgrade `aws-cfn-bootstrap` to version 2.0-28.
-- Upgrade Python to 3.9.17.
 - Upgrade NVIDIA driver to version 535.129.03.
-- Upgrade CUDA Toolkit to version 12.2.2
+- Upgrade CUDA Toolkit to version 12.2.2.
 - Use Open Source NVIDIA GPU drivers (OpenRM) as NVIDIA kernel module for Linux instead of NVIDIA closed source module.
+- Do not wait for static nodes in maintenance to signal CFN that the head node initialization is complete.
 - Upgrade EFA installer to `1.29.0`.
   - Efa-driver: `efa-2.6.0-1`
   - Efa-config: `efa-config-1.15-1`
@@ -47,11 +47,13 @@ This file is used to list changes made in each version of the AWS ParallelCluste
   - Rdma-core: `rdma-core-46.0-1`
   - Open MPI: `openmpi40-aws-4.1.6-1`
 - Upgrade GDRCopy to version 2.4 in all supported OSes, except for Centos 7 where version 2.3.1 is used.
+- Upgrade `aws-cfn-bootstrap` to version 2.0-28.
+- Upgrade Python to 3.9.17.
 
 **BUG FIXES**
 - Fix inconsistent scaling configuration after cluster update rollback when modifying the list of instance types declared in the Compute Resources.
 - Fix users SSH keys generation when switching users without root privilege in clusters integrated with an external LDAP server through cluster configuration files.
-- Fix disabling Slurm power save mode when setting ScaledownIdletime = -1
+- Fix disabling Slurm power save mode when setting `ScaledownIdletime = -1`.
 - Fix hard-coded path to Slurm installation dir in `update_slurm_database_password.sh` script for Slurm Accounting.
 
 3.7.2
@@ -725,7 +727,7 @@ When using a custom AMI with a preinstalled EFA package, no actions are taken at
 **CHANGES**
 
 - A Route53 private hosted zone is now created together with the cluster and used in DNS resolution inside cluster nodes
-  when using Slurm scheduler.
+  when using Slurm scheduler.
 - Upgrade EFA installer to version 1.9.5:
   - EFA configuration: ``efa-config-1.4`` (from efa-config-1.3)
   - EFA profile: ``efa-profile-1.0.0``
@@ -1096,7 +1098,7 @@ When using a custom AMI with a preinstalled EFA package, no actions are taken at
 -----
 
 - Set SGE Accounting summary to be true, this reports a single accounting record
-for a mpi job
+  for a mpi job
 - Add option to disable ganglia `extra_json = { "cfncluster" : { "ganglia_enabled" : "no" } }`
 
 
@@ -1111,48 +1113,48 @@ for a mpi job
 
 Major new features/updates:
 
-  - Added parameter to specify custom cfncluster-node package
+- Added parameter to specify custom cfncluster-node package
 
 Bug fixes/minor improvements:
 
-  - Fixed poise-python dependecy issue
-  - Poll on EBS Volume attachment status
-  - Added more info on failure of pre and post install files
-  - Fixed SLURM cron job to publish pending metric
+- Fixed poise-python dependecy issue
+- Poll on EBS Volume attachment status
+- Added more info on failure of pre and post install files
+- Fixed SLURM cron job to publish pending metric
 
 1.4.1
 -----
 
 Major new features/updates:
 
-  - Updated to latest cfncluster-node 1.4.3
+- Updated to latest cfncluster-node 1.4.3
 
 1.4.0
 -----
 
 Major new features/updates:
 
-  - Updated to Amazon Linux 2017.09.1
-  - Applied patches to Ubuntu 16.04
-  - Applied patches to Ubuntu 14.04
-  - Updated to Centos 7.4
-  - Upgraded Centos 6 AMI
-  - Updated to Nvidia driver 384
-  - Updated to CUDA 9
-  - Updated to latest cfncluster-node 1.4.2
+- Updated to Amazon Linux 2017.09.1
+- Applied patches to Ubuntu 16.04
+- Applied patches to Ubuntu 14.04
+- Updated to Centos 7.4
+- Upgraded Centos 6 AMI
+- Updated to Nvidia driver 384
+- Updated to CUDA 9
+- Updated to latest cfncluster-node 1.4.2
 
 Bug fixes/minor improvements:
 
-  - Added support for NVMe-based instance store
-  - Fixed ganglia plotting issue on ubuntu
-  - Fixed slow SLURM scaling times on systemd platforms.
+- Added support for NVMe-based instance store
+- Fixed ganglia plotting issue on ubuntu
+- Fixed slow SLURM scaling times on systemd platforms.
 
 1.3.2
 -----
-  - Relicensed to Apache License 2.0
-  - Updated to Amazon Linux 2017.03
-  - Pulled in latest cookbook dependencies
-  - Removed Openlava support
+- Relicensed to Apache License 2.0
+- Updated to Amazon Linux 2017.03
+- Pulled in latest cookbook dependencies
+- Removed Openlava support
 
 1.2.0
 -----