-
Notifications
You must be signed in to change notification settings - Fork 312
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EFS /home in parallel cluster #2344
Comments
Hi @andrei-xdlab, Why not use the cluster config to mount the EFS drive on all the cluster nodes instead of doing it via fstab in the AMI? This would likely require mounting the EFS drive on a directory other than /home, but it would avoid the need for any customization. In theory, you could accomplish this without using the config would be by using a pre-install script to modify the portions of the ParallelCluster cookbook recipes in-place so that the head node doesn't attempt to export the home directory, and the compute nodes don't attempt to mount it. |
Hi @tilne Currently, parallel cluster doesn't support EFS access points natively, so we are using fstab in the AMI to mount /home access point and set default user permissions. I am able to use cluster config to mount EFS filesystem on the a directory other than /home. But we have a customer that requires a large /home that would persist after pcluster delete operations. We also have to backup /home and EFS is best option for that. Can you suggest which ParallelCluster cookbook recipes have to be modified via per-install script in order to accomplish our goal? Do you plan to introduce EFS /home option for parallel cluster in the future? We are trying to complete replace EBS and NFS running on the master node with AWS shared filesystem solutions (EFS and FSx) to avoid performance bottleneck. Happy New Year! |
In the current version, the recipes that need to be modified will be
I'm not aware of any definitive plans, but I believe it's been requested before and we're tracking that internally.
Same to you! 🥂 |
I use this kind of EFS-backed home directory, but have it mounted at /users instead. |
I'm going to close this ticket in favour of:
|
|
We have a requirement for a large persistent /home EFS filesystem + AWS backup shared across cluster nodes. I created custom AMI (2.10.1) and able to mount /home access points but pcluster deployment is failing due to "nfs_export failure" (see cfg-init.log snippet below). I believe pcluster is trying to re-export EFS NFS /home and build is failing. How do I disable /home nfs export in template to allow successful build?
cfn-init.log
Error executing action
create
on resource 'nfs_export[/home]'Mixlib::ShellOut::ShellCommandFailed
execute[exportfs] (/etc/chef/local-mode-cache/cache/cookbooks/nfs/providers/export.rb line 43) had an error: Mixlib::ShellOut::ShellCommandFailed: Expected process to exit with [0], but received '1'
---- Begin output of exportfs -ar ----
STDOUT:
STDERR: exportfs: /etc/exports [1]: Neither 'subtree_check' or 'no_subtree_check' specified for export "172.31.0.0/16:/shared".
Assuming default behaviour ('no_subtree_check').
NOTE: this default has changed since nfs-utils version 1.0.x
exportfs: /etc/exports [2]: Neither 'subtree_check' or 'no_subtree_check' specified for export "172.31.0.0/16:/home".
Assuming default behaviour ('no_subtree_check').
NOTE: this default has changed since nfs-utils version 1.0.x
exportfs: /home requires fsid= for NFS export
---- End output of exportfs -ar ----
Ran exportfs -ar returned 1
Cookbook Trace:
/etc/chef/local-mode-cache/cache/cookbooks/nfs/providers/export.rb:73:in `block in class_from_file'
Resource Declaration:
In /etc/chef/local-mode-cache/cache/cookbooks/aws-parallelcluster/recipes/head_node_base_config.rb
130: nfs_export "/home" do
131: network node['cfncluster']['ec2-metadata']['vpc-ipv4-cidr-blocks']
132: writeable true
133: options ['no_root_squash']
134: end
135:
The text was updated successfully, but these errors were encountered: