Skip to content
This repository has been archived by the owner on Jun 6, 2024. It is now read-only.

Add manual for adding node to existing PAI cluster #5267

Open
Binyang2014 opened this issue Jan 27, 2021 · 1 comment
Open

Add manual for adding node to existing PAI cluster #5267

Binyang2014 opened this issue Jan 27, 2021 · 1 comment
Assignees

Comments

@Binyang2014
Copy link
Contributor

To add node to existing PAI cluster we need to do:

  1. run ansible-playbook -i ${HOME}/pai-deploy/cluster-cfg/hosts.yml docker-runtime-setup.yml --limit=nodelist to change docker daemon config
  2. run ansible-playbook -i inventory/pai/hosts.yml cluster.yml --become --become-user=root -e "@inventory/gcrv100/openpai.yml" --limit=node list to add node to k8s cluster.
  3. If you encounter cert related error. Please make sure all etcd master node has: /usr/local/bin/etcd-scripts and has /etc/ssl/etcd/openssl.conf. If not, copy it from other etcd masters
  4. After add node to k8s cluster, if you want to remove CNI, please follow doc to remove it
  5. Change PAI layout.yaml to add node to layout.yaml. Change services-configuration.yaml to update hived config
  6. Update pai config by ./paictl config push -p /udpated-config
  7. run ./paictl service start -n cluster-configuration to update config
  8. Restart rest-server, hived, job-exporter
@Binyang2014 Binyang2014 changed the title Add manual for adding node in PAI cluster Add manual for adding node to existing PAI cluster Jan 27, 2021
@suiguoxin
Copy link
Member

Add-node issue for clusters with single master / etcd node is resolved in #5269.

This issue can be left here to track potential issues for clusters with multiple master/etcd nodes.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants