Skip to content

Commit

Permalink
update cluster resources
Browse files Browse the repository at this point in the history
  • Loading branch information
kukushking committed Nov 11, 2024
1 parent 7ea0e5d commit bc88ffd
Show file tree
Hide file tree
Showing 3 changed files with 9 additions and 8 deletions.
1 change: 1 addition & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,6 +19,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
### **Changed**
- changed `ray-image` to pull from AWS Public ECR to avoid docker pull rate limits
- changed `ray-orchestrator` to not retrieve full training job logs and avoid `States.DataLimitExceeded`
- update `ray-on-eks` manifest cluster resources

## v1.7.0

Expand Down
4 changes: 2 additions & 2 deletions manifests/ray-on-eks/core-modules.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -44,9 +44,9 @@ parameters:
eks_node_labels:
usage: core
- eks_ng_name: ng-gpu
eks_node_quantity: 1
eks_node_quantity: 5
eks_node_max_quantity: 10
eks_node_min_quantity: 1
eks_node_min_quantity: 5
eks_node_disk_size: 400
eks_node_instance_type: "g4dn.4xlarge"
eks_node_labels:
Expand Down
12 changes: 6 additions & 6 deletions manifests/ray-on-eks/ray-cluster-modules.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -31,11 +31,11 @@ parameters:
- name: HeadResources
value:
requests:
cpu: "1"
memory: "8G"
cpu: "8"
memory: "24G"
limits:
cpu: "4"
memory: "16G"
cpu: "8"
memory: "24G"
- name: WorkerReplicas
value: 1
- name: WorkerMinReplicas
Expand All @@ -45,8 +45,8 @@ parameters:
- name: WorkerResources
value:
requests:
cpu: "4"
memory: "8G"
cpu: "14"
memory: "60G"
limits:
cpu: "14"
memory: "60G"
Expand Down

0 comments on commit bc88ffd

Please sign in to comment.