
VolumeSpec value can bloat as cluster size (number of nodes) increases due to allowed_nodes and preferred_nodes #1154

Closed
hasethuraman opened this issue Jul 7, 2022 · 3 comments · Fixed by openebs/mayastor-control-plane#275

@hasethuraman

Describe the bug
I created a cluster with 4 nodes (1 master and 3 agents) and see that all nodes are added to allowed_nodes and preferred_nodes. When the cluster size increases and no topology information is supplied, every node ends up captured in these sections. With thousands of volumes in such a large cluster, this can noticeably increase etcd disk usage and overall latency.

/namespace/mayastor/control-plane/VolumeSpec/38098332-3acc-4850-874b-a5315acf3dce
{
  "uuid": "38098332-3acc-4850-874b-a5315acf3dce",
  ....
  "topology": {
    "node": {
      "Explicit": {
        "allowed_nodes": [
          "k8s-agentpool1-40851847-0",
          "k8s-agentpool1-40851847-1",
          "k8s-master-40851847-0",
          "k8s-agentpool1-40851847-2"
        ],
        "preferred_nodes": [
          "k8s-agentpool1-40851847-2",
          "k8s-master-40851847-0",
          "k8s-agentpool1-40851847-0",
          "k8s-agentpool1-40851847-1"
        ]
      }
    }...
}
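
For illustration only, a minimal Rust sketch of why the spec grows with cluster size; the ExplicitNodeTopology type and the function name here are assumptions modeled on the JSON above, not the actual control-plane code:

// Sketch only: hypothetical types modeled on the JSON above; the real
// control-plane definitions may differ.
struct ExplicitNodeTopology {
    allowed_nodes: Vec<String>,
    preferred_nodes: Vec<String>,
}

// Without any topology constraint on the StorageClass/PVC, every cluster
// node is copied into both lists, so each VolumeSpec carries roughly
// 2 * N node names; with thousands of volumes on a large cluster this
// multiplies into a noticeable amount of data stored in etcd.
fn explicit_topology_for_all_nodes(cluster_nodes: &[String]) -> ExplicitNodeTopology {
    ExplicitNodeTopology {
        allowed_nodes: cluster_nodes.to_vec(),
        preferred_nodes: cluster_nodes.to_vec(),
    }
}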

To Reproduce
Steps to reproduce the behavior: create a multi-node cluster, provision a volume without any topology constraints in the StorageClass/PVC, and inspect the resulting VolumeSpec in etcd (as in the sample above).

Expected behavior
Should we really capture all the nodes, or only capture the nodes where the replicas are present?


OS info:

  • MayaStor revision or container image: develop


@tiagolobocastro
Contributor

@hasethuraman I'm not sure why we're doing this; it seems we're conflating accessibility for the application with data placement, unless I'm misunderstanding.
I can't think of a reason to keep it, so it's probably safe to omit these nodes until we have such a need.

@hasethuraman
Author

Thanks @tiagolobocastro. Please let me know when you have any update on the fix and its timeline.

I may well be wrong with this suggestion: instead of omitting the nodes completely, I think only the necessary nodes (the ones hosting replicas) could be kept, and the rest of the nodes in that array omitted. This information may be helpful for an admin querying etcd/Mayastor to find the location of the replicas. Since I am not familiar with Mayastor, if my suggestion is wrong or doesn't add any value here, please ignore it.
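
A minimal Rust sketch of that idea, with a hypothetical helper and inputs rather than actual Mayastor code: the stored lists would simply be filtered down to the nodes that actually host replicas.

// Sketch only: keep just the nodes that actually host replicas, so an admin
// querying etcd can still see replica placement without storing every node.
// `replica_nodes` is a hypothetical input listing where replicas were placed.
fn trim_to_replica_nodes(all_nodes: &[String], replica_nodes: &[String]) -> Vec<String> {
    all_nodes
        .iter()
        .filter(|n| replica_nodes.contains(n))
        .cloned()
        .collect()
}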

@hasethuraman
Author

I think this would be a better way:

If topology information (the user's intent) is sent to the CSI driver through the StorageClass or PVC, Mayastor's VolumeSpec can have a new topology field to capture that topology info (which can be consumed in the future, for example after a cluster restart or a scale-out), avoiding allowed_nodes and preferred_nodes entirely.

If topology information is not provided, then all nodes are eligible to provision the replicas; allowed_nodes and preferred_nodes can still be [] and the topology key will be nil.
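
As a rough Rust sketch of this proposal (the type and field names are assumptions, not the actual VolumeSpec definition), the stored topology could be optional and only populated when the user supplies constraints:

// Sketch only, assuming hypothetical types: the user's topology request
// (from the StorageClass/PVC) is stored only when it was actually provided;
// otherwise the field is None and no per-node lists are persisted at all.
struct VolumeSpecSketch {
    uuid: String,
    // None => no constraint, every node is eligible for replica placement.
    topology: Option<RequestedTopology>,
}

struct RequestedTopology {
    // Labels/keys as requested by the user, e.g. via StorageClass parameters.
    node_selector: std::collections::HashMap<String, String>,
}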
