Upgrade Machinelearningservices to 2021-04-01 #14666

Closed
gro1m opened this issue May 13, 2021 · 5 comments
Labels
ADO
customer-reported: Issues that are reported by GitHub users external to the Azure organization.
feature-request: This issue requires a new behavior in the product in order be resolved.
Machine Learning
Service Attention: Workflow: This issue is responsible by Azure service team.
Service: This issue points to a problem in the service.

@gro1m
gro1m commented May 13, 2021

Feature Request

The REST API is on version 2021-04-01 for machinelearningservices: https://github.com/Azure/azure-rest-api-specs/tree/master/specification/machinelearningservices/resource-manager/Microsoft.MachineLearningServices/stable/2021-04-01.
The Go SDK is one year behind (2020-04-01): https://pkg.go.dev/github.com/Azure/azure-sdk-for-go/services/machinelearningservices/[email protected]+incompatible.

I want to implement the Terraform Azure provider resources for the Machine Learning compute "suite":

  1. Machine Learning Compute Instance: this compute type is unfortunately unavailable in version 2020-04-01. (See "Support for Azure Machine Learning Compute Instance (new resource)", hashicorp/terraform-provider-azurerm#11255, and "Support for azureml compute", hashicorp/terraform-provider-azurerm#6827.)

  2. Machine Learning Compute Cluster: I cannot get the update operation to work. The PR is "new resource - azurerm_machine_learning_compute_cluster", hashicorp/terraform-provider-azurerm#11675. I looked into it with @favoretti from the Terraform Azure provider team, and the cause seems to be the outdated API version:
    the portal sends a PATCH request payload of the form:

{
  "properties": {
    "properties": {
      "scaleSettings": {
        "minNodeCount": 2,
        "maxNodeCount": 3,
        "nodeIdleTimeBeforeScaleDown": "PT900S"
      }
    }
  }
}

whereas the Go SDK sends:

{
  "properties": {
    "scaleSettings": {
      "maxNodeCount": 2,
      "minNodeCount": 1,
      "nodeIdleTimeBeforeScaleDown": "PT60S"
    }
  }
}

The portal, however, seems to use a newer API version.
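
To make the difference between the two PATCH bodies concrete, here is a minimal Go sketch that builds both shapes from the same scale settings using only encoding/json. The type and function names (ScaleSettings, flatPatch, nestedPatch, FlatBody, NestedBody) are hypothetical and written for this illustration; they are not the types the Azure SDK for Go exposes, and the sketch makes no claim about which shape the service accepts.

```go
package main

import (
	"encoding/json"
	"fmt"
	"log"
)

// ScaleSettings mirrors the three fields visible in both payloads above.
type ScaleSettings struct {
	MinNodeCount                int    `json:"minNodeCount"`
	MaxNodeCount                int    `json:"maxNodeCount"`
	NodeIdleTimeBeforeScaleDown string `json:"nodeIdleTimeBeforeScaleDown"`
}

// flatPatch is a hypothetical type reproducing the shape the Go SDK sends:
// scaleSettings sits directly under a single "properties" object.
type flatPatch struct {
	Properties struct {
		ScaleSettings ScaleSettings `json:"scaleSettings"`
	} `json:"properties"`
}

// nestedPatch is a hypothetical type reproducing the shape the portal sends:
// scaleSettings sits under "properties" nested inside "properties".
type nestedPatch struct {
	Properties struct {
		Properties struct {
			ScaleSettings ScaleSettings `json:"scaleSettings"`
		} `json:"properties"`
	} `json:"properties"`
}

// FlatBody serializes the settings in the single-level shape.
func FlatBody(s ScaleSettings) (string, error) {
	var p flatPatch
	p.Properties.ScaleSettings = s
	b, err := json.Marshal(p)
	return string(b), err
}

// NestedBody serializes the settings in the double-nested shape.
func NestedBody(s ScaleSettings) (string, error) {
	var p nestedPatch
	p.Properties.Properties.ScaleSettings = s
	b, err := json.Marshal(p)
	return string(b), err
}

func main() {
	s := ScaleSettings{MinNodeCount: 2, MaxNodeCount: 3, NodeIdleTimeBeforeScaleDown: "PT900S"}

	flat, err := FlatBody(s)
	if err != nil {
		log.Fatal(err)
	}
	nested, err := NestedBody(s)
	if err != nil {
		log.Fatal(err)
	}

	fmt.Println("flat:  ", flat)
	fmt.Println("nested:", nested)
}
```

Printing both bodies side by side shows that the only difference is the extra "properties" level, which is the part that would have to change in the request model.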

  3. Machine Learning Inference Cluster: I am not sure whether this one is affected by the 2020-04-01 version. The PR is "new resource azurerm_machine_learning_inference_cluster", hashicorp/terraform-provider-azurerm#11550.

The following part of https://github.com/Azure/azure-rest-api-specs/blob/master/specification/machinelearningservices/resource-manager/Microsoft.MachineLearningServices/stable/2021-04-01/machineLearningServices.json#L713 best describes the Get operation I need (along with, of course, all the other operations needed to implement CRUD):

        "operationId": "MachineLearningCompute_Get",
        "description": "Gets compute definition by its name. Any secrets (storage keys, service credentials, etc) are not returned - use 'keys' nested resource to get them.",
        "x-ms-examples": {
          "Get a AKS Compute": {
            "$ref": "./examples/getAKSCompute.json"
          },
          "Get a AML Compute": {
            "$ref": "./examples/getAmlCompute.json"
          },
          "Get an ComputeInstance": {
            "$ref": "./examples/getComputeInstance.json"
          }
        },
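
Since the single Get operation above returns AKS, AML Compute, and Compute Instance resources, a client has to tell them apart after decoding. The Go sketch below shows one way to do that. It rests on assumptions that are not stated in this issue: that the response carries a "computeType" discriminator under "properties", and that its values are "AKS", "AmlCompute", and "ComputeInstance". The names computeEnvelope and ComputeKind are hypothetical and are not part of the Azure SDK for Go.

```go
package main

import (
	"encoding/json"
	"fmt"
	"log"
)

// computeEnvelope is a hypothetical type that reads only the discriminator.
// The "computeType" field under "properties" is an assumption about the
// response shape, not something confirmed in this issue.
type computeEnvelope struct {
	Properties struct {
		ComputeType string `json:"computeType"`
	} `json:"properties"`
}

// ComputeKind returns the compute kind found in a Get response body,
// or an error for malformed JSON or a kind this sketch does not handle.
func ComputeKind(body []byte) (string, error) {
	var env computeEnvelope
	if err := json.Unmarshal(body, &env); err != nil {
		return "", fmt.Errorf("decoding compute response: %w", err)
	}
	switch env.Properties.ComputeType {
	case "AKS", "AmlCompute", "ComputeInstance": // assumed discriminator values
		return env.Properties.ComputeType, nil
	default:
		return "", fmt.Errorf("unsupported computeType %q", env.Properties.ComputeType)
	}
}

func main() {
	// Hand-written sample body, not copied from the examples referenced above.
	body := []byte(`{"properties": {"computeType": "ComputeInstance"}}`)

	kind, err := ComputeKind(body)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println("compute kind:", kind)
}
```

A provider resource could branch on the returned kind before decoding the type-specific properties, and reject a kind it does not manage rather than guessing.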

I would appreciate a fairly quick implementation (rough expectation: 1 week), and I am a bit irritated that the SDK has not kept up with the API versions. If there is a way I can contribute so that this moves quicker, I will be happy to do so :)

@ghost ghost added needs-triage Workflow: This is a new issue that needs to be triaged to the appropriate team. customer-reported Issues that are reported by GitHub users external to the Azure organization. question The issue doesn't require a change to the product in order to be resolved. Most issues start as that labels May 13, 2021
@favoretti
@ArcturusZhang would this be a trivial fix or?..

@ArcturusZhang ArcturusZhang added feature-request This issue requires a new behavior in the product in order be resolved. Machine Learning Service Attention Workflow: This issue is responsible by Azure service team. labels May 18, 2021
@ghost ghost removed the needs-triage Workflow: This is a new issue that needs to be triaged to the appropriate team. label May 18, 2021
@ghost
ghost commented May 18, 2021

Thanks for the feedback! We are routing this to the appropriate team for follow-up. cc @azureml-github.

@ArcturusZhang ArcturusZhang removed the question The issue doesn't require a change to the product in order to be resolved. Most issues start as that label May 18, 2021
@ArcturusZhang
Member

Hi @gro1m @favoretti thanks for opening this issue.

The SDK team cannot release an api-version without a release request from the corresponding service team, so I tagged this issue to notify that team.

@ArcturusZhang
Member

Hey @gro1m and @favoretti, I just found that this problem comes from the swagger itself, so the new api-version will not solve it. More details can be found in issue #14669.

@RickWinter RickWinter added this to the Backlog milestone Jul 12, 2021
@jsquire jsquire removed the ML-MLOps label Aug 2, 2021
@RickWinter RickWinter added the Service This issue points to a problem in the service. label Sep 13, 2021
Hi @gro1m, we deeply appreciate your input into this project. Regrettably, this issue has remained inactive for over 2 years, leading us to the decision to close it. We've implemented this policy to maintain the relevance of our issue queue and facilitate easier navigation for new contributors. If you still believe this topic requires attention, please feel free to create a new issue, referencing this one. Thank you for your understanding and ongoing support.

@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Mar 13, 2024
@github-actions github-actions bot locked and limited conversation to collaborators Mar 13, 2024
6 participants