1. Packages
  2. Azure Native
  3. API Docs
  4. machinelearningservices
  5. getOnlineDeployment
This is the latest version of Azure Native. Use the Azure Native v1 docs if using the v1 version of this package.
Azure Native v2.73.0 published on Wednesday, Nov 20, 2024 by Pulumi

azure-native.machinelearningservices.getOnlineDeployment

Explore with Pulumi AI

azure-native logo
This is the latest version of Azure Native. Use the Azure Native v1 docs if using the v1 version of this package.
Azure Native v2.73.0 published on Wednesday, Nov 20, 2024 by Pulumi

    Azure REST API version: 2023-04-01.

    Other available API versions: 2021-03-01-preview, 2022-02-01-preview, 2023-04-01-preview, 2023-06-01-preview, 2023-08-01-preview, 2023-10-01, 2024-01-01-preview, 2024-04-01, 2024-04-01-preview, 2024-07-01-preview, 2024-10-01, 2024-10-01-preview.

    Using getOnlineDeployment

    Two invocation forms are available. The direct form accepts plain arguments and either blocks until the result value is available, or returns a Promise-wrapped result. The output form accepts Input-wrapped arguments and returns an Output-wrapped result.

    function getOnlineDeployment(args: GetOnlineDeploymentArgs, opts?: InvokeOptions): Promise<GetOnlineDeploymentResult>
    function getOnlineDeploymentOutput(args: GetOnlineDeploymentOutputArgs, opts?: InvokeOptions): Output<GetOnlineDeploymentResult>
    def get_online_deployment(deployment_name: Optional[str] = None,
                              endpoint_name: Optional[str] = None,
                              resource_group_name: Optional[str] = None,
                              workspace_name: Optional[str] = None,
                              opts: Optional[InvokeOptions] = None) -> GetOnlineDeploymentResult
    def get_online_deployment_output(deployment_name: Optional[pulumi.Input[str]] = None,
                              endpoint_name: Optional[pulumi.Input[str]] = None,
                              resource_group_name: Optional[pulumi.Input[str]] = None,
                              workspace_name: Optional[pulumi.Input[str]] = None,
                              opts: Optional[InvokeOptions] = None) -> Output[GetOnlineDeploymentResult]
    func LookupOnlineDeployment(ctx *Context, args *LookupOnlineDeploymentArgs, opts ...InvokeOption) (*LookupOnlineDeploymentResult, error)
    func LookupOnlineDeploymentOutput(ctx *Context, args *LookupOnlineDeploymentOutputArgs, opts ...InvokeOption) LookupOnlineDeploymentResultOutput

    > Note: This function is named LookupOnlineDeployment in the Go SDK.

    public static class GetOnlineDeployment 
    {
        public static Task<GetOnlineDeploymentResult> InvokeAsync(GetOnlineDeploymentArgs args, InvokeOptions? opts = null)
        public static Output<GetOnlineDeploymentResult> Invoke(GetOnlineDeploymentInvokeArgs args, InvokeOptions? opts = null)
    }
    public static CompletableFuture<GetOnlineDeploymentResult> getOnlineDeployment(GetOnlineDeploymentArgs args, InvokeOptions options)
    // Output-based functions aren't available in Java yet
    
    fn::invoke:
      function: azure-native:machinelearningservices:getOnlineDeployment
      arguments:
        # arguments dictionary

    The following arguments are supported:

    DeploymentName string
    Inference Endpoint Deployment name.
    EndpointName string
    Inference endpoint name.
    ResourceGroupName string
    The name of the resource group. The name is case insensitive.
    WorkspaceName string
    Name of Azure Machine Learning workspace.
    DeploymentName string
    Inference Endpoint Deployment name.
    EndpointName string
    Inference endpoint name.
    ResourceGroupName string
    The name of the resource group. The name is case insensitive.
    WorkspaceName string
    Name of Azure Machine Learning workspace.
    deploymentName String
    Inference Endpoint Deployment name.
    endpointName String
    Inference endpoint name.
    resourceGroupName String
    The name of the resource group. The name is case insensitive.
    workspaceName String
    Name of Azure Machine Learning workspace.
    deploymentName string
    Inference Endpoint Deployment name.
    endpointName string
    Inference endpoint name.
    resourceGroupName string
    The name of the resource group. The name is case insensitive.
    workspaceName string
    Name of Azure Machine Learning workspace.
    deployment_name str
    Inference Endpoint Deployment name.
    endpoint_name str
    Inference endpoint name.
    resource_group_name str
    The name of the resource group. The name is case insensitive.
    workspace_name str
    Name of Azure Machine Learning workspace.
    deploymentName String
    Inference Endpoint Deployment name.
    endpointName String
    Inference endpoint name.
    resourceGroupName String
    The name of the resource group. The name is case insensitive.
    workspaceName String
    Name of Azure Machine Learning workspace.

    getOnlineDeployment Result

    The following output properties are available:

    Id string
    Fully qualified resource ID for the resource. Ex - /subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/{resourceProviderNamespace}/{resourceType}/{resourceName}
    Location string
    The geo-location where the resource lives
    Name string
    The name of the resource
    OnlineDeploymentProperties Pulumi.AzureNative.MachineLearningServices.Outputs.KubernetesOnlineDeploymentResponse | Pulumi.AzureNative.MachineLearningServices.Outputs.ManagedOnlineDeploymentResponse
    [Required] Additional attributes of the entity.
    SystemData Pulumi.AzureNative.MachineLearningServices.Outputs.SystemDataResponse
    Azure Resource Manager metadata containing createdBy and modifiedBy information.
    Type string
    The type of the resource. E.g. "Microsoft.Compute/virtualMachines" or "Microsoft.Storage/storageAccounts"
    Identity Pulumi.AzureNative.MachineLearningServices.Outputs.ManagedServiceIdentityResponse
    Managed service identity (system assigned and/or user assigned identities)
    Kind string
    Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type.
    Sku Pulumi.AzureNative.MachineLearningServices.Outputs.SkuResponse
    Sku details required for ARM contract for Autoscaling.
    Tags Dictionary<string, string>
    Resource tags.
    Id string
    Fully qualified resource ID for the resource. Ex - /subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/{resourceProviderNamespace}/{resourceType}/{resourceName}
    Location string
    The geo-location where the resource lives
    Name string
    The name of the resource
    OnlineDeploymentProperties KubernetesOnlineDeploymentResponse | ManagedOnlineDeploymentResponse
    [Required] Additional attributes of the entity.
    SystemData SystemDataResponse
    Azure Resource Manager metadata containing createdBy and modifiedBy information.
    Type string
    The type of the resource. E.g. "Microsoft.Compute/virtualMachines" or "Microsoft.Storage/storageAccounts"
    Identity ManagedServiceIdentityResponse
    Managed service identity (system assigned and/or user assigned identities)
    Kind string
    Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type.
    Sku SkuResponse
    Sku details required for ARM contract for Autoscaling.
    Tags map[string]string
    Resource tags.
    id String
    Fully qualified resource ID for the resource. Ex - /subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/{resourceProviderNamespace}/{resourceType}/{resourceName}
    location String
    The geo-location where the resource lives
    name String
    The name of the resource
    onlineDeploymentProperties KubernetesOnlineDeploymentResponse | ManagedOnlineDeploymentResponse
    [Required] Additional attributes of the entity.
    systemData SystemDataResponse
    Azure Resource Manager metadata containing createdBy and modifiedBy information.
    type String
    The type of the resource. E.g. "Microsoft.Compute/virtualMachines" or "Microsoft.Storage/storageAccounts"
    identity ManagedServiceIdentityResponse
    Managed service identity (system assigned and/or user assigned identities)
    kind String
    Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type.
    sku SkuResponse
    Sku details required for ARM contract for Autoscaling.
    tags Map<String,String>
    Resource tags.
    id string
    Fully qualified resource ID for the resource. Ex - /subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/{resourceProviderNamespace}/{resourceType}/{resourceName}
    location string
    The geo-location where the resource lives
    name string
    The name of the resource
    onlineDeploymentProperties KubernetesOnlineDeploymentResponse | ManagedOnlineDeploymentResponse
    [Required] Additional attributes of the entity.
    systemData SystemDataResponse
    Azure Resource Manager metadata containing createdBy and modifiedBy information.
    type string
    The type of the resource. E.g. "Microsoft.Compute/virtualMachines" or "Microsoft.Storage/storageAccounts"
    identity ManagedServiceIdentityResponse
    Managed service identity (system assigned and/or user assigned identities)
    kind string
    Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type.
    sku SkuResponse
    Sku details required for ARM contract for Autoscaling.
    tags {[key: string]: string}
    Resource tags.
    id str
    Fully qualified resource ID for the resource. Ex - /subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/{resourceProviderNamespace}/{resourceType}/{resourceName}
    location str
    The geo-location where the resource lives
    name str
    The name of the resource
    online_deployment_properties KubernetesOnlineDeploymentResponse | ManagedOnlineDeploymentResponse
    [Required] Additional attributes of the entity.
    system_data SystemDataResponse
    Azure Resource Manager metadata containing createdBy and modifiedBy information.
    type str
    The type of the resource. E.g. "Microsoft.Compute/virtualMachines" or "Microsoft.Storage/storageAccounts"
    identity ManagedServiceIdentityResponse
    Managed service identity (system assigned and/or user assigned identities)
    kind str
    Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type.
    sku SkuResponse
    Sku details required for ARM contract for Autoscaling.
    tags Mapping[str, str]
    Resource tags.
    id String
    Fully qualified resource ID for the resource. Ex - /subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/{resourceProviderNamespace}/{resourceType}/{resourceName}
    location String
    The geo-location where the resource lives
    name String
    The name of the resource
    onlineDeploymentProperties Property Map | Property Map
    [Required] Additional attributes of the entity.
    systemData Property Map
    Azure Resource Manager metadata containing createdBy and modifiedBy information.
    type String
    The type of the resource. E.g. "Microsoft.Compute/virtualMachines" or "Microsoft.Storage/storageAccounts"
    identity Property Map
    Managed service identity (system assigned and/or user assigned identities)
    kind String
    Metadata used by portal/tooling/etc to render different UX experiences for resources of the same type.
    sku Property Map
    Sku details required for ARM contract for Autoscaling.
    tags Map<String>
    Resource tags.

    Supporting Types

    CodeConfigurationResponse

    ScoringScript string
    [Required] The script to execute on startup. eg. "score.py"
    CodeId string
    ARM resource ID of the code asset.
    ScoringScript string
    [Required] The script to execute on startup. eg. "score.py"
    CodeId string
    ARM resource ID of the code asset.
    scoringScript String
    [Required] The script to execute on startup. eg. "score.py"
    codeId String
    ARM resource ID of the code asset.
    scoringScript string
    [Required] The script to execute on startup. eg. "score.py"
    codeId string
    ARM resource ID of the code asset.
    scoring_script str
    [Required] The script to execute on startup. eg. "score.py"
    code_id str
    ARM resource ID of the code asset.
    scoringScript String
    [Required] The script to execute on startup. eg. "score.py"
    codeId String
    ARM resource ID of the code asset.

    ContainerResourceRequirementsResponse

    containerResourceLimits Property Map
    Container resource limit info:
    containerResourceRequests Property Map
    Container resource request info:

    ContainerResourceSettingsResponse

    Cpu string
    Number of vCPUs request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    Gpu string
    Number of Nvidia GPU cards request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    Memory string
    Memory size request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    Cpu string
    Number of vCPUs request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    Gpu string
    Number of Nvidia GPU cards request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    Memory string
    Memory size request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    cpu String
    Number of vCPUs request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    gpu String
    Number of Nvidia GPU cards request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    memory String
    Memory size request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    cpu string
    Number of vCPUs request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    gpu string
    Number of Nvidia GPU cards request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    memory string
    Memory size request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    cpu str
    Number of vCPUs request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    gpu str
    Number of Nvidia GPU cards request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    memory str
    Memory size request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    cpu String
    Number of vCPUs request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    gpu String
    Number of Nvidia GPU cards request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/
    memory String
    Memory size request/limit for container. More info: https://kubernetes.io/docs/concepts/configuration/manage-compute-resources-container/

    DefaultScaleSettingsResponse

    KubernetesOnlineDeploymentResponse

    ProvisioningState string
    Provisioning state for the endpoint deployment.
    AppInsightsEnabled bool
    If true, enables Application Insights logging.
    CodeConfiguration Pulumi.AzureNative.MachineLearningServices.Inputs.CodeConfigurationResponse
    Code configuration for the endpoint deployment.
    ContainerResourceRequirements Pulumi.AzureNative.MachineLearningServices.Inputs.ContainerResourceRequirementsResponse
    The resource requirements for the container (cpu and memory).
    Description string
    Description of the endpoint deployment.
    EgressPublicNetworkAccess string
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    EnvironmentId string
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    EnvironmentVariables Dictionary<string, string>
    Environment variables configuration for the deployment.
    InstanceType string
    Compute instance type.
    LivenessProbe Pulumi.AzureNative.MachineLearningServices.Inputs.ProbeSettingsResponse
    Liveness probe monitors the health of the container regularly.
    Model string
    The URI path to the model.
    ModelMountPath string
    The path to mount the model in custom container.
    Properties Dictionary<string, string>
    Property dictionary. Properties can be added, but not removed or altered.
    ReadinessProbe Pulumi.AzureNative.MachineLearningServices.Inputs.ProbeSettingsResponse
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    RequestSettings Pulumi.AzureNative.MachineLearningServices.Inputs.OnlineRequestSettingsResponse
    Request settings for the deployment.
    ScaleSettings Pulumi.AzureNative.MachineLearningServices.Inputs.DefaultScaleSettingsResponse | Pulumi.AzureNative.MachineLearningServices.Inputs.TargetUtilizationScaleSettingsResponse
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.
    ProvisioningState string
    Provisioning state for the endpoint deployment.
    AppInsightsEnabled bool
    If true, enables Application Insights logging.
    CodeConfiguration CodeConfigurationResponse
    Code configuration for the endpoint deployment.
    ContainerResourceRequirements ContainerResourceRequirementsResponse
    The resource requirements for the container (cpu and memory).
    Description string
    Description of the endpoint deployment.
    EgressPublicNetworkAccess string
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    EnvironmentId string
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    EnvironmentVariables map[string]string
    Environment variables configuration for the deployment.
    InstanceType string
    Compute instance type.
    LivenessProbe ProbeSettingsResponse
    Liveness probe monitors the health of the container regularly.
    Model string
    The URI path to the model.
    ModelMountPath string
    The path to mount the model in custom container.
    Properties map[string]string
    Property dictionary. Properties can be added, but not removed or altered.
    ReadinessProbe ProbeSettingsResponse
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    RequestSettings OnlineRequestSettingsResponse
    Request settings for the deployment.
    ScaleSettings DefaultScaleSettingsResponse | TargetUtilizationScaleSettingsResponse
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.
    provisioningState String
    Provisioning state for the endpoint deployment.
    appInsightsEnabled Boolean
    If true, enables Application Insights logging.
    codeConfiguration CodeConfigurationResponse
    Code configuration for the endpoint deployment.
    containerResourceRequirements ContainerResourceRequirementsResponse
    The resource requirements for the container (cpu and memory).
    description String
    Description of the endpoint deployment.
    egressPublicNetworkAccess String
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    environmentId String
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    environmentVariables Map<String,String>
    Environment variables configuration for the deployment.
    instanceType String
    Compute instance type.
    livenessProbe ProbeSettingsResponse
    Liveness probe monitors the health of the container regularly.
    model String
    The URI path to the model.
    modelMountPath String
    The path to mount the model in custom container.
    properties Map<String,String>
    Property dictionary. Properties can be added, but not removed or altered.
    readinessProbe ProbeSettingsResponse
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    requestSettings OnlineRequestSettingsResponse
    Request settings for the deployment.
    scaleSettings DefaultScaleSettingsResponse | TargetUtilizationScaleSettingsResponse
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.
    provisioningState string
    Provisioning state for the endpoint deployment.
    appInsightsEnabled boolean
    If true, enables Application Insights logging.
    codeConfiguration CodeConfigurationResponse
    Code configuration for the endpoint deployment.
    containerResourceRequirements ContainerResourceRequirementsResponse
    The resource requirements for the container (cpu and memory).
    description string
    Description of the endpoint deployment.
    egressPublicNetworkAccess string
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    environmentId string
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    environmentVariables {[key: string]: string}
    Environment variables configuration for the deployment.
    instanceType string
    Compute instance type.
    livenessProbe ProbeSettingsResponse
    Liveness probe monitors the health of the container regularly.
    model string
    The URI path to the model.
    modelMountPath string
    The path to mount the model in custom container.
    properties {[key: string]: string}
    Property dictionary. Properties can be added, but not removed or altered.
    readinessProbe ProbeSettingsResponse
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    requestSettings OnlineRequestSettingsResponse
    Request settings for the deployment.
    scaleSettings DefaultScaleSettingsResponse | TargetUtilizationScaleSettingsResponse
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.
    provisioning_state str
    Provisioning state for the endpoint deployment.
    app_insights_enabled bool
    If true, enables Application Insights logging.
    code_configuration CodeConfigurationResponse
    Code configuration for the endpoint deployment.
    container_resource_requirements ContainerResourceRequirementsResponse
    The resource requirements for the container (cpu and memory).
    description str
    Description of the endpoint deployment.
    egress_public_network_access str
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    environment_id str
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    environment_variables Mapping[str, str]
    Environment variables configuration for the deployment.
    instance_type str
    Compute instance type.
    liveness_probe ProbeSettingsResponse
    Liveness probe monitors the health of the container regularly.
    model str
    The URI path to the model.
    model_mount_path str
    The path to mount the model in custom container.
    properties Mapping[str, str]
    Property dictionary. Properties can be added, but not removed or altered.
    readiness_probe ProbeSettingsResponse
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    request_settings OnlineRequestSettingsResponse
    Request settings for the deployment.
    scale_settings DefaultScaleSettingsResponse | TargetUtilizationScaleSettingsResponse
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.
    provisioningState String
    Provisioning state for the endpoint deployment.
    appInsightsEnabled Boolean
    If true, enables Application Insights logging.
    codeConfiguration Property Map
    Code configuration for the endpoint deployment.
    containerResourceRequirements Property Map
    The resource requirements for the container (cpu and memory).
    description String
    Description of the endpoint deployment.
    egressPublicNetworkAccess String
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    environmentId String
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    environmentVariables Map<String>
    Environment variables configuration for the deployment.
    instanceType String
    Compute instance type.
    livenessProbe Property Map
    Liveness probe monitors the health of the container regularly.
    model String
    The URI path to the model.
    modelMountPath String
    The path to mount the model in custom container.
    properties Map<String>
    Property dictionary. Properties can be added, but not removed or altered.
    readinessProbe Property Map
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    requestSettings Property Map
    Request settings for the deployment.
    scaleSettings Property Map | Property Map
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.

    ManagedOnlineDeploymentResponse

    ProvisioningState string
    Provisioning state for the endpoint deployment.
    AppInsightsEnabled bool
    If true, enables Application Insights logging.
    CodeConfiguration Pulumi.AzureNative.MachineLearningServices.Inputs.CodeConfigurationResponse
    Code configuration for the endpoint deployment.
    Description string
    Description of the endpoint deployment.
    EgressPublicNetworkAccess string
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    EnvironmentId string
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    EnvironmentVariables Dictionary<string, string>
    Environment variables configuration for the deployment.
    InstanceType string
    Compute instance type.
    LivenessProbe Pulumi.AzureNative.MachineLearningServices.Inputs.ProbeSettingsResponse
    Liveness probe monitors the health of the container regularly.
    Model string
    The URI path to the model.
    ModelMountPath string
    The path to mount the model in custom container.
    Properties Dictionary<string, string>
    Property dictionary. Properties can be added, but not removed or altered.
    ReadinessProbe Pulumi.AzureNative.MachineLearningServices.Inputs.ProbeSettingsResponse
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    RequestSettings Pulumi.AzureNative.MachineLearningServices.Inputs.OnlineRequestSettingsResponse
    Request settings for the deployment.
    ScaleSettings Pulumi.AzureNative.MachineLearningServices.Inputs.DefaultScaleSettingsResponse | Pulumi.AzureNative.MachineLearningServices.Inputs.TargetUtilizationScaleSettingsResponse
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.
    ProvisioningState string
    Provisioning state for the endpoint deployment.
    AppInsightsEnabled bool
    If true, enables Application Insights logging.
    CodeConfiguration CodeConfigurationResponse
    Code configuration for the endpoint deployment.
    Description string
    Description of the endpoint deployment.
    EgressPublicNetworkAccess string
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    EnvironmentId string
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    EnvironmentVariables map[string]string
    Environment variables configuration for the deployment.
    InstanceType string
    Compute instance type.
    LivenessProbe ProbeSettingsResponse
    Liveness probe monitors the health of the container regularly.
    Model string
    The URI path to the model.
    ModelMountPath string
    The path to mount the model in custom container.
    Properties map[string]string
    Property dictionary. Properties can be added, but not removed or altered.
    ReadinessProbe ProbeSettingsResponse
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    RequestSettings OnlineRequestSettingsResponse
    Request settings for the deployment.
    ScaleSettings DefaultScaleSettingsResponse | TargetUtilizationScaleSettingsResponse
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.
    provisioningState String
    Provisioning state for the endpoint deployment.
    appInsightsEnabled Boolean
    If true, enables Application Insights logging.
    codeConfiguration CodeConfigurationResponse
    Code configuration for the endpoint deployment.
    description String
    Description of the endpoint deployment.
    egressPublicNetworkAccess String
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    environmentId String
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    environmentVariables Map<String,String>
    Environment variables configuration for the deployment.
    instanceType String
    Compute instance type.
    livenessProbe ProbeSettingsResponse
    Liveness probe monitors the health of the container regularly.
    model String
    The URI path to the model.
    modelMountPath String
    The path to mount the model in custom container.
    properties Map<String,String>
    Property dictionary. Properties can be added, but not removed or altered.
    readinessProbe ProbeSettingsResponse
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    requestSettings OnlineRequestSettingsResponse
    Request settings for the deployment.
    scaleSettings DefaultScaleSettingsResponse | TargetUtilizationScaleSettingsResponse
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.
    provisioningState string
    Provisioning state for the endpoint deployment.
    appInsightsEnabled boolean
    If true, enables Application Insights logging.
    codeConfiguration CodeConfigurationResponse
    Code configuration for the endpoint deployment.
    description string
    Description of the endpoint deployment.
    egressPublicNetworkAccess string
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    environmentId string
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    environmentVariables {[key: string]: string}
    Environment variables configuration for the deployment.
    instanceType string
    Compute instance type.
    livenessProbe ProbeSettingsResponse
    Liveness probe monitors the health of the container regularly.
    model string
    The URI path to the model.
    modelMountPath string
    The path to mount the model in custom container.
    properties {[key: string]: string}
    Property dictionary. Properties can be added, but not removed or altered.
    readinessProbe ProbeSettingsResponse
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    requestSettings OnlineRequestSettingsResponse
    Request settings for the deployment.
    scaleSettings DefaultScaleSettingsResponse | TargetUtilizationScaleSettingsResponse
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.
    provisioning_state str
    Provisioning state for the endpoint deployment.
    app_insights_enabled bool
    If true, enables Application Insights logging.
    code_configuration CodeConfigurationResponse
    Code configuration for the endpoint deployment.
    description str
    Description of the endpoint deployment.
    egress_public_network_access str
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    environment_id str
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    environment_variables Mapping[str, str]
    Environment variables configuration for the deployment.
    instance_type str
    Compute instance type.
    liveness_probe ProbeSettingsResponse
    Liveness probe monitors the health of the container regularly.
    model str
    The URI path to the model.
    model_mount_path str
    The path to mount the model in custom container.
    properties Mapping[str, str]
    Property dictionary. Properties can be added, but not removed or altered.
    readiness_probe ProbeSettingsResponse
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    request_settings OnlineRequestSettingsResponse
    Request settings for the deployment.
    scale_settings DefaultScaleSettingsResponse | TargetUtilizationScaleSettingsResponse
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.
    provisioningState String
    Provisioning state for the endpoint deployment.
    appInsightsEnabled Boolean
    If true, enables Application Insights logging.
    codeConfiguration Property Map
    Code configuration for the endpoint deployment.
    description String
    Description of the endpoint deployment.
    egressPublicNetworkAccess String
    If Enabled, allow egress public network access. If Disabled, this will create secure egress. Default: Enabled.
    environmentId String
    ARM resource ID or AssetId of the environment specification for the endpoint deployment.
    environmentVariables Map<String>
    Environment variables configuration for the deployment.
    instanceType String
    Compute instance type.
    livenessProbe Property Map
    Liveness probe monitors the health of the container regularly.
    model String
    The URI path to the model.
    modelMountPath String
    The path to mount the model in custom container.
    properties Map<String>
    Property dictionary. Properties can be added, but not removed or altered.
    readinessProbe Property Map
    Readiness probe validates if the container is ready to serve traffic. The properties and defaults are the same as liveness probe.
    requestSettings Property Map
    Request settings for the deployment.
    scaleSettings Property Map | Property Map
    Scale settings for the deployment. If it is null or not provided, it defaults to TargetUtilizationScaleSettings for KubernetesOnlineDeployment and to DefaultScaleSettings for ManagedOnlineDeployment.

    ManagedServiceIdentityResponse

    PrincipalId string
    The service principal ID of the system assigned identity. This property will only be provided for a system assigned identity.
    TenantId string
    The tenant ID of the system assigned identity. This property will only be provided for a system assigned identity.
    Type string
    Type of managed service identity (where both SystemAssigned and UserAssigned types are allowed).
    UserAssignedIdentities Dictionary<string, Pulumi.AzureNative.MachineLearningServices.Inputs.UserAssignedIdentityResponse>
    The set of user assigned identities associated with the resource. The userAssignedIdentities dictionary keys will be ARM resource ids in the form: '/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.ManagedIdentity/userAssignedIdentities/{identityName}. The dictionary values can be empty objects ({}) in requests.
    PrincipalId string
    The service principal ID of the system assigned identity. This property will only be provided for a system assigned identity.
    TenantId string
    The tenant ID of the system assigned identity. This property will only be provided for a system assigned identity.
    Type string
    Type of managed service identity (where both SystemAssigned and UserAssigned types are allowed).
    UserAssignedIdentities map[string]UserAssignedIdentityResponse
    The set of user assigned identities associated with the resource. The userAssignedIdentities dictionary keys will be ARM resource ids in the form: '/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.ManagedIdentity/userAssignedIdentities/{identityName}. The dictionary values can be empty objects ({}) in requests.
    principalId String
    The service principal ID of the system assigned identity. This property will only be provided for a system assigned identity.
    tenantId String
    The tenant ID of the system assigned identity. This property will only be provided for a system assigned identity.
    type String
    Type of managed service identity (where both SystemAssigned and UserAssigned types are allowed).
    userAssignedIdentities Map<String,UserAssignedIdentityResponse>
    The set of user assigned identities associated with the resource. The userAssignedIdentities dictionary keys will be ARM resource ids in the form: '/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.ManagedIdentity/userAssignedIdentities/{identityName}. The dictionary values can be empty objects ({}) in requests.
    principalId string
    The service principal ID of the system assigned identity. This property will only be provided for a system assigned identity.
    tenantId string
    The tenant ID of the system assigned identity. This property will only be provided for a system assigned identity.
    type string
    Type of managed service identity (where both SystemAssigned and UserAssigned types are allowed).
    userAssignedIdentities {[key: string]: UserAssignedIdentityResponse}
    The set of user assigned identities associated with the resource. The userAssignedIdentities dictionary keys will be ARM resource ids in the form: '/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.ManagedIdentity/userAssignedIdentities/{identityName}. The dictionary values can be empty objects ({}) in requests.
    principal_id str
    The service principal ID of the system assigned identity. This property will only be provided for a system assigned identity.
    tenant_id str
    The tenant ID of the system assigned identity. This property will only be provided for a system assigned identity.
    type str
    Type of managed service identity (where both SystemAssigned and UserAssigned types are allowed).
    user_assigned_identities Mapping[str, UserAssignedIdentityResponse]
    The set of user assigned identities associated with the resource. The userAssignedIdentities dictionary keys will be ARM resource ids in the form: '/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.ManagedIdentity/userAssignedIdentities/{identityName}. The dictionary values can be empty objects ({}) in requests.
    principalId String
    The service principal ID of the system assigned identity. This property will only be provided for a system assigned identity.
    tenantId String
    The tenant ID of the system assigned identity. This property will only be provided for a system assigned identity.
    type String
    Type of managed service identity (where both SystemAssigned and UserAssigned types are allowed).
    userAssignedIdentities Map<Property Map>
    The set of user assigned identities associated with the resource. The userAssignedIdentities dictionary keys will be ARM resource ids in the form: '/subscriptions/{subscriptionId}/resourceGroups/{resourceGroupName}/providers/Microsoft.ManagedIdentity/userAssignedIdentities/{identityName}. The dictionary values can be empty objects ({}) in requests.

    OnlineRequestSettingsResponse

    MaxConcurrentRequestsPerInstance int
    The number of maximum concurrent requests per node allowed per deployment. Defaults to 1.
    MaxQueueWait string
    The maximum amount of time a request will stay in the queue in ISO 8601 format. Defaults to 500ms.
    RequestTimeout string
    The scoring timeout in ISO 8601 format. Defaults to 5000ms.
    MaxConcurrentRequestsPerInstance int
    The number of maximum concurrent requests per node allowed per deployment. Defaults to 1.
    MaxQueueWait string
    The maximum amount of time a request will stay in the queue in ISO 8601 format. Defaults to 500ms.
    RequestTimeout string
    The scoring timeout in ISO 8601 format. Defaults to 5000ms.
    maxConcurrentRequestsPerInstance Integer
    The number of maximum concurrent requests per node allowed per deployment. Defaults to 1.
    maxQueueWait String
    The maximum amount of time a request will stay in the queue in ISO 8601 format. Defaults to 500ms.
    requestTimeout String
    The scoring timeout in ISO 8601 format. Defaults to 5000ms.
    maxConcurrentRequestsPerInstance number
    The number of maximum concurrent requests per node allowed per deployment. Defaults to 1.
    maxQueueWait string
    The maximum amount of time a request will stay in the queue in ISO 8601 format. Defaults to 500ms.
    requestTimeout string
    The scoring timeout in ISO 8601 format. Defaults to 5000ms.
    max_concurrent_requests_per_instance int
    The number of maximum concurrent requests per node allowed per deployment. Defaults to 1.
    max_queue_wait str
    The maximum amount of time a request will stay in the queue in ISO 8601 format. Defaults to 500ms.
    request_timeout str
    The scoring timeout in ISO 8601 format. Defaults to 5000ms.
    maxConcurrentRequestsPerInstance Number
    The number of maximum concurrent requests per node allowed per deployment. Defaults to 1.
    maxQueueWait String
    The maximum amount of time a request will stay in the queue in ISO 8601 format. Defaults to 500ms.
    requestTimeout String
    The scoring timeout in ISO 8601 format. Defaults to 5000ms.

    ProbeSettingsResponse

    FailureThreshold int
    The number of failures to allow before returning an unhealthy status.
    InitialDelay string
    The delay before the first probe in ISO 8601 format.
    Period string
    The length of time between probes in ISO 8601 format.
    SuccessThreshold int
    The number of successful probes before returning a healthy status.
    Timeout string
    The probe timeout in ISO 8601 format.
    FailureThreshold int
    The number of failures to allow before returning an unhealthy status.
    InitialDelay string
    The delay before the first probe in ISO 8601 format.
    Period string
    The length of time between probes in ISO 8601 format.
    SuccessThreshold int
    The number of successful probes before returning a healthy status.
    Timeout string
    The probe timeout in ISO 8601 format.
    failureThreshold Integer
    The number of failures to allow before returning an unhealthy status.
    initialDelay String
    The delay before the first probe in ISO 8601 format.
    period String
    The length of time between probes in ISO 8601 format.
    successThreshold Integer
    The number of successful probes before returning a healthy status.
    timeout String
    The probe timeout in ISO 8601 format.
    failureThreshold number
    The number of failures to allow before returning an unhealthy status.
    initialDelay string
    The delay before the first probe in ISO 8601 format.
    period string
    The length of time between probes in ISO 8601 format.
    successThreshold number
    The number of successful probes before returning a healthy status.
    timeout string
    The probe timeout in ISO 8601 format.
    failure_threshold int
    The number of failures to allow before returning an unhealthy status.
    initial_delay str
    The delay before the first probe in ISO 8601 format.
    period str
    The length of time between probes in ISO 8601 format.
    success_threshold int
    The number of successful probes before returning a healthy status.
    timeout str
    The probe timeout in ISO 8601 format.
    failureThreshold Number
    The number of failures to allow before returning an unhealthy status.
    initialDelay String
    The delay before the first probe in ISO 8601 format.
    period String
    The length of time between probes in ISO 8601 format.
    successThreshold Number
    The number of successful probes before returning a healthy status.
    timeout String
    The probe timeout in ISO 8601 format.

    SkuResponse

    Name string
    The name of the SKU. Ex - P3. It is typically a letter+number code
    Capacity int
    If the SKU supports scale out/in then the capacity integer should be included. If scale out/in is not possible for the resource this may be omitted.
    Family string
    If the service has different generations of hardware, for the same SKU, then that can be captured here.
    Size string
    The SKU size. When the name field is the combination of tier and some other value, this would be the standalone code.
    Tier string
    This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT.
    Name string
    The name of the SKU. Ex - P3. It is typically a letter+number code
    Capacity int
    If the SKU supports scale out/in then the capacity integer should be included. If scale out/in is not possible for the resource this may be omitted.
    Family string
    If the service has different generations of hardware, for the same SKU, then that can be captured here.
    Size string
    The SKU size. When the name field is the combination of tier and some other value, this would be the standalone code.
    Tier string
    This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT.
    name String
    The name of the SKU. Ex - P3. It is typically a letter+number code
    capacity Integer
    If the SKU supports scale out/in then the capacity integer should be included. If scale out/in is not possible for the resource this may be omitted.
    family String
    If the service has different generations of hardware, for the same SKU, then that can be captured here.
    size String
    The SKU size. When the name field is the combination of tier and some other value, this would be the standalone code.
    tier String
    This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT.
    name string
    The name of the SKU. Ex - P3. It is typically a letter+number code
    capacity number
    If the SKU supports scale out/in then the capacity integer should be included. If scale out/in is not possible for the resource this may be omitted.
    family string
    If the service has different generations of hardware, for the same SKU, then that can be captured here.
    size string
    The SKU size. When the name field is the combination of tier and some other value, this would be the standalone code.
    tier string
    This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT.
    name str
    The name of the SKU. Ex - P3. It is typically a letter+number code
    capacity int
    If the SKU supports scale out/in then the capacity integer should be included. If scale out/in is not possible for the resource this may be omitted.
    family str
    If the service has different generations of hardware, for the same SKU, then that can be captured here.
    size str
    The SKU size. When the name field is the combination of tier and some other value, this would be the standalone code.
    tier str
    This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT.
    name String
    The name of the SKU. Ex - P3. It is typically a letter+number code
    capacity Number
    If the SKU supports scale out/in then the capacity integer should be included. If scale out/in is not possible for the resource this may be omitted.
    family String
    If the service has different generations of hardware, for the same SKU, then that can be captured here.
    size String
    The SKU size. When the name field is the combination of tier and some other value, this would be the standalone code.
    tier String
    This field is required to be implemented by the Resource Provider if the service has more than one tier, but is not required on a PUT.

    SystemDataResponse

    CreatedAt string
    The timestamp of resource creation (UTC).
    CreatedBy string
    The identity that created the resource.
    CreatedByType string
    The type of identity that created the resource.
    LastModifiedAt string
    The timestamp of resource last modification (UTC)
    LastModifiedBy string
    The identity that last modified the resource.
    LastModifiedByType string
    The type of identity that last modified the resource.
    CreatedAt string
    The timestamp of resource creation (UTC).
    CreatedBy string
    The identity that created the resource.
    CreatedByType string
    The type of identity that created the resource.
    LastModifiedAt string
    The timestamp of resource last modification (UTC)
    LastModifiedBy string
    The identity that last modified the resource.
    LastModifiedByType string
    The type of identity that last modified the resource.
    createdAt String
    The timestamp of resource creation (UTC).
    createdBy String
    The identity that created the resource.
    createdByType String
    The type of identity that created the resource.
    lastModifiedAt String
    The timestamp of resource last modification (UTC)
    lastModifiedBy String
    The identity that last modified the resource.
    lastModifiedByType String
    The type of identity that last modified the resource.
    createdAt string
    The timestamp of resource creation (UTC).
    createdBy string
    The identity that created the resource.
    createdByType string
    The type of identity that created the resource.
    lastModifiedAt string
    The timestamp of resource last modification (UTC)
    lastModifiedBy string
    The identity that last modified the resource.
    lastModifiedByType string
    The type of identity that last modified the resource.
    created_at str
    The timestamp of resource creation (UTC).
    created_by str
    The identity that created the resource.
    created_by_type str
    The type of identity that created the resource.
    last_modified_at str
    The timestamp of resource last modification (UTC)
    last_modified_by str
    The identity that last modified the resource.
    last_modified_by_type str
    The type of identity that last modified the resource.
    createdAt String
    The timestamp of resource creation (UTC).
    createdBy String
    The identity that created the resource.
    createdByType String
    The type of identity that created the resource.
    lastModifiedAt String
    The timestamp of resource last modification (UTC)
    lastModifiedBy String
    The identity that last modified the resource.
    lastModifiedByType String
    The type of identity that last modified the resource.

    TargetUtilizationScaleSettingsResponse

    MaxInstances int
    The maximum number of instances that the deployment can scale to. The quota will be reserved for max_instances.
    MinInstances int
    The minimum number of instances to always be present.
    PollingInterval string
    The polling interval in ISO 8691 format. Only supports duration with precision as low as Seconds.
    TargetUtilizationPercentage int
    Target CPU usage for the autoscaler.
    MaxInstances int
    The maximum number of instances that the deployment can scale to. The quota will be reserved for max_instances.
    MinInstances int
    The minimum number of instances to always be present.
    PollingInterval string
    The polling interval in ISO 8691 format. Only supports duration with precision as low as Seconds.
    TargetUtilizationPercentage int
    Target CPU usage for the autoscaler.
    maxInstances Integer
    The maximum number of instances that the deployment can scale to. The quota will be reserved for max_instances.
    minInstances Integer
    The minimum number of instances to always be present.
    pollingInterval String
    The polling interval in ISO 8691 format. Only supports duration with precision as low as Seconds.
    targetUtilizationPercentage Integer
    Target CPU usage for the autoscaler.
    maxInstances number
    The maximum number of instances that the deployment can scale to. The quota will be reserved for max_instances.
    minInstances number
    The minimum number of instances to always be present.
    pollingInterval string
    The polling interval in ISO 8691 format. Only supports duration with precision as low as Seconds.
    targetUtilizationPercentage number
    Target CPU usage for the autoscaler.
    max_instances int
    The maximum number of instances that the deployment can scale to. The quota will be reserved for max_instances.
    min_instances int
    The minimum number of instances to always be present.
    polling_interval str
    The polling interval in ISO 8691 format. Only supports duration with precision as low as Seconds.
    target_utilization_percentage int
    Target CPU usage for the autoscaler.
    maxInstances Number
    The maximum number of instances that the deployment can scale to. The quota will be reserved for max_instances.
    minInstances Number
    The minimum number of instances to always be present.
    pollingInterval String
    The polling interval in ISO 8691 format. Only supports duration with precision as low as Seconds.
    targetUtilizationPercentage Number
    Target CPU usage for the autoscaler.

    UserAssignedIdentityResponse

    ClientId string
    The client ID of the assigned identity.
    PrincipalId string
    The principal ID of the assigned identity.
    TenantId string
    The tenant ID of the user assigned identity.
    ClientId string
    The client ID of the assigned identity.
    PrincipalId string
    The principal ID of the assigned identity.
    TenantId string
    The tenant ID of the user assigned identity.
    clientId String
    The client ID of the assigned identity.
    principalId String
    The principal ID of the assigned identity.
    tenantId String
    The tenant ID of the user assigned identity.
    clientId string
    The client ID of the assigned identity.
    principalId string
    The principal ID of the assigned identity.
    tenantId string
    The tenant ID of the user assigned identity.
    client_id str
    The client ID of the assigned identity.
    principal_id str
    The principal ID of the assigned identity.
    tenant_id str
    The tenant ID of the user assigned identity.
    clientId String
    The client ID of the assigned identity.
    principalId String
    The principal ID of the assigned identity.
    tenantId String
    The tenant ID of the user assigned identity.

    Package Details

    Repository
    Azure Native pulumi/pulumi-azure-native
    License
    Apache-2.0
    azure-native logo
    This is the latest version of Azure Native. Use the Azure Native v1 docs if using the v1 version of this package.
    Azure Native v2.73.0 published on Wednesday, Nov 20, 2024 by Pulumi