azure unexpected deployment Template failure - databricks

i was trying to create a cluster in Databricks but every time i try it does'nt start and it shows this message: Error code: UnexpectedDeploymentTemplateFailure, error message: Failing to launch instances for the cluster because of unexpected deployment failure. Message: {"error":{"code":"MultipleErrorsOccurred","message":"Multiple error occurred: BadRequest,BadRequest. Please see details.","details":[{"code":"InvalidTemplateDeployment","message":"The template deployment failed with error: 'The resource with id: '/subscriptions/efbb03c8-943f-477e-8c81-568425a73b74/resourceGroups/databricks-rg-DPC-ovvxul4l4o77a/providers/Microsoft.Compute/virtualMachines/4233ecc1fb88403caec0a5d994698bb7' failed validation with message: 'The requested size for resource '/subscriptions/efbb03c8-943f-477e-8c81-
any help please ?

actually i realized that i can't work with databriks with a free azure account because of the size of cluster. you can work with the community addition instead.

Related

Unable to Deploy Flatcar OS on Azure

I was trying to deploy flatcar image on Azure, but I am not able to deploy it. following are the steps I performed
I downloaded latest azure supported VHD from https://stable.release.flatcar-linux.net/amd64-usr/current/flatcar_production_azure_image.vhd.bz2.
I uploaded this VHD to azure storage blob and converted it to an image as recommended by Azure guides
I tried creating VM out of this image. VM gets created successfully, but we can see one error while creating VM and VM creation is shown as failed (Even though it is actually successful). Following is the error which I can see:
{
"code": "DeploymentFailed",
"message": "At least one resource deployment operation failed. Please list deployment operations for details. Please see https://aka.ms/DeployOperations for usage details.",
"details": [
{
"code": "VMExtensionHandlerNonTransientError",
"message": "The handler for VM extension type 'Microsoft.Azure.Diagnostics.LinuxDiagnostic' has reported terminal failure for VM extension 'LinuxDiagnostic' with error message: '[ExtensionOperationError] Non-zero exit code: 1, /var/lib/waagent/Microsoft.Azure.Diagnostics.LinuxDiagnostic-3.0.141/diagnostic.py -install\n[stdout]\n\n\n[stderr]\n File \"/var/lib/waagent/Microsoft.Azure.Diagnostics.LinuxDiagnostic-3.0.141/diagnostic.py\", line 54\n print 'A local import (e.g., waagent) failed. Exception: {0}\\n' \\\n ^\nSyntaxError: invalid syntax\n'.\r\n \r\n'Install handler failed for the extension. More information on troubleshooting is available at https://aka.ms/VMExtensionLinuxDiagnosticsTroubleshoot'"
}
]
}
I tried going through link provided, but it didn't help much.
I also tried another option as following
Deployed flatcar VM through Azure marketplace
Captured generalized image out of this VM
Deployed VM using the image created in above step
Even with this approach I am getting same error.
for now, waagent (Azure Linux agent) does not support python 3.x, hence this syntax error exists. You need to have python 2.x on your OS to not have this issue.

CI/CD Pipeline is failing for Synapse

I am getting below error when deploying Synapse CI/CD pipeline using Azure DevOps. The deployment was working fine until I added a few more notebooks and it started failing. Notebooks run fine on lower environment. All paths in the notebooks exist in higher environment. There is no concept of linkedTemplates in Synapse. I thought it was failing due to the length of ARMtemplate has exceeded.
Below is the error log, I have removed subscription name and resource group name and pipeline names.
2022-07-06T01:53:27.9739484Z Error during execution: Error: Failed to fetch the deployment status {"code":"400","message":"CreateOrUpdateNotebook failed: [statusCode from ADF:BadRequest, ErrorMessage: {\"code\":\"BadRequest\",\"message\":\"The document creation or update failed because of invalid reference ''.\",\"target\":\"/subscriptions//resourceGroups/rg-name/providers/Microsoft.Synapse/workspaces/workspace-02/notebooks/SilverToGoldPipeline\",\"details\":null,\"error\":null}, workspace: workspace-02, notebook: SilverToGoldPipeline, ArtifactId: ]"}
2022-07-06T01:53:27.9811549Z ##[error]Encountered with exception:Error: Failed to fetch the deployment status {"code":"400","message":"CreateOrUpdateNotebook failed: [statusCode from ADF:BadRequest, ErrorMessage: {\"code\":\"BadRequest\",\"message\":\"The document creation or update failed because of invalid reference ''.\",\"target\":\"/subscriptions/<>/resourceGroups/rg-name/providers/Microsoft.Synapse/workspaces/workspace-02/notebooks/SilverToGoldPipeline\",\"details\":null,\"error\":null}, workspace: workspace-02, notebook: SilverToGoldPipeline, ArtifactId: ]"}
2022-07-06T01:53:27.9954864Z ##[section]Finishing: Synapse deployment task for workspace: Stage
I was able to resolve the issue. The issue was that parameters file for higher environment was missing values for the pipelines. When we publish from master/main branch, if a new notebook is added and it is parameterized then an entry is created in the parameters file.
In my case, the error was because I had to create the Apache Spark pool first, before deploying.
Apparently, the deployment was not creating it automatically.

Configuration of Managed Service Identity(MSI)

2019-10-02T04:00:46.7615707Z ##[error]Error: Failed to get resource ID
for resource type 'Microsoft.Web/Sites' and resource name
'morningtonFrontEnd'. Error: Could not fetch access token for Managed
Service Principal. Please configure Managed Service Identity (MSI) for
virtual machine 'https://aka.ms/azure-msi-docs'. Status code: 400,
status message: Bad Request.
Could anyone explain to me what is the issue?
I have "Contributor" access to Azure.
Logs:
2019-10-02T04:00:45.2038473Z ##[section]Starting: Deploy Azure App Service
2019-10-02T04:00:45.2159933Z ==============================================================================
2019-10-02T04:00:45.2160013Z Task : Azure App Service deploy
2019-10-02T04:00:45.2160056Z Description : Deploy to Azure App Service a web, mobile, or API app using Docker, Java, .NET, .NET Core, Node.js, PHP, Python, or Ruby
2019-10-02T04:00:45.2160116Z Version : 4.157.1
2019-10-02T04:00:45.2160154Z Author : Microsoft Corporation
2019-10-02T04:00:45.2160215Z Help : https://learn.microsoft.com/azure/devops/pipelines/tasks/deploy/azure-rm-web-app-deployment
2019-10-02T04:00:45.2160272Z ==============================================================================
2019-10-02T04:00:46.7125417Z Got service connection details for Azure App Service:'morningtonFrontEnd'
2019-10-02T04:00:46.7615707Z ##[error]Error: Failed to get resource ID for resource type 'Microsoft.Web/Sites' and resource name 'morningtonFrontEnd'. Error: Could not fetch access token for Managed Service Principal. Please configure Managed Service Identity (MSI) for virtual machine 'https://aka.ms/azure-msi-docs'. Status code: 400, status message: Bad Request
2019-10-02T04:00:46.7625812Z (node:2404) UnhandledPromiseRejectionWarning: Unhandled promise rejection (rejection id: 1): TypeError: Cannot read property 'getApplicationURL' of undefined
2019-10-02T04:00:46.7741572Z Failed to add release annotation. TypeError: Cannot read property 'getApplicationSettings' of undefined
2019-10-02T04:00:46.7782967Z ##[section]Finishing: Deploy Azure App Service
[error]Error: Failed to get resource ID for resource type
'Microsoft.Web/Sites' and resource name 'morningtonFrontEnd'. Error:
Could not fetch access token for Managed Service Principal. Please
configure Managed Service Identity (MSI) for virtual machine
'https://aka.ms/azure-msi-docs'. Status code: 400, status message: Bad
Request
Focus on this error message, the next error line UnhandledPromiseRejectionWarning: Unhandled promise rejection (rejection id: 1): TypeError: Cannot read property 'getApplicationURL' of undefined caused by the first service connection could not connect successfully.
For this error, it does not relevant your Contributor role. This generally because the authentication token is expired or invalid. You can first click Verify connection to check whether this connection is nothing goes wrong. If the verification is successful, it will show verified, which means that the current connection is no problem.
If this verified failed, you'd better re-create one new service connection. Please follow this doc and this blog to start the create steps.
After the service connection created successfully, click Verify connection firstly. Just ensure nothing goes wrong, then use it in pipeline.
Also, not forget click Authorized in task configuration:
Finish all above pre-verification steps, the error should not appear again.

How to get more information about a failed Azure deployment?

I'm trying to automatically deploy a resource of type Microsoft.BotService/botServices on Azure and the deployment gets stuck endlessly retrying it while only giving me a "Internal server error (500)" message.
Is there any way to get more specific information?
Click on the resource group you're deploying to in the Portal.

Azure deployment for QnA service fails in portal - invalid template

I'm trying to deploy Azure QnA service via Azure portal. When I populate all fields and start deployment it fails with the following message:
The template deployment 'Microsoft.CognitiveServicesQnAMaker' is not valid according to the validation procedure. The tracking id is '....'. See inner errors for details. Please see https://aka.ms/arm-deploy for usage details.
If you have deployed the resource from a custom template or from a template saved to your account that is similar to the procedure shown here -> (https://learn.microsoft.com/en-us/azure/azure-resource-manager/resource-group-template-deploy-portal) and if you are receiving an invalid template deployment error according to validation procedure then you can follow below steps to resolve the issue. Hope this helps!
Get Correlation ID of the deployment operation event by following the steps that are illustrated here -> (https://learn.microsoft.com/en-us/azure/azure-resource-manager/resource-manager-deployment-operations)
Run the command Get-AzureRMLog -CorrelationId xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx -DetailedOutput to see if the output of it has any specific information regarding the root cause of the issue.
Find the exact error code and validate it here -> (https://learn.microsoft.com/en-us/azure/azure-resource-manager/resource-manager-common-deployment-errors). For example, if your error code is InvalidTemplate then check your template syntax for errors and follow this (https://learn.microsoft.com/en-us/azure/azure-resource-manager/resource-manager-invalid-template-errors) link to resolve invalid template error.

Resources