
AzOps - Discovery Performance Issues #438

Closed · reckitt-maciejglowacki opened this issue Oct 6, 2021 · 24 comments · Fixed by #748

Labels: area/powershell · enhancement (New feature or request) · long-term (Long term item - used for automation)

@reckitt-maciejglowacki

The AzOps - Pull pipeline of the AzOps Accelerator, run in Azure DevOps, fails to gather information about all of the subscriptions and times out.

The SPN has been given privileges over the root management group, which contains about 250 subscriptions. The build times out after 4 hours, which is the limit I've set in the pipeline itself. I'm actually not sure whether it does anything for that long (or just hangs midway), because the log file is too large to browse effectively.

Here's a screenshot:
[screenshot: 2021-10-06_15h40_06]

Has anyone experienced anything like this? Is this tool designed to handle that many subscriptions? Or could it be a problem with the DevOps pool? Any help would be much appreciated.

@daltondhcp (Contributor)

Hey @reckitt-maciejglowacki,
We have customers running this with over 1,000 subscriptions, so it should definitely work.
Can you share how the settings below are configured in the settings.json file?
[screenshot of the relevant settings]

@daltondhcp added the waiting-for-response (Maintainers have replied and are awaiting a response from the bug/issue/feature creator) label · Oct 8, 2021
@reckitt-maciejglowacki (Author)

I'm using the defaults from https://github.com/Azure/AzOps-Accelerator/blob/main/settings.json

[screenshot: 2021-10-11_09h50_19]

The only thing that I have changed is timeoutInMinutes in the pipeline itself.
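
For reference, that timeout is set per job in the pipeline YAML. A minimal sketch, assuming a job layout like the AzOps-Accelerator pull pipeline (the job name and step are illustrative, not the actual pipeline definition):

```yaml
jobs:
  - job: pull
    # Per-job timeout in minutes; note that Microsoft-hosted agents cap the
    # effective run time regardless of the value configured here.
    timeoutInMinutes: 240
    steps:
      - script: echo "AzOps pull steps would run here"
```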

@daltondhcp (Contributor)

Thank you. For troubleshooting purposes, could you please try changing the Core.SkipResourceGroup setting to true and report back the results?
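
For readers following along, that change maps to the fragment of settings.json below; a minimal sketch assuming the Core block layout from the AzOps-Accelerator defaults linked above, with all other keys omitted:

```json
{
  "Core": {
    "SkipResourceGroup": true
  }
}
```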

@reckitt-maciejglowacki (Author)

That certainly helped :) The pipeline now runs in about 2 hours, but it still fails due to #439.

Are there any disadvantages to skipping RG discovery?

@daltondhcp (Contributor)

You would only want RG discovery if you intend to do RG-level deployments (like VMs or other resources) with AzOps, which I assume is not the intent here?

@reckitt-maciejglowacki (Author)

It's not, but we do want to be able to differentiate policy and role assignments between different resource groups.

@daltondhcp (Contributor)

Are you going to manage that from a central platform perspective via AzOps or let the individual LZ teams do it?

@reckitt-maciejglowacki (Author)

We're doing it centrally, I'm afraid.

@daltondhcp (Contributor)

Understood. Can you try changing the setting back to discover RGs, increase the pipeline timeout in ADO to 6 hours, and see if it completes successfully?

@reckitt-maciejglowacki (Author)

Okay. I'll do that today and let you know the results.

@reckitt-maciejglowacki (Author)

Same :(

[screenshot: 2021-10-19_08h58_59]

@daltondhcp added the area/powershell and enhancement (New feature or request) labels and removed the waiting-for-response (Maintainers have replied and are awaiting a response from the bug/issue/feature creator) label · Oct 21, 2021
@daltondhcp (Contributor)

Thank you for confirming this. We will take a look at this and see what we can do. Our advice would be to disable resource group discovery for now.

@reckitt-maciejglowacki (Author)

Hi @daltondhcp, just wanted to check: have you managed to look into this issue? Thanks.

@reckitt-maciejglowacki (Author)

@daltondhcp bump

@daltondhcp (Contributor)

Hey @reckitt-maciejglowacki - we are currently working on this; unfortunately, there is no short-term fix. I will make sure to keep the progress updated in this issue.

@reckitt-maciejglowacki (Author)

Got it. Thanks for the info.

@reckitt-maciejglowacki (Author)

Hi, any update on this?

@jtracey93 (Contributor)

Hey @reckitt-maciejglowacki,

Unfortunately, we are still investigating this, but as a workaround you could use self-hosted agents, which have an unlimited run time, as per: https://docs.microsoft.com/en-us/azure/devops/pipelines/process/phases?view=azure-devops&tabs=yaml#timeouts

Guidance on creating self-hosted agents can be found here: https://docs.microsoft.com/en-us/azure/devops/pipelines/agents/agents?view=azure-devops&tabs=browser#install

Hope that helps move you forward in the near term 👍
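
For anyone applying this workaround, removing the cap on a self-hosted agent comes down to pointing the job at the self-hosted pool and setting the timeout to zero. A minimal sketch (the pool name and job layout are hypothetical):

```yaml
jobs:
  - job: pull
    pool:
      name: MySelfHostedPool   # hypothetical self-hosted agent pool name
    # On self-hosted agents, 0 removes the timeout entirely;
    # Microsoft-hosted agents remain capped regardless of this value.
    timeoutInMinutes: 0
    steps:
      - script: echo "AzOps pull steps would run here"
```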

@daltondhcp added the long-term (Long term item - used for automation) label · Jan 13, 2022
@Jefajers added this to the Release - v2.0.0 milestone · Apr 27, 2022
@SomilGanguly self-assigned this · Apr 28, 2022
@daltondhcp changed the title from "AzOps - Pull times out" to "AzOps - Discovery Performance Issues" · May 24, 2022
@daltondhcp mentioned this issue · Feb 17, 2023
@daltondhcp linked a pull request that will close this issue · Feb 17, 2023
@reckitt-maciejglowacki (Author)

Hi. Just wanted to let you know that this update definitely hasn't fixed anything. Quite the opposite.

I'm getting various random errors when trying to execute this in an ADO pipeline. Even when it does run uninterrupted (which seems completely random), it times out after an hour.

[screenshots of the errors]

@Jefajers (Member)

Hi @reckitt-maciejglowacki, thanks for updating this issue and sharing.

Your experience matches ours: a variety of different errors ultimately cause pipeline executions to fail.

We started seeing this as well once we released 2.0.0 into the wild, and determined that the majority of the different errors are due to the expanded use of parallel processing. When an execution machine is configured with a "high" throttle limit but has a "low" number of cores, the errors start to show up frequently.

Our response was to implement logic in the module that detects this misalignment and overrides the throttle limit when it is found. In addition, we created a wiki page on performance considerations.

As of release 2.0.2, the AzOps module includes improvements intended to resolve this behavior.
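
To illustrate the kind of misalignment described above, here is a hedged sketch of the idea (hypothetical values, not the AzOps module's actual code): with a throttle limit well above the core count, parallel runspaces oversubscribe the CPU, so the check clamps the limit to something the machine can sustain.

```powershell
# Illustrative sketch only - not AzOps internals.
$coreCount     = [Environment]::ProcessorCount
$throttleLimit = 20   # example of a "high" configured throttle limit

if ($throttleLimit -gt ($coreCount * 2)) {
    # Misalignment detected: override the configured throttle limit.
    Write-Warning "ThrottleLimit $throttleLimit is high for $coreCount cores; overriding."
    $throttleLimit = $coreCount * 2
}

# The effective limit then drives parallel discovery work (PowerShell 7+):
1..50 | ForEach-Object -ThrottleLimit $throttleLimit -Parallel {
    # ... per-item discovery work would go here ...
    Start-Sleep -Milliseconds 10
}
```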

Could you confirm whether you still have these issues on the latest release? (If yes, let's re-open the issue.)

@reckitt-maciejglowacki (Author)

Thank you @Jefajers. The latest update does seem to work. I haven't tried it at the resource level yet, but it runs well for subscriptions and resource groups.

@reckitt-maciejglowacki (Author)

Turns out my enthusiasm was premature...

[screenshot of the error]

@daltondhcp (Contributor)

Can you share the details of the errors? Same as before or something else?

@reckitt-maciejglowacki (Author)

The same, I think:

[screenshots of the errors]

They seem to appear above a certain number of objects, but I haven't drilled down into it yet.

We're using AZOPS_MODULE_VERSION 2.1.2 and a mostly default settings.json from the AzOps-Accelerator project, with "Core.SkipResourceGroup": false.
