[feat] Optimise the test case generation #2544

vkarak · 2022-06-27T13:24:25Z

This PR optimises the generate_testcases() function by eliminating the deep copies from it completely. These copies have been one of the major contributing factors to the increased load times for large numbers of tests (on the order of thousands). The execution time overhead of generate_testcases() has been practically eliminated: generating 10000 test cases out of the HelloTest went from 6.09s to 0.1156s on my Mac (2.3GHz quad-core Intel Core i5). The total gain in listing the checks, timed with the time command (which includes import overheads and printouts), ranges from 25% to 40% on the various systems on which it was tested. The loading phase of the tests is now fully dominated by the test instantiation:

reframe/reframe/core/decorators.py

Line 67 in 1be1749

def instantiate_all(self, reset_sysenv=0):

This in turn can be broken down into three major factors:

1. The __setattr__ and __getattr__ machinery of the metaclass
2. The dynamic type checking performed for the various test fields by the TypedField
3. The deep copy of the default variable values

These time-consuming operations are more difficult to reduce, especially since (1) is at the heart of the test syntax eDSL. For the other two factors, we might think of possible optimisations in the future.
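To illustrate where the type checking and default-value copying costs come from, here is a minimal sketch of a type-checked field written as a descriptor. The names CheckedField and DummyTest are hypothetical and this is not ReFrame's actual TypedField implementation; it only assumes that every assignment is validated and that every instance receives its own copy of each default.

```python
import copy


class CheckedField:
    """Hypothetical type-checked field (not ReFrame's TypedField)."""

    def __init__(self, *types, default=None):
        self.types = types
        self.default = default

    def __set_name__(self, owner, name):
        # Remember the attribute name this descriptor is bound to
        self.name = name

    def __get__(self, obj, objtype=None):
        if obj is None:
            return self

        return obj.__dict__[self.name]

    def __set__(self, obj, value):
        # This isinstance() check is paid on every single assignment
        if not isinstance(value, self.types):
            expected = ', '.join(t.__name__ for t in self.types)
            raise TypeError(
                f'{self.name}: expected {expected}, '
                f'got {type(value).__name__}'
            )

        obj.__dict__[self.name] = value


class DummyTest:
    """Hypothetical test class with two checked variables."""

    num_tasks = CheckedField(int, default=1)
    modules = CheckedField(list, default=[])

    def __init__(self):
        # Every instance deep-copies each default, so that mutable
        # defaults are never shared; this is paid per instantiation
        for name, attr in vars(type(self)).items():
            if isinstance(attr, CheckedField):
                setattr(self, name, copy.deepcopy(attr.default))


a, b = DummyTest(), DummyTest()
a.modules.append('gcc')   # does not affect b.modules
```

Instantiating thousands of such objects repeats both the copy and the type check for every field, which is why these costs scale with the number of tests.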

Implementation details

The performance gains in loading the tests come from the following two factors:

1. The deep copies of the test case's partition and environment are completely eliminated. Both the SystemPartition and the Environment are immutable objects, so there is no need to copy them.
2. The deep copy of the test (which is the most expensive) cannot be eliminated, but it was deferred to the setup stage of the pipeline. This way, it gets amortised during the run and is not paid up front. Also, in order not to pay that cost all at once during the setup stage, the default pipeline timeout was set to 3s, meaning that the framework will only push down the pipeline as many test cases as it can during this window. As these test cases later block waiting for their corresponding build and run jobs, the runtime will advance other test cases through their setup stage and therefore the deep copy cost of the test is hidden.
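The two factors above can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the code of this PR: Partition, Check, LazyTestCase, setup() and advance_within_budget() are hypothetical stand-ins. The partition and environment are stored by reference, the check is deep-copied only when the case reaches its setup stage, and a time budget limits how many cases are set up in one go.

```python
import copy
import time
from collections import deque
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Partition:
    # Stand-in for an immutable object: safe to share by reference
    name: str


@dataclass
class Check:
    # Stand-in for a mutable test object that must be copied per case
    name: str
    tags: list = field(default_factory=list)


class LazyTestCase:
    def __init__(self, check, partition, environ):
        self._check_orig = check
        self._check = None
        self.partition = partition   # shared, never copied
        self.environ = environ       # shared, never copied

    @property
    def check(self):
        # Until setup() runs, the case exposes the original check
        return self._check_orig if self._check is None else self._check

    def setup(self):
        # The expensive deep copy happens here, once per case
        if self._check is None:
            self._check = copy.deepcopy(self._check_orig)


def advance_within_budget(pending, budget_s, clock=time.monotonic):
    """Set up cases from `pending` until the time budget is used up."""
    deadline = clock() + budget_s
    started = []
    while pending and clock() < deadline:
        case = pending.popleft()
        case.setup()
        started.append(case)

    return started


part = Partition('gpu')
check = Check('HelloTest')
pending = deque(LazyTestCase(check, part, 'builtin') for _ in range(1000))
started = advance_within_budget(pending, budget_s=3)
```

Cases left in pending would be picked up in a later pass, while the ones already started wait for their build and run jobs.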

Fixes #2497


victorusu

lgtm. I think that there is only one English typo... But I am not a native speaker.

reframe/frontend/executors/__init__.py


Vasileios Karakasis added 2 commits June 23, 2022 15:39

Do not deep copy the partition and the environment

b68ea2d

- This gives us an 8% performance benefit in total. Here is the experiment:

  ```
  ./bin/reframe -c unittests/resources/checks/hellocheck.py -n HelloTest \
      --repeat=1000 -l
  ```

Deep copy checks lazily

46907f3

Also set the pipeline timeout to 3s, in order to give better responsiveness with very large datasets. This is now much more important, since the deep copy of the check was moved to the test's setup phase.

vkarak added prio: normal enhancement runtime labels Jun 27, 2022

vkarak added this to the ReFrame Sprint 22.06.1 milestone Jun 27, 2022

vkarak requested review from ekouts and victorusu June 27, 2022 13:24

vkarak self-assigned this Jun 27, 2022

vkarak mentioned this pull request Jun 27, 2022

[feat] Add a lightweight time profiler in the framework #2545

Merged

victorusu approved these changes Jun 28, 2022


vkarak and others added 2 commits June 28, 2022 16:00

Address PR comments

02fbe55

Co-authored-by: victorusu <[email protected]>

Merge branch 'master' into feat/improve-test-load-perf

06ac208

vkarak merged commit 8d758ce into reframe-hpc:master Jun 30, 2022

vkarak deleted the feat/improve-test-load-perf branch June 30, 2022 08:12
