Bug fix in cache management for nested lists #1956

MaximilianSchreff · 2023-12-07T12:27:56Z

SystemDS was previously not supporting nested lists correctly since the data of CacheableData objects within nested loops were always deleted after a function call.
Normally, there are rmvar statements after function calls to remove alll variables used within the function. To protect CacheableData objects (e.g. matrices) from having their data removed by the rmvar statements we use a cleanup-enabled flag. This flag was not correctly set for variables that were within a nested list. These commits fix this problem by flagging all elements, also within nested lists.

Automated tests have been added to test the changes.

@phaniarnab

…ables

MaximilianSchreff · 2023-12-07T12:31:44Z

The bug can be triggered through a simple function call with a nested list as an argument.

Here is a dml script which does this.
list_bug.txt

Baunsgaard · 2023-12-30T11:49:32Z

LGTM,

I think the Queue make things slower instead of the boolean list,
but if we need to support the dynamic lengths of the Lists or nested Lists, then this fix makes sense.
And the live variables management is a low overhead to begin with so, all good from my side.

We need to run the tests again once the main branch is clear of failing tests.

MaximilianSchreff · 2023-12-30T14:34:39Z

@Baunsgaard
About the Queue - previously, we iterated through all variables and counted the number of cacheable objects. This was done to calculate the size of the boolean array but in a shallow manner. With the changes, we would now need to iterate through every single variable, i.e. laso recursively for lists, in order to get the size. That is why I chose a dynamic data structure. I'm not sure whether this is faster though. Since the number of live variables per function call is usually pretty low, I wasn't able to see any performance differences.

Baunsgaard · 2023-12-30T16:13:34Z

thanks for the PR, it is now merged.

MaximilianSchreff added 2 commits December 6, 2023 17:09

Fixed bug with memory management for nested lists

568717d

Added automated tests for modfified cache management function pinVari…

e9f8960

…ables

MaximilianSchreff mentioned this pull request Dec 7, 2023

ResNet Bottleneck Architecture #1957

Closed

j143 added this to the systemds-3.2.0 milestone Dec 17, 2023

Baunsgaard closed this in 61a385f Dec 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug fix in cache management for nested lists #1956

Bug fix in cache management for nested lists #1956

MaximilianSchreff commented Dec 7, 2023

MaximilianSchreff commented Dec 7, 2023

Baunsgaard commented Dec 30, 2023

MaximilianSchreff commented Dec 30, 2023

Baunsgaard commented Dec 30, 2023

Bug fix in cache management for nested lists #1956

Bug fix in cache management for nested lists #1956

Conversation

MaximilianSchreff commented Dec 7, 2023

MaximilianSchreff commented Dec 7, 2023

Baunsgaard commented Dec 30, 2023

MaximilianSchreff commented Dec 30, 2023

Baunsgaard commented Dec 30, 2023