Unionize actor heap large and small chunks #4568

dipinhora · 2024-12-06T14:15:26Z

So they can better fight for their rights..

Prior to this commit, small_chunk_t and large_chunk_t were distinct types. This was great for correctness but prevented a useful optimization.

This commit combines small_chunk_t and large_chunk_t into a single chunk_t, which has a union in it for the small/large chunk specific fields. Because the compiler can no longer help enforce correctness, the relevant functions that deal with small and large chunks now contain a lot of assertions to ensure that the received chunk is of the right type.
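To illustrate the shape of this change, here is a minimal sketch of a unionized chunk with asserting accessors. All names in it (`chunk_kind_t`, the `kind` field, the union members, the helper functions) are hypothetical and chosen for illustration only; the actual runtime layout and the way the chunk type is recorded may differ.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical discriminator; the real runtime may record the type differently. */
typedef enum { CHUNK_SMALL, CHUNK_LARGE } chunk_kind_t;

/* One chunk type with a union for the small/large specific fields. */
typedef struct chunk_t {
  char* m;               /* backing memory block */
  struct chunk_t* next;  /* intrusive list link */
  chunk_kind_t kind;     /* which union member is currently valid */
  union {
    struct { uint32_t slots; uint32_t shallow; } small; /* assumed fields */
    struct { size_t size; int marked; } large;          /* assumed fields */
  } u;
} chunk_t;

static void chunk_init_small(chunk_t* c, char* m)
{
  c->m = m;
  c->next = NULL;
  c->kind = CHUNK_SMALL;
  c->u.small.slots = UINT32_MAX; /* all slots free */
  c->u.small.shallow = 0;
}

static void chunk_init_large(chunk_t* c, char* m, size_t size)
{
  c->m = m;
  c->next = NULL;
  c->kind = CHUNK_LARGE;
  c->u.large.size = size;
  c->u.large.marked = 0;
}

/* The compiler accepts any chunk_t* here, so the assertion is the only guard. */
static uint32_t small_chunk_slots(const chunk_t* c)
{
  assert(c->kind == CHUNK_SMALL);
  return c->u.small.slots;
}

static size_t large_chunk_size(const chunk_t* c)
{
  assert(c->kind == CHUNK_LARGE);
  return c->u.large.size;
}
```

In this sketch, passing a large chunk to `small_chunk_slots` compiles fine and only fails at runtime in builds where `assert` is enabled, which is the trade-off described above.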

This allows small and large chunk recycling to take advantage of the fact that the block backing a small chunk is the same size as the block backing the smallest allowed large chunk, so they both now use the same recycled chunk list.
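A rough sketch of the shared list idea, assuming a hypothetical `SHARED_BLOCK_SIZE` constant and hypothetical helper names (none of these identifiers are taken from the runtime): a chunk whose backing block has the shared size can be pushed by either kind of owner and popped for reuse as either kind.

```c
#include <assert.h>
#include <stddef.h>

/* Assumed value, for illustration only. */
#define SHARED_BLOCK_SIZE 1024

typedef enum { CHUNK_SMALL, CHUNK_LARGE } chunk_kind_t;

typedef struct chunk_t {
  struct chunk_t* next;
  chunk_kind_t kind;
  size_t block_size; /* size of the backing block */
} chunk_t;

typedef struct { chunk_t* head; } recycle_list_t;

/* Recycle a chunk of either kind, as long as its block has the shared size. */
static void recycle_push(recycle_list_t* list, chunk_t* c)
{
  assert(c->block_size == SHARED_BLOCK_SIZE);
  c->next = list->head;
  list->head = c;
}

/* Reuse a recycled chunk as the requested kind; NULL if the list is empty. */
static chunk_t* recycle_pop_as(recycle_list_t* list, chunk_kind_t kind)
{
  chunk_t* c = list->head;
  if(c == NULL)
    return NULL;

  list->head = c->next;
  c->next = NULL;
  c->kind = kind; /* the caller re-initializes the kind specific fields */
  return c;
}
```

With distinct chunk types, a recycled small chunk could not be handed back out as a large chunk without an extra free/allocate round trip; with one type and one list it is a pointer swap.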

The large chunk recycling has also been enhanced to take advantage of the long tail distribution of allocation sizes. There are now multiple size specific chunk lists for large chunk recycling, plus one extra large chunk list that holds all chunks bigger than the largest size specific list. That extra list is kept sorted and is searched/used as the old large chunk recycling list was.
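The per-size lists plus sorted overflow list could look roughly like the sketch below. The number of lists, the size classes (multiples of an assumed `MIN_LARGE_SIZE`), the first-fit search of the sorted list, and every identifier are assumptions made for illustration, not the values or names used in this PR.

```c
#include <assert.h>
#include <stddef.h>

#define NUM_SIZED_LISTS 4   /* assumed count */
#define MIN_LARGE_SIZE 1024 /* assumed smallest large chunk block size */

typedef struct chunk_t {
  struct chunk_t* next;
  size_t size;
} chunk_t;

typedef struct {
  chunk_t* sized[NUM_SIZED_LISTS]; /* list i holds size MIN_LARGE_SIZE * (i + 1) */
  chunk_t* overflow;               /* everything bigger, sorted ascending by size */
} large_recycle_t;

/* Index of the size specific list for `size`, or -1 for the overflow list. */
static int sized_index(size_t size)
{
  assert((size >= MIN_LARGE_SIZE) && ((size % MIN_LARGE_SIZE) == 0));
  size_t i = (size / MIN_LARGE_SIZE) - 1;
  return (i < NUM_SIZED_LISTS) ? (int)i : -1;
}

static void large_recycle_put(large_recycle_t* r, chunk_t* c)
{
  int i = sized_index(c->size);
  if(i >= 0)
  {
    /* common sizes: constant time push */
    c->next = r->sized[i];
    r->sized[i] = c;
    return;
  }

  /* rare big sizes: sorted insert */
  chunk_t** p = &r->overflow;
  while((*p != NULL) && ((*p)->size < c->size))
    p = &(*p)->next;

  c->next = *p;
  *p = c;
}

/* Returns a recycled chunk of at least `size` bytes, or NULL if none fits. */
static chunk_t* large_recycle_get(large_recycle_t* r, size_t size)
{
  int i = sized_index(size);
  chunk_t** p;

  if(i >= 0)
  {
    p = &r->sized[i]; /* exact size list: constant time pop */
  } else {
    p = &r->overflow; /* sorted list: the first fit is also the smallest fit */
    while((*p != NULL) && ((*p)->size < size))
      p = &(*p)->next;
  }

  chunk_t* c = *p;
  if(c != NULL)
  {
    *p = c->next;
    c->next = NULL;
  }
  return c;
}
```

The design bet in this sketch is that most large allocations fall into a handful of small size classes, so they get constant time recycling, while the long tail of bigger sizes pays for a linear search of a list that should stay short.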

jemc · 2024-12-10T19:40:30Z

Looks like the CI failure is unrelated. It seems we need to update some GitHub Actions versions in our CI workflows.

SeanTAllen · 2024-12-12T18:34:53Z

@dipinhora please rebase this against main when you have a chance.

dipinhora · 2024-12-13T15:12:20Z

rebased

jemc · 2024-12-17T19:45:49Z

Sorry that nobody has carved out the time to give this a meaningful review yet. Sean and I both feel this requires attention. Sorry for the delay.

dipinhora · 2024-12-17T19:51:44Z

> Sorry that nobody has carved out the time to give this a meaningful review yet. Sean and I both feel this requires attention. Sorry for the delay.

no worries.. and no need to apologize.. folks should be holidaying at this time of the year after all..

SeanTAllen · 2025-01-08T00:47:14Z

@dipinhora what would you imagine the impact of this change would be on "the average" pony program?

dipinhora · 2025-01-08T02:37:22Z

> @dipinhora what would you imagine the impact of this change would be on "the average" pony program?

same as #4531 but more efficient.. this enhances the previous work to better exploit the fact that the memory chunk backing a small chunk is the same size as that backing the smallest allowed large chunk.. it also allows for some per-size recyclable large chunk lists, trading some more memory for computation savings (if desired)..

SeanTAllen · 2025-01-08T02:50:09Z

I have some worries about the "compiler used to help us, now asserts". This feels easier to break. But I am mostly ignorant about this code. It's been a long time since I worked closely with it.

Do you have suggestions for tests we can do in CI to add an additional level of assurance that nothing gets broken in the future?

dipinhora · 2025-01-08T03:00:30Z

> I have some worries about the "compiler used to help us, now asserts". This feels easier to break. But I am mostly ignorant about this code. It's been a long time since I worked closely with it.

note: this more closely matches how this code used to be before small_chunk_t/large_chunk_t were split for memory efficiency purposes a while back.. back then, the compiler also wasn't able to help enforce correctness but there also wasn't any type punning with unions involved like there is here..

> Do you have suggestions for tests we can do in CI to add an additional level of assurance that nothing gets broken in the future?

i'm not sure what you're looking for here.. pony already has a comprehensive test suite of pony programs (full-program-tests and stdlib) that are run in CI that all would likely fail if this code was broken in some way..

SeanTAllen · 2025-01-08T03:02:07Z

@dipinhora I was thinking something that could point to "someone f'd up memory". I doubt there is anything but I wanted to see if asking shook something free as a thought.

dipinhora · 2025-01-08T03:04:13Z

> @dipinhora I was thinking something that could point to "someone f'd up memory". I doubt there is anything but I wanted to see if asking shook something free as a thought.

that's what the assertions are for... 8*/

ponylang-main added the "discuss during sync" label Dec 6, 2024

dipinhora added 2 commits December 13, 2024 10:08

fix runtimestats heap test

e5c1692

dipinhora force-pushed the unionize_chunks branch from 9faf2d0 to e5c1692 Compare December 13, 2024 15:08

SeanTAllen requested a review from a team December 17, 2024 19:45
