Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LRU DashMap to cache objectMeta #10125

Closed
wants to merge 1 commit into from

Conversation

Lordworms
Copy link
Contributor

@Lordworms Lordworms commented Apr 17, 2024

Which issue does this PR close?

part of #9964

Closes #.

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

@@ -232,4 +337,64 @@ mod tests {
meta.clone()
);
}
#[test]
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One thing that I think would be really helpful in validating if the cache is being used is showing it in our plans with EXPLAIN (i dont know which setting it would best belong in though - i.e. verbose or analyze). For example I believe postgres exposes information like this through its EXPLAIN BUFFERS. At this stage i dont think we need a new keyword though.

Then we could have a test that just reads from a local file and we can assert on the plans output that the source of the file metadata is the cache. This could also be extended for when we implement file caching.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ref to postgres

Specifically:

Include information on buffer usage. Specifically, include the number of shared blocks hit, read, dirtied, and written, the number of local blocks hit, read, dirtied, and written, the number of temp blocks read and written, and the time spent reading and writing data file blocks and temporary file blocks (in milliseconds) if [track_io_timing](https://www.postgresql.org/docs/current/runtime-config-statistics.html#GUC-TRACK-IO-TIMING) is enabled. A hit means that a read was avoided because the block was found already in cache when needed.```

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Got it, I would first try adding the filter as you suggested on the issue page. Thanks!

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thought about this a little more and what i mentioned above may be tangential to this PR since i think it would be more about the cache trait method usage than this specific cache implementation. So i dont think it would be necessary for this.

@matthewmturner
Copy link
Contributor

@Lordworms i added one comment, i plan to review more later / tomorrow.

Copy link

Thank you for your contribution. Unfortunately, this pull request is stale because it has been open 60 days with no activity. Please remove the stale label or comment or this will be closed in 7 days.

@github-actions github-actions bot added the Stale PR has not had any activity for some time label Jun 23, 2024
@github-actions github-actions bot closed this Jun 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Stale PR has not had any activity for some time
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants