Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Benchmark Runner output #11

Open
maximusunc opened this issue Nov 17, 2023 · 8 comments
Open

Benchmark Runner output #11

maximusunc opened this issue Nov 17, 2023 · 8 comments

Comments

@maximusunc
Copy link
Collaborator

The Benchmark Runner has two main outputs, (1) a text output of the overall results metrics and (2) screenshot images of those metrics plotted.

Sample text output:

Benchmark: ameliorates
Results Directory: /var/folders/lr/65kxj04n3vz3ttm_vw0_tq8r0000gn/T/tmpzkjrkmhe/ameliorates/aragorn/2023-11-17_12-42-58

                        k=1     k=5     k=10    k=20
Precision @ k           0.0909  0.0818  0.0636  0.0545
Recall @ k              0.0476  0.2143  0.3333  0.5714
mAP @ k                 0.0909  0.1697  0.1877  0.2092
Top-k Accuracy          0.0476  0.2143  0.3333  0.5714

Mean Reciprocal Rank    0.23669455544455545

The screenshots are a dictionary where the keys are the plot types (i.e. precision @ k) and the values are raw image bytes.

@maximusunc
Copy link
Collaborator Author

I've also been playing around with how these would look in the Information Radiator and so far I've gotten this up and running.
Screenshot 2023-11-17 at 12 06 16 PM

@sierra-moxon
Copy link
Member

@maximusunc - is there any high-level notion of "passing a benchmark" or "failing a benchmark" in the current output?

@maximusunc
Copy link
Collaborator Author

Nope. If it runs successfully, then it passes.

@sierra-moxon
Copy link
Member

runs successfully == gives output such as that above?

@maximusunc
Copy link
Collaborator Author

Yes

@sierra-moxon
Copy link
Member

What is the hook in the output that allows a user to go back to the "run" of the test. E.g. see what the benchmark was, which component it ran on, etc.?

@maximusunc
Copy link
Collaborator Author

Good point. I will add that info in. It will most likely be included in the results metrics.

@sierra-moxon
Copy link
Member

e.g. #10 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants