Specific Settings of the ToMe Model #3

Open
leeyf99 opened this issue Oct 9, 2024 · 1 comment

leeyf99 commented Oct 9, 2024

Could you please clarify which pre-trained ToMe model was used to extract the "visual_patch" features, and what value was set for ToMe's "r" parameter? Additionally, I noticed that the "audio_patch" feature is not actually being utilized. Thanks.
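
For context on why "r" matters here: in the reference ToMe implementation, as far as I understand it, each transformer block merges up to r tokens, capped at half of the non-protected tokens, so r directly determines how many patch tokens remain in the extracted features. The sketch below only illustrates that arithmetic. The function name `tome_token_schedule` is hypothetical, and the ViT-B/16 numbers (197 tokens, 12 blocks, r=16) are assumptions chosen for illustration, not the settings used in this repository.

```python
def tome_token_schedule(num_tokens, num_layers, r, protected=1):
    """Token count after each block when up to r tokens are merged per block.

    Assumption: merging is capped at half of the non-protected tokens,
    where `protected` counts tokens such as the class token.
    """
    counts = []
    t = num_tokens
    for _ in range(num_layers):
        # Number of tokens actually merged in this block.
        merged = max(0, min(r, (t - protected) // 2))
        t -= merged
        counts.append(t)
    return counts


if __name__ == "__main__":
    # Hypothetical ViT-B/16 at 224x224: 196 patches + 1 class token, 12 blocks.
    schedule = tome_token_schedule(num_tokens=197, num_layers=12, r=16)
    print(schedule[-1])  # 11 tokens remain after the last block
```

Under these assumed numbers, a different r would change the number of tokens in the final feature, which is why the exact value is needed to reproduce the "visual_patch" features.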

xia-zhe commented Nov 18, 2024

I trained the model using the parameter settings specified in the code, and the results are as follows:
Audio Count Acc: 77.48 %
Audio Compt Acc: 60.44 %
Audio Averg Acc: 71.20 %

Visual Count Acc: 76.69 %
Visual Local Acc: 77.06 %
Visual Averg Acc: 76.88 %

Audio-Visual Exist Acc: 76.92 %
Audio-Visual Count Acc: 76.36 %
Audio-Visual Local Acc: 59.89 %
Audio-Visual Compt Acc: 63.67 %
Audio-Visual Templ Acc: 66.55 %
Audio-Visual Averg Acc: 69.17 %

---->Overall Accuracy: 71.57 %

Could you clarify where the issue occurred? Is it related to the "audio_patch" feature?
