Could you please clarify which pre-trained ToMe model was used to extract the "visual_patch" features, and what value was used for ToMe's "r" parameter? Additionally, I noticed that the "audio_patch" feature is not actually used. Thanks.
I trained the model using the parameter settings specified in the code, and the results are as follows:
Audio Count Acc: 77.48 %
Audio Compt Acc: 60.44 %
Audio Averg Acc: 71.20 %