You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First of all, I would like say that your work is amazing!!! It has helped me a lot!!! Thank you!
However, I have question about the difference between Hitnet_SF and Hitnet_KITTI. I see that the Hitnet_SF doesn't give the output of hypothesis confidence (w), so that during training, the loss for w won't be taken into account. Moreover, it seems that the Initialization part and Propagation part are also different between 2 models in your code. I see in the supplementary material of original paper, the authors say that the difference between models for each dataset is simply channel size of feature extractor.
I saw that your response for a closed issue that you succeeded on converting the original model weight, which is in format tensorflow buffer provided by the authors, into the pytorch model and obtain almost the same result on SF final_pass. It would be so grateful for me if you would share how did you interpret the tensorflow pb model to rewrite the entire model in pytorch.
Thank you again for your work and look forward for your response!
The text was updated successfully, but these errors were encountered:
Hitnet_SF and Hitnet_KITTI model structure are exactly same with origin tensorflow checkpoints. so I think there is some difference here between the author's implementation and the paper description.
for the weights convert from tensorflow to pytorch, i convert them one by one manually. and i write some unit tests to ensure the correctness of numerical conversions.
First of all, I would like say that your work is amazing!!! It has helped me a lot!!! Thank you!
However, I have question about the difference between Hitnet_SF and Hitnet_KITTI. I see that the Hitnet_SF doesn't give the output of hypothesis confidence (w), so that during training, the loss for w won't be taken into account. Moreover, it seems that the Initialization part and Propagation part are also different between 2 models in your code. I see in the supplementary material of original paper, the authors say that the difference between models for each dataset is simply channel size of feature extractor.
I saw that your response for a closed issue that you succeeded on converting the original model weight, which is in format tensorflow buffer provided by the authors, into the pytorch model and obtain almost the same result on SF final_pass. It would be so grateful for me if you would share how did you interpret the tensorflow pb model to rewrite the entire model in pytorch.
Thank you again for your work and look forward for your response!
The text was updated successfully, but these errors were encountered: