You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[Question] Could we consider utilizing the index_mapping feature of IntNBitTableBatchedEmbeddingBagsCodegen to enhance the inference performance of QuantManagedCollisionEmbeddingCollection#2669
In the current version of _mch_remap within the MCHManagedCollisionModule, the searchsorted method is used for remapping, which may not be the most efficient approach for inference. To enhance the inference performance of the QuantManagedCollisionEmbeddingCollection, could we consider using the index_mapping feature of IntNBitTableBatchedEmbeddingBagsCodegen?
The text was updated successfully, but these errors were encountered:
@tiankongdeguiji I think that could work. May be you could write QuantMCHManagedCollisionEmbeddingCollection to try it out?
thx!. The default value of the index_mapping is -1, whereas the default value of the _mch_remap is zch_size. Could you provide some suggestions on how to deal with the difference?
In the current version of _mch_remap within the
MCHManagedCollisionModule
, thesearchsorted
method is used for remapping, which may not be the most efficient approach for inference. To enhance the inference performance of theQuantManagedCollisionEmbeddingCollection
, could we consider using theindex_mapping
feature ofIntNBitTableBatchedEmbeddingBagsCodegen
?The text was updated successfully, but these errors were encountered: