Unlike other layers, the embedding layer uses a specific form of Tensor called IndexedSlices, whose data contains only the specific indices of the Tensor that we are interested in.
Thus, during the backward pass, we do not have to fill the whole value-tensor-shaped gradient in VarGrad; we can optimize it by building the gradient Tensor only for the referenced part.
In the current NNTrainer code there is no such consideration: it uses a same-shaped but zero-filled Tensor for the portions of the Tensor whose indices are not referenced (a redundantly sized Tensor declaration). A sketch of the idea is given below.
As far as I am concerned, we should work on this part in the near future for memory optimization.
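A minimal C++ sketch of the idea, not NNTrainer's actual Tensor/VarGrad API: the hypothetical `IndexedSlices` struct and `embedding_backward` function below only illustrate how the embedding backward pass could store gradient rows solely for the indices that appear in the batch, instead of allocating a weight-shaped, zero-filled Tensor.

```cpp
#include <cstddef>
#include <iostream>
#include <unordered_map>
#include <vector>

// Hypothetical sparse gradient container, analogous to TensorFlow's
// IndexedSlices: only the rows of the embedding table that were actually
// looked up in the forward pass carry a gradient.
struct IndexedSlices {
  std::vector<size_t> indices;          // row indices touched in the batch
  std::vector<std::vector<float>> rows; // one gradient row per touched index
};

// Backward pass of an embedding lookup: instead of allocating a
// weight-shaped, zero-filled gradient tensor, accumulate gradients only
// for the rows referenced by input_ids.
IndexedSlices embedding_backward(const std::vector<size_t> &input_ids,
                                 const std::vector<std::vector<float>> &grad_out) {
  std::unordered_map<size_t, size_t> slot; // row index -> position in output
  IndexedSlices grad;
  for (size_t i = 0; i < input_ids.size(); ++i) {
    size_t row = input_ids[i];
    auto it = slot.find(row);
    if (it == slot.end()) {
      slot[row] = grad.indices.size();
      grad.indices.push_back(row);
      grad.rows.push_back(grad_out[i]); // first occurrence: copy the row
    } else {
      auto &acc = grad.rows[it->second]; // repeated index: accumulate
      for (size_t d = 0; d < acc.size(); ++d)
        acc[d] += grad_out[i][d];
    }
  }
  return grad;
}

int main() {
  // Batch of 3 token ids drawn from a large vocabulary; only rows 7 and 42
  // need gradient storage, not the whole embedding table.
  std::vector<size_t> input_ids = {42, 7, 42};
  std::vector<std::vector<float>> grad_out = {
      {0.1f, 0.2f}, {0.3f, 0.4f}, {0.5f, 0.6f}};

  IndexedSlices grad = embedding_backward(input_ids, grad_out);
  for (size_t i = 0; i < grad.indices.size(); ++i) {
    std::cout << "row " << grad.indices[i] << ":";
    for (float v : grad.rows[i])
      std::cout << " " << v;
    std::cout << "\n";
  }
  return 0;
}
```

With this representation, the optimizer update for the embedding weight would also only need to touch the listed rows, so memory and compute scale with the number of distinct indices in the batch rather than the vocabulary size.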