Enable AIT lowering for eager processed models on AIMP #2055

Closed

PaulZhang12
Contributor

Summary: Fix lowering for the eager model processing path on AIMP. Before this diff, the device passed to merge_pooled_embeddings had no associated index, which caused lowering to fail. Once the index was present, the behavior described in this post applied: https://fb.workplace.com/groups/gpuinference/permalink/830694592261282/. That resulted in the need for the separate module suggested by kflu.

Differential Revision: D57297185
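
For readers unfamiliar with the "device without an index" case: in PyTorch, `torch.device("cuda")` has `index` equal to `None`, while `torch.device("cuda:0")` has `index` equal to `0`. The sketch below illustrates the general idea of normalizing a device spec so that it always carries an explicit index. It is not the change made in this diff; the helper names and the default index of 0 are assumptions made for illustration, and it works on plain strings so it runs without PyTorch.

```python
from typing import Optional, Tuple


def parse_device(spec: str) -> Tuple[str, Optional[int]]:
    """Split a device string such as "cuda" or "cuda:1" into (type, index)."""
    device_type, sep, index = spec.partition(":")
    # No ":" in the string means no index was given.
    return device_type, (int(index) if sep else None)


def with_explicit_index(spec: str, default_index: int = 0) -> str:
    """Return the device string with an index filled in when it is missing.

    Hypothetical helper: default_index=0 is an assumption. Real code might
    prefer the current device of the running process instead.
    """
    device_type, index = parse_device(spec)
    if device_type == "cpu":
        # CPU devices are left without an index in this sketch.
        return "cpu"
    if index is None:
        index = default_index
    return f"{device_type}:{index}"
```

With this sketch, `with_explicit_index("cuda")` yields `"cuda:0"`, and a spec that already has an index, such as `"cuda:1"`, is returned unchanged.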

facebook-github-bot added the CLA Signed label May 29, 2024
@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D57297185

PaulZhang12 added a commit to PaulZhang12/torchrec that referenced this pull request May 29, 2024

PaulZhang12 added a commit to PaulZhang12/torchrec that referenced this pull request Jun 3, 2024

PaulZhang12 added a commit to PaulZhang12/torchrec that referenced this pull request Jun 3, 2024

PaulZhang12 added a commit that referenced this pull request Jun 5, 2024
Summary:
Pull Request resolved: #2055

Fix lowering for the eager model processing path on AIMP. Before this diff, the device passed to merge_pooled_embeddings had no associated index, which caused lowering to fail. Once the index was present, the behavior described in this post applied: https://fb.workplace.com/groups/gpuinference/permalink/830694592261282/. That resulted in the need for the separate module suggested by kflu.

Reviewed By: ZhengkaiZ

Differential Revision: D57297185

fbshipit-source-id: a2aa78c731dca91d123f7d7e91b8555d61cb890f
Labels: CLA Signed, fb-exported

3 participants