[bc breaking] change x, w, dL_dY variable names to input, weight, grad_output #323

vkuzo · 2024-07-22T16:41:02Z

Stack from ghstack (oldest at bottom):

Summary:

The following naming scheme matches the rest of PyTorch better:

// forward
output = input @ weight_t
// backward
grad_input = grad_output @ weight
grad_weight = input_t @ grad_output

This PR changes all the previous references to x, w, dL_dY to
match the naming scheme above.

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

Differential Revision: D60072596

…d_output Summary: The following naming scheme matches the rest of PyTorch better: ``` // forward output = input @ weight_t // backward grad_input = grad_output @ weight grad_weight = input_t @ grad_output ``` This PR changes all the previous references to `x`, `w`, `dL_dY` to match the naming scheme above. Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

…d_output Summary: The following naming scheme matches the rest of PyTorch better: ``` // forward output = input @ weight_t // backward grad_input = grad_output @ weight grad_weight = input_t @ grad_output ``` This PR changes all the previous references to `x`, `w`, `dL_dY` to match the naming scheme above. Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 5d1c7063d98ef2adebdb14b1757cc349d41e3020 Pull Request resolved: #323

…weight, grad_output" Summary: The following naming scheme matches the rest of PyTorch better: ``` // forward output = input @ weight_t // backward grad_input = grad_output @ weight grad_weight = input_t @ grad_output ``` This PR changes all the previous references to `x`, `w`, `dL_dY` to match the naming scheme above. Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

…d_output Summary: The following naming scheme matches the rest of PyTorch better: ``` // forward output = input @ weight_t // backward grad_input = grad_output @ weight grad_weight = input_t @ grad_output ``` This PR changes all the previous references to `x`, `w`, `dL_dY` to match the naming scheme above. Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 52683b06c816b8ab1cdef5142d5a7eaea1a9e0f2 Pull Request resolved: #323

vkuzo · 2024-07-22T16:47:04Z

.github/workflows/ufmt.yml

@@ -23,4 +23,7 @@ jobs:
        pip install black==23.3.0 usort==1.0.6 ufmt==2.1.0 libcst==1.0.1
    - name: Analyzing the code with ufmt
      run: |
+        ufmt format .


L26:28 is for easier debugging of differences between local machine and CI ufmt

is the ufmt config different than CI?

I've noticed this on a couple of PRs in the past. A better fix would be to align ufmt versions + env, but regardless this is useful for debugging.

…weight, grad_output" Summary: The following naming scheme matches the rest of PyTorch better: ``` // forward output = input @ weight_t // backward grad_input = grad_output @ weight grad_weight = input_t @ grad_output ``` This PR changes all the previous references to `x`, `w`, `dL_dY` to match the naming scheme above. Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

…d_output Summary: The following naming scheme matches the rest of PyTorch better: ``` // forward output = input @ weight_t // backward grad_input = grad_output @ weight grad_weight = input_t @ grad_output ``` This PR changes all the previous references to `x`, `w`, `dL_dY` to match the naming scheme above. Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: d1d6ffbcc37bcf1b39709e50156d6018120a2261 Pull Request resolved: #323

…weight, grad_output" Summary: The following naming scheme matches the rest of PyTorch better: ``` // forward output = input @ weight_t // backward grad_input = grad_output @ weight grad_weight = input_t @ grad_output ``` This PR changes all the previous references to `x`, `w`, `dL_dY` to match the naming scheme above. Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

…d_output Summary: The following naming scheme matches the rest of PyTorch better: ``` // forward output = input @ weight_t // backward grad_input = grad_output @ weight grad_weight = input_t @ grad_output ``` This PR changes all the previous references to `x`, `w`, `dL_dY` to match the naming scheme above. Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 37a6a51987b55cd894b70fa54e6b8669f48ccb47 Pull Request resolved: #323

drisspg · 2024-07-22T21:37:45Z

float8_experimental/float8_linear.py

-        ctx.save_for_backward(fp8_amax_dL_dY, fp8_amax_history_dL_dY, fp8_scale_dL_dY)
+        ctx.save_for_backward(
+            fp8_amax_grad_output, fp8_amax_history_grad_output, fp8_scale_grad_output
+        )
        ctx.scale_fn_name = scale_fn_name
        ctx.is_amax_initialized = is_amax_initialized
        ctx.linear_mm_config = linear_mm_config
        return tensor

    @staticmethod
    def backward(ctx, go):


should we update go? This one has confused me in the past

this PR only contains the user-facing part since that's the most crucial. LOC is already high, so IMO better to rename the non-user facing things in subsequent PRs, to keep things smaller and more reviewable.

drisspg

Awesome, thanks!

vkuzo · 2024-07-22T21:47:14Z

@vkuzo has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-07-23T02:56:37Z

This pull request has been merged in 603efc2.

Summary: In #323 we changed the user facing variable notation from `x/w/dL_dY` to `input/weight/grad_output`. This PR follows up by changing most of the internal variables to also match the new notation, to reduce confusion. Test Plan: ``` ./test/test_everything.sh ``` Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

…rad_output notation" Summary: In #323 we changed the user facing variable notation from `x/w/dL_dY` to `input/weight/grad_output`. This PR follows up by changing most of the internal variables to also match the new notation, to reduce confusion. Test Plan: ``` ./test/test_everything.sh ``` Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

…ion" Summary: In #323 we changed the user facing variable notation from `x/w/dL_dY` to `input/weight/grad_output`. This PR follows up by changing most of the internal variables to also match the new notation, to reduce confusion. Test Plan: ``` ./test/test_everything.sh ``` Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

Summary: Pull Request resolved: #335 In #323 we changed the user facing variable notation from `x/w/dL_dY` to `input/weight/grad_output`. This PR follows up by changing most of the internal variables to also match the new notation, to reduce confusion. Reviewed By: weifengpy Differential Revision: D60252071 fbshipit-source-id: b91ec5b975df550962418eafc93f1904d64a3dd8

vkuzo mentioned this pull request Jul 19, 2024

[bc breaking] unify filtering functions #322

Closed

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jul 22, 2024

vkuzo commented Jul 22, 2024

View reviewed changes

drisspg reviewed Jul 22, 2024

View reviewed changes

drisspg approved these changes Jul 22, 2024

View reviewed changes

facebook-github-bot closed this in 603efc2 Jul 23, 2024

facebook-github-bot added the Merged label Jul 23, 2024

vkuzo mentioned this pull request Jul 25, 2024

rename all variables to use input/weight/grad_output notation #335

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bc breaking] change x, w, dL_dY variable names to input, weight, grad_output #323

[bc breaking] change x, w, dL_dY variable names to input, weight, grad_output #323

vkuzo commented Jul 22, 2024 •

edited

Loading

vkuzo Jul 22, 2024

drisspg Jul 22, 2024

vkuzo Jul 22, 2024

drisspg Jul 22, 2024

vkuzo Jul 22, 2024

drisspg left a comment

vkuzo commented Jul 22, 2024

facebook-github-bot commented Jul 23, 2024

[bc breaking] change x, w, dL_dY variable names to input, weight, grad_output #323

[bc breaking] change x, w, dL_dY variable names to input, weight, grad_output #323

Conversation

vkuzo commented Jul 22, 2024 • edited Loading

vkuzo Jul 22, 2024

Choose a reason for hiding this comment

drisspg Jul 22, 2024

Choose a reason for hiding this comment

vkuzo Jul 22, 2024

Choose a reason for hiding this comment

drisspg Jul 22, 2024

Choose a reason for hiding this comment

vkuzo Jul 22, 2024

Choose a reason for hiding this comment

drisspg left a comment

Choose a reason for hiding this comment

vkuzo commented Jul 22, 2024

facebook-github-bot commented Jul 23, 2024

vkuzo commented Jul 22, 2024 •

edited

Loading