
[Relay] Remove DynamicToStatic pass from graph runtime build #10691

Merged
8 commits merged into apache:main on Mar 23, 2022

Conversation

@masahi (Member) commented Mar 20, 2022

Closes #10692

To solve this problem, we can either remove this pass from the relay.build(...) pipeline or run DynamicToStatic in both the VM and non-VM paths. I propose to remove it because (1) DynamicToStatic is usually supposed to be applied right after model import, and (2) the only case where running DynamicToStatic during relay.build(...) helps is when the input is entirely static but a frontend fails to produce a static mod AND the user forgets to run DynamicToStatic after model import.

I hope the latter case is rare, but if not, that's something we should fix on the frontend side. We should avoid relying on the DynamicToStatic that runs during relay.build(...), since not all use cases of TVM go through relay.build(...) (BYOC, for example).
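For illustration, here is a minimal sketch (not taken from the PR; the tiny reshape module is a made-up stand-in) of the workflow this change expects: if a frontend leaves a dynamic op in an otherwise static model, run `DynamicToStatic` explicitly right after import, before handing the module to `relay.build(...)`.

```python
import tvm
from tvm import relay

# Stand-in for a freshly imported module: the input is fully static, but the
# graph carries a dynamic reshape because the new shape is computed via shape_of.
x = relay.var("x", shape=(1, 4), dtype="float32")
y = relay.reshape(x, relay.shape_of(x))  # lowers to a dynamic (dyn.reshape) op
mod = tvm.IRModule.from_expr(relay.Function([x], y))

# With this PR, relay.build(...) no longer runs DynamicToStatic for you, so
# convert dynamic ops to their static forms explicitly after model import.
mod = relay.transform.DynamicToStatic()(mod)

# The module is now static and builds fine on the graph runtime path.
lib = relay.build(mod, target="llvm")
```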

@masahi (Member Author) commented Mar 22, 2022

I realized that this is not specific to the meta scheduler (the one I tested); the same error could happen with the auto scheduler as well, since it also relies on the structure of the input Relay subgraph for workload lookup.
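To illustrate why that matters, here is a hedged sketch of a typical auto-scheduler workflow (names follow the public TVM API; the tiny conv2d module and the log file path are made-up placeholders). Tasks are extracted from the module as imported, so if `relay.build(...)` later rewrote the graph, e.g. via `DynamicToStatic`, the compiled workloads would no longer match the tuned ones and the `ApplyHistoryBest` lookup would miss.

```python
import tvm
from tvm import auto_scheduler, relay

# Made-up stand-in model; in practice mod/params come from a frontend import.
data = relay.var("data", shape=(1, 3, 32, 32), dtype="float32")
weight = relay.var("weight", shape=(8, 3, 3, 3), dtype="float32")
mod = tvm.IRModule.from_expr(
    relay.Function([data, weight], relay.nn.conv2d(data, weight, padding=(1, 1)))
)
params = {}
target = tvm.target.Target("llvm")
log_file = "tuning_records.json"  # hypothetical log from a previous tuning run

# Workloads are extracted from the module in its imported form.
tasks, task_weights = auto_scheduler.extract_tasks(mod["main"], params, target)

# At build time, the lookup only hits if relay.build(...) compiles the same
# subgraph structure that the tasks above were extracted from.
with auto_scheduler.ApplyHistoryBest(log_file):
    with tvm.transform.PassContext(
        opt_level=3, config={"relay.backend.use_auto_scheduler": True}
    ):
        lib = relay.build(mod, target=target, params=params)
```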

All tests have passed without DynamicToStatic in relay.build(...). As the diff shows, this investigation uncovered a few cases where frontends carelessly introduce dynamic ops, and some TF frontend tests that use the graph runtime for a model containing dynamic inputs. The latter only happen to work thanks to DynamicToStatic, because the dynamic inputs are bound to a constant tensor after the frontend conversion and before relay.build(...). That is a very strange usage, and those tests should be using the VM in the first place.

So I believe it is better to remove DynamicToStatic from relay.build(...) to prevent that kind of sloppy coding in the frontends. If people instead prefer running DynamicToStatic in the VM path as well, I'm fine doing that, but I'd like to be convinced that such a use of DynamicToStatic really solves non-trivial problems. Let me know your thoughts @mbrookhart @tkonolige @comaniac @zxybazh

@@ -846,7 +846,7 @@ def convert_shape(self, op):
input_tensors = self.get_input_tensors(op)
assert len(input_tensors) == 1, "input tensors length should be 1"

-out = _op.shape_of(self.get_tensor_expr(input_tensors[0]))
+out = shape_of(self.get_tensor_expr(input_tensors[0]))
masahi (Member Author) commented:
Before this PR, tflite mobilenet was returning a dynamic shape output 🤦‍♂️
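For context, a hedged sketch of the distinction (the real helper lives in the shared frontend utilities; details may differ): `_op.shape_of` always emits a runtime shape-of node, whereas the frontend `shape_of` helper can fold a fully static shape into a constant, keeping the output static.

```python
import tvm
from tvm import relay
from tvm.relay import op as _op


def shape_of_sketch(x: relay.Expr, dtype: str = "int64") -> relay.Expr:
    # Assumes x has already been type-inferred so checked_type is populated
    # (the real helper performs its own type inference).
    ttype = x.checked_type
    if not any(isinstance(dim, tvm.tir.Any) for dim in ttype.shape):
        # Fully static shape: return it as a constant so the graph stays static.
        return relay.const([int(dim) for dim in ttype.shape], dtype)
    # A dynamic dimension is present: a runtime shape_of is unavoidable.
    return _op.shape_of(x, dtype)
```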

pass_seqs.push_back(transform::CombineParallelConv2D(3));
pass_seqs.push_back(transform::CombineParallelDense(3));
pass_seqs.push_back(transform::CombineParallelBatchMatmul(3));
pass_seqs.push_back(transform::FoldConstant());
pass_seqs.push_back(transform::FoldScaleAxis());
pass_seqs.push_back(transform::SimplifyExpr());
masahi (Member Author) commented:
I realized that SimplifyExpr practically depends on FoldConstant being applied beforehand, so I swapped the order.
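As a hedged illustration of that ordering constraint, here is a Python sketch of the same idea (the actual change is in the C++ pass list above): simplification patterns that expect already-folded constants only fire if `FoldConstant` runs first.

```python
import tvm
from tvm import relay


def optimize(mod: tvm.IRModule) -> tvm.IRModule:
    seq = tvm.transform.Sequential(
        [
            relay.transform.FoldConstant(),  # fold constants first ...
            relay.transform.SimplifyExpr(),  # ... then pattern-based simplification
        ]
    )
    with tvm.transform.PassContext(opt_level=3):
        return seq(mod)
```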

if default_value == None:
output = tf.sparse_to_dense(indices, oshape, values)
compare_tf_with_tvm(
-[sparse_indices, sparse_values], ["indices:0", "values:0"], output.name
+[sparse_indices, sparse_values], ["indices:0", "values:0"], output.name, mode="vm"
masahi (Member Author) commented:
This test was previously passing on the graph runtime only because the dynamic input is bound to a constant tensor before relay.build(...) in the TF tests. Such usage of dynamic inputs makes no sense, so I just changed the test to use the VM.

np_data,
"",
["UniqueWithCounts:0", "UniqueWithCounts:1", "UniqueWithCounts:2"],
mode="vm",
masahi (Member Author) commented:
Unique is naturally a dynamic op, but for some reason this test was running on the graph runtime, and it happened to work only because the dynamic input is bound to a constant tensor before relay.build(...). So the test was effectively running unique(const_tensor), which is not really useful.

@tkonolige (Contributor) commented:
I am in favor of removing DynamicToStatic from the default set of passes.

@junrushao merged commit 4c608be into apache:main Mar 23, 2022
pfk-beta pushed a commit to pfk-beta/tvm that referenced this pull request Apr 11, 2022

Successfully merging this pull request may close these issues.

[Bug] DynamicToStatic() in the non-VM build path causes ApplyHistoryBest to fail