
Support the Resnet/Squeezenet/Mobilenet for speedup #2579

Merged
merged 90 commits into microsoft:master on Jun 24, 2020

Conversation

zheng-ningxin
Contributor

In this PR, the speedup module adds support for the add/cat operations and for convolution layers with more than one group. I have tested the speedup module on resnet18, squeezenet1_1, and mobilenet_v2, and it works fine.
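
For reference, a minimal sketch of how this support would be exercised end to end, following the NNI 1.x speedup workflow (the pruner config and file names here are illustrative, not taken from this PR):

```python
import torch
from torchvision.models import resnet18
from nni.compression.torch import L1FilterPruner, ModelSpeedup, apply_compression_results

# Prune all Conv2d layers to 50% sparsity and export the masks.
model = resnet18()
pruner = L1FilterPruner(model, [{'sparsity': 0.5, 'op_types': ['Conv2d']}])
pruner.compress()
pruner.export_model(model_path='pruned_resnet18.pth', mask_path='mask.pth')

# Speed up a fresh copy of the model: with this PR, the residual adds
# and grouped convolutions inside resnet18 are handled as well.
model = resnet18()
dummy_input = torch.randn(1, 3, 224, 224)
apply_compression_results(model, 'mask.pth')
ModelSpeedup(model, dummy_input, 'mask.pth').speedup_model()
```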

Ningxin added 30 commits May 14, 2020 01:26
Signed-off-by: Ningxin <[email protected]>
The model should be set to eval mode before the jit.trace call.

Signed-off-by: Ningxin <[email protected]>
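
A one-line illustration of why this matters (illustrative snippet, not from the PR): eval mode freezes dropout and batch-norm behavior, so the traced graph is deterministic.

```python
import torch
from torchvision.models import resnet18

model = resnet18()
model.eval()  # freeze dropout / batch-norm behavior before tracing
traced = torch.jit.trace(model, torch.randn(1, 3, 224, 224))
```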
Signed-off-by: Ningxin <[email protected]>
In the original approach, addmm would also trigger the dependency-set search,
which could lead to a wrong dependency set.

Signed-off-by: Ningxin <[email protected]>
Signed-off-by: Ningxin <[email protected]>
The name of the node is not a unique identifier globally.

Signed-off-by: Ningxin <[email protected]>
mask_conflict can fix the mask conflicts of layers that
have channel dependencies. It should be called before the
speedup function, so that the speedup module can handle models
with residual connection/concat operations.

Signed-off-by: Ningxin <[email protected]>
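
A hedged sketch of the intended call order (the `fix_mask_conflict` entry point and its signature are my assumption about this PR's mask_conflict module):

```python
import torch
from torchvision.models import resnet18
# Assumed module path / entry point for the mask_conflict tool added here.
from nni.compression.torch.utils.mask_conflict import fix_mask_conflict

model = resnet18().eval()
dummy_input = torch.randn(1, 3, 224, 224)
# Resolve conflicting masks on channel-dependent layers first...
fix_mask_conflict('mask.pth', model, dummy_input)
# ...then run the speedup, which can now handle residual add / cat.
```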
Update the interface: if we already have the traced graph of the model,
we do not need to trace the model again.

Signed-off-by: Ningxin <[email protected]>
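
A sketch of the updated interface as I read this commit (the `traced` parameter name is an assumption):

```python
import torch
from torchvision.models import resnet18
from nni.compression.torch.utils.mask_conflict import fix_mask_conflict

# Trace once, in eval mode, and reuse the graph across the analysis tools
# instead of letting each tool re-trace the model.
model = resnet18().eval()
with torch.no_grad():
    traced = torch.jit.trace(model, torch.randn(1, 3, 224, 224))
fix_mask_conflict('mask.pth', traced=traced)  # parameter name assumed
```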
Add unit tests for the tools in analysis_utils to verify the correctness of the
visualization, channel dependency, and mask conflict.

Signed-off-by: Ningxin <[email protected]>
Signed-off-by: Ningxin <[email protected]>
@zheng-ningxin
Contributor Author

I found another problem in the TorchModuleGraph (#2581). It may be too late to fix it before this release (code freeze on 6.22), but fortunately, not many models are affected. I'll try to fix it in the next release. Thanks~

Ningxin added 3 commits June 19, 2020 12:21
Signed-off-by: Ningxin <[email protected]>
Signed-off-by: Ningxin <[email protected]>
@@ -11,13 +11,13 @@

from nni.compression.torch import L1FilterPruner
Contributor

Would you please add test cases to verify the model speedup correctness for resnet, squeezenet and mobilenet like this test case?
https://github.com/microsoft/nni/blob/master/src/sdk/pynni/tests/test_model_speedup.py#L106

Contributor Author

Sure~

input_shapes = [t.type().sizes() for t in input_tensors]
cat_info['in_shape'] = input_shapes
return cat_info

def _extract_shape_info(self, node):
Contributor
@QuanluZhang Jun 22, 2020

This function was written by me, and it only handles view (pretty limited). Maybe we can generalize it to extract different modules' shapes if needed in the future.

Returns
-------
dict
Include auxiliary information for the cat operation.
Contributor

It might be better to explain the contents of the dict.

# after the build_index function.
input_order = []
list_construct_cpp = list(cpp_node.inputs())[0].node()
input_tensors = list(list_construct_cpp.inputs())
Contributor

so the order of tensors returned by .inputs() is the order of input arguments?

Contributor Author

According to my observations and experimental results, yes, it is.
However, because jit itself lacks documentation, I have no official reference to support this point.
I will read the jit source code and double-check it.
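
A quick probe that makes this observable (illustrative, not from the PR): trace a cat of two tensors and print the graph; the prim::ListConstruct node feeding aten::cat lists its inputs in the order they were passed.

```python
import torch

def f(a, b):
    return torch.cat([a, b], dim=1)

traced = torch.jit.trace(f, (torch.randn(1, 2), torch.randn(1, 3)))
print(traced.graph)
# The prim::ListConstruct feeding aten::cat shows its inputs in the
# same order (a, b) that they were passed to torch.cat.
```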

self.torch_graph = build_module_graph(model, dummy_input)

def infer_module_mask(self, module_name, mask=None, in_shape=None, out_shape=None):
def infer_module_mask(self, module_name, last_module, mask=None, in_shape=None, out_shape=None):
"""
Contributor

Please update the docstring accordingly.


Parameters
----------
model : torch.nn.Module
Contributor

Please use an order consistent with the input arguments.

"""
torch.save(self.masks, path)

class CatMaskPadding(MaskFix):
Contributor

Could you explain the logic of cat mask padding in the docstring, to convey the high-level idea of how the conflict is resolved?

# no layer is pruned
continue
elif count == len(layers):
# all the layers have been pruned
Contributor

Even if all the layers have been pruned, is it possible that their masks are still not consistent?

for layer in layers:
module = name_to_module[layer]
w_shape = module.weight.data.size()
w_mask = torch.ones(w_shape).to(device)
Contributor

so the mask is all ones?

Contributor Author

Cat concatenates the input masks to form the output mask, so when some of the input layers are not pruned, we still need to pass the masks of those un-pruned layers (all ones) to the cat operation to ensure that the shape of the final output mask is correct.
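
A self-contained sketch of that padding logic (tensor sizes are made up for illustration):

```python
import torch

# Two conv layers feed a cat along the channel dimension; only the
# first one was pruned by the pruner.
conv1_out_mask = torch.tensor([1., 0., 1., 0.])  # 4 channels, 2 pruned
conv2_out_mask = torch.ones(6)                   # un-pruned layer: pad with all ones
# The cat's output mask is just the concatenation of the input masks,
# so the all-ones padding keeps its shape consistent with the real output.
cat_out_mask = torch.cat([conv1_out_mask, conv2_out_mask])
assert cat_out_mask.numel() == 4 + 6
```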

graph : torch._C.Graph
masks : dict
a dict object that stores the masks
graph : torch._C.torch.jit.TopLevelTracedModule
Contributor

inconsistent order with input arguments

@chicm-ms chicm-ms merged commit e6817d2 into microsoft:master Jun 24, 2020
Successfully merging this pull request may close these issues.

Support for more architecture and functions
pruned model size no change and inference time is even longer