Conversation
Movement pruner is an implementation of movement pruning.
This is a pruning by step algorithm, the masks may change during each step.
what is "step algorithm"?
I want to say "pruning by step algorithm" means this pruner will generate and apply masks during each optimizer step.
I think every pruner prunes step by step? What's the concrete meaning of "step" here?
Oh yes, it is easy to misunderstand here. This means that after each `optimizer.step()`, a new mask is applied to the model. I will update the docstring later.
change to
This is a "fine-pruning" algorithm, which means the masks may change during each fine-tuning step.
class PrunerScoredModuleWrapper(Module):
    """
    Wrap an module to enable data parallel, forward method customization and buffer registeration.
an -> a
Fixed.
nni/algorithms/compression/v2/pytorch/pruning/tools/metrics_calculator.py (resolved comment)
# ignore the parameters with `weight_score` in name if you want to finetune with masks
optimizer_grouped_parameters = [{
    "params": [p for n, p in model.named_parameters() if "weight_score" not in n and p.requires_grad]
}]
What is `weight_score` for? Can we handle this automatically so that users don't have to modify the optimizer manually? Besides, does `weight_score` limit our applicable scenario to a specific implementation version/repo of transformers?
`weight_score` is registered in the wrapper as a parameter; it is the sum of `-weight * weight_grad`.
It's OK if users directly use `optimizer = Adam(model.parameters(), lr=2e-5)`, just some computing resources are wasted. But it's a good idea to handle this automatically, I will try it.
`weight_score` will not limit our applicable scenario; any module that has `weight` can use this pruner.
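As a rough illustration of the score described above (a running sum of `-weight * weight_grad`, stored as a parameter on the wrapper), a simplified wrapper might look like the following. `ScoredLinear` and `accumulate_score` are hypothetical names for illustration, not the wrapper added in this PR.

```python
import torch
import torch.nn as nn

class ScoredLinear(nn.Module):
    """Toy stand-in for a scored wrapper around a Linear layer (sketch only)."""

    def __init__(self, linear: nn.Linear):
        super().__init__()
        self.module = linear
        # Registered as a parameter (not a buffer) so it appears in
        # model.named_parameters(); the task optimizer should skip it.
        self.weight_score = nn.Parameter(torch.zeros_like(linear.weight))

    def forward(self, x):
        return self.module(x)

    @torch.no_grad()
    def accumulate_score(self):
        # Movement score: accumulate -weight * weight_grad after backward().
        if self.module.weight.grad is not None:
            self.weight_score -= self.module.weight * self.module.weight.grad
```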
> Can we handle this automatically so that users don't have to modify the optimizer manually?

Fixed.
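For reference, one way the exclusion could be handled automatically (so users can keep `Adam(model.parameters(), lr=2e-5)` untouched) is to drop the score parameters from the optimizer's param groups before training. This is only a sketch of the idea, not necessarily the solution adopted in this PR; `strip_score_params` is a hypothetical helper.

```python
def strip_score_params(model, optimizer):
    # Collect the ids of all score parameters registered by the wrappers.
    score_ids = {id(p) for n, p in model.named_parameters() if "weight_score" in n}
    # Remove them from every param group so the task optimizer never updates them.
    for group in optimizer.param_groups:
        group["params"] = [p for p in group["params"] if id(p) not in score_ids]
    return optimizer
```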
Description
Implement movement pruning from this paper: https://arxiv.org/abs/2005.07683
Checklist
How to test