
Request for Integrating the new NAS algorithm: Cream #2705

Merged 74 commits on Nov 27, 2020.

Commits (74)
a426e1a
integrate CREAM NAS algorithm
penghouwen Jul 20, 2020
57a2c40
Update README.md
penghouwen Jul 20, 2020
c15832f
Update Cream.md
penghouwen Jul 31, 2020
806937c
Update Cream.md
penghouwen Jul 31, 2020
b13fed0
Update Cream.md
penghouwen Jul 31, 2020
d7c3217
Update requirements.txt
penghouwen Jul 31, 2020
1951db0
Update Cream.md
penghouwen Jul 31, 2020
bce9cf2
Update requirements.txt
penghouwen Jul 31, 2020
cda252b
Update Cream.md
penghouwen Jul 31, 2020
29b6d5d
Update Cream.md
penghouwen Jul 31, 2020
e548cbc
Update Cream.md
penghouwen Jul 31, 2020
d5c95c6
Update Cream.md
penghouwen Jul 31, 2020
c73b95c
Update trainer.py
penghouwen Aug 1, 2020
0adaf7c
Update mutator.py
penghouwen Aug 1, 2020
85f01e9
Update Cream.md
penghouwen Aug 3, 2020
047fd86
Update Cream.md
penghouwen Aug 3, 2020
b22f92c
Update Cream.md
penghouwen Aug 3, 2020
3ee7591
Update Cream.md
penghouwen Aug 3, 2020
be81d53
Update Cream.md
penghouwen Aug 3, 2020
9535466
Fix pipeline for merging into NNI
ultmaster Aug 4, 2020
0892e66
Fix typo
ultmaster Aug 4, 2020
999d18c
Merge pull request #1 from ultmaster/fix-cream-before-merge
penghouwen Aug 4, 2020
2e13a23
Fix pipeline
ultmaster Aug 5, 2020
22a3f46
Add files via upload
penghouwen Aug 7, 2020
b1777f0
Update Cream.md
penghouwen Aug 7, 2020
4fcbaa9
Update CDARTS.md
penghouwen Aug 7, 2020
2205433
Update Cream.md
penghouwen Aug 7, 2020
b277b96
Update CDARTS.md
penghouwen Aug 7, 2020
8d413bf
Update CDARTS.md
penghouwen Aug 7, 2020
cc9f336
Update distributed_train.sh
penghouwen Sep 3, 2020
9494493
Update distributed_test.sh
penghouwen Sep 3, 2020
6a332ff
Update Cream.md
penghouwen Sep 3, 2020
b35ccac
init
Sep 27, 2020
c872739
Merge pull request #2 from mapleam/master
penghouwen Sep 27, 2020
9289614
Update supernet.py
Z7zuqer Sep 27, 2020
ab9d398
1)remove timm
Sep 27, 2020
82eee8d
Merge pull request #3 from mapleam/master
penghouwen Sep 27, 2020
2559697
Delete cream.jpg
penghouwen Oct 22, 2020
e48d293
Add files via upload
penghouwen Oct 22, 2020
a71563b
Update Cream.md
penghouwen Oct 22, 2020
c00c58e
version 1.0
Z7zuqer Nov 18, 2020
4d72a70
version 2.0
Z7zuqer Nov 21, 2020
60e5197
Merge pull request #4 from mapleam/master
penghouwen Nov 23, 2020
37518fa
Update Cream.md
penghouwen Nov 23, 2020
e04200c
Update Cream.md
penghouwen Nov 23, 2020
47dce8c
Update Cream.md
penghouwen Nov 23, 2020
5231d3b
Update Cream.md
penghouwen Nov 23, 2020
ce698c3
Update Cream.md
penghouwen Nov 23, 2020
0d63ceb
Update Cream.md
penghouwen Nov 23, 2020
274fb23
version 3.0
Z7zuqer Nov 23, 2020
931c47b
Merge branch 'master' into master
Z7zuqer Nov 23, 2020
59b1339
Merge pull request #5 from mapleam/master
penghouwen Nov 23, 2020
c162f39
Update Cream.md
penghouwen Nov 23, 2020
de8c261
Update Cream.md
penghouwen Nov 23, 2020
ae45787
Update Cream.md
Z7zuqer Nov 23, 2020
43101c1
Update retrain.py
Z7zuqer Nov 23, 2020
36ddeaf
Update test.py
Z7zuqer Nov 23, 2020
97451af
Update retrain.py
Z7zuqer Nov 23, 2020
96cfb17
Merge branch 'master' into master
Z7zuqer Nov 23, 2020
d735a25
Merge pull request #6 from mapleam/master
penghouwen Nov 23, 2020
8d24833
version 4.0
Z7zuqer Nov 23, 2020
a53cc5f
Merge remote-tracking branch 'origin/master'
Z7zuqer Nov 23, 2020
cce57e5
version 4.0
Z7zuqer Nov 23, 2020
d9cfd2f
Merge pull request #7 from mapleam/master
penghouwen Nov 24, 2020
0f8f8bf
Update Cream.md
penghouwen Nov 24, 2020
879bfeb
Update Cream.md
penghouwen Nov 24, 2020
85b17b4
Merge branch 'master' into master
penghouwen Nov 24, 2020
0cf817b
Move code dir
ultmaster Nov 24, 2020
fdeb0b9
Fix trainer and retrain optimizer
ultmaster Nov 25, 2020
d11e4cf
Update Cream.md
penghouwen Nov 25, 2020
06af2cb
Fix syntax warning
ultmaster Nov 26, 2020
6cb3b97
Fix syntax warning (again)
ultmaster Nov 26, 2020
9996098
Fix docs build warnings
ultmaster Nov 26, 2020
02b8e72
Merge branch 'master' of github.com:penghouwen/nni into cream-master
ultmaster Nov 26, 2020
1 change: 1 addition & 0 deletions README.md
@@ -135,6 +135,7 @@ Within the following table, we summarized the current NNI capabilities, we are g
<li><a href="docs/en_US/NAS/Proxylessnas.md">ProxylessNAS</a></li>
<li><a href="docs/en_US/Tuner/BuiltinTuner.md#NetworkMorphism">Network Morphism</a></li>
<li><a href="docs/en_US/NAS/TextNAS.md">TextNAS</a></li>
<li><a href="docs/en_US/NAS/Cream.md">Cream</a></li>
</ul>
</ul>
<a href="docs/en_US/Compressor/Overview.md">Model Compression</a>
79 changes: 79 additions & 0 deletions docs/en_US/NAS/Cream.md
@@ -0,0 +1,79 @@
# Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search

## Introduction
One-shot weight sharing methods have recently drawn great attention in neural architecture search because of their high efficiency and competitive performance. However, weight sharing across models has an inherent deficiency: insufficient training of subnetworks within the hypernetwork. To alleviate this problem, we present a simple yet effective architecture distillation method. The central idea is that subnetworks can learn collaboratively and teach each other throughout the training process, with the aim of boosting the convergence of individual models. We introduce the concept of the prioritized path, which refers to the architecture candidates exhibiting superior performance during training. Distilling knowledge from the prioritized paths boosts the training of the subnetworks. Since the prioritized paths change on the fly depending on their performance and complexity, the paths finally obtained are the cream of the crop. We directly select the most promising one from the prioritized paths as the final architecture, without using other complex search methods such as reinforcement learning or evolutionary algorithms. Experiments on ImageNet verify that this path distillation method improves the convergence rate and performance of the hypernetwork, as well as the training of the subnetworks. The discovered architectures outperform the recent MobileNetV3 and EfficientNet families under aligned settings. Moreover, experiments on object detection and a more challenging search space demonstrate the generality and robustness of the proposed method.
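
To make the mechanism above concrete, here is a minimal, hypothetical sketch of prioritized-path distillation. It is not the NNI implementation: the board class, the `sample_path` callback, and the `supernet(inputs, arch)` calling convention are illustrative assumptions only.

```python
import copy
import torch
import torch.nn.functional as F


class PrioritizedBoard:
    """Hypothetical board of prioritized paths: keeps the top-K subnetworks
    seen so far, ranked by validation accuracy."""

    def __init__(self, capacity=10):
        self.capacity = capacity
        self.entries = []  # list of (accuracy, architecture)

    def update(self, accuracy, architecture):
        # Insert the new candidate and keep only the top-`capacity` paths.
        self.entries.append((accuracy, copy.deepcopy(architecture)))
        self.entries.sort(key=lambda e: e[0], reverse=True)
        self.entries = self.entries[:self.capacity]

    def best(self):
        return self.entries[0][1] if self.entries else None


def distillation_loss(student_logits, teacher_logits, temperature=1.0):
    # Standard soft-target knowledge-distillation loss.
    return F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2


def search_step(supernet, board, inputs, targets, sample_path, alpha=0.5):
    """One hypothetical search step: train the sampled subnetwork with the CE
    loss, plus distillation from the best prioritized path if the board is
    non-empty."""
    arch = sample_path()                     # sample a subnetwork (illustrative)
    student_logits = supernet(inputs, arch)  # forward through the sampled path
    loss = F.cross_entropy(student_logits, targets)

    teacher_arch = board.best()
    if teacher_arch is not None:
        with torch.no_grad():
            teacher_logits = supernet(inputs, teacher_arch)
        loss = loss + alpha * distillation_loss(student_logits, teacher_logits)
    return loss
```

The key design choice is that the teacher is not a fixed pretrained network but whichever prioritized path currently performs best, so the distillation target improves as the search progresses.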

## Reproduction Results

## Examples

[Example code](https://github.com/microsoft/nni/tree/master/examples/nas/cream)

## Requirements
* python >= 3.6
* torch >= 1.2
* torchscope
* apex (optional; if used, make sure your nvcc CUDA version matches your PyTorch CUDA version)

## Data Preparation
You need to first download [ImageNet-2012](http://www.image-net.org/) to the folder `./data/imagenet` and move the validation set to the subfolder `./data/imagenet/val`. To move the validation set, you could use the following script: <https://raw.githubusercontent.com/soumith/imagenetloader.torch/master/valprep.sh>

Put the ImageNet data in `${Root}/data`. The directory layout should look like the following:
```buildoutcfg
${Root}/data/imagenet/train
${Root}/data/imagenet/val
...
```
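
As a quick sanity check of the layout (not part of the example code; it only assumes the standard `ImageFolder`-style directory structure shown above), you can try:

```python
import torchvision.datasets as datasets

# Both splits should load as ImageFolder datasets with 1000 classes each.
train_set = datasets.ImageFolder('./data/imagenet/train')
val_set = datasets.ImageFolder('./data/imagenet/val')
print(len(train_set.classes), len(val_set.classes))  # expected: 1000 1000
```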


## Quick Start

### I. Search

First, set up the environment for searching:
```
pip install -r ./examples/nas/cream/requirements.txt
```

To search for an architecture, you need to configure the parameters `flops_minimum` and `flops_maximum` to specify the desired range of model FLOPs, e.g., [0, 600] M FLOPs. You can set this interval by changing these two parameters in `./examples/nas/cream/supernet.sh`:
```buildoutcfg
--flops_minimum 0 # Minimum Flops of Architecture
--flops_maximum 600 # Maximum Flops of Architecture
```
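
For intuition only, enforcing this interval during sampling could look like the following sketch; `sample_path` and `estimate_flops` are illustrative placeholders rather than functions from the example.

```python
def sample_in_flops_range(sample_path, estimate_flops, flops_minimum=0, flops_maximum=600):
    """Keep re-sampling until the candidate's estimated FLOPs (in millions)
    fall inside [flops_minimum, flops_maximum]. Illustrative only."""
    while True:
        arch = sample_path()
        flops = estimate_flops(arch)  # e.g., measured with a FLOPs counter such as torchscope
        if flops_minimum <= flops <= flops_maximum:
            return arch
```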

After specifying the FLOPs range of the architectures you would like to search, you can start the search by running:
```buildoutcfg
sh ./experiments/scripts/supernet.sh
```

### II. Test
To test our trained models, you need to use `model_selection` in `./examples/nas/cream/test.sh` to specify which model to test:
```buildoutcfg
--model_selection 42 # test 42m model
--model_selection 470 # test 470m model
......
```

After specifying the FLOPs of the model, you need to set the path of the checkpoint to resume from in `./examples/nas/cream/test.sh`:
```buildoutcfg
--resume './experiments/ckps/42.pth.tar'
--resume './experiments/ckps/470.pth.tar'
......
```
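
For reference, a `--resume` checkpoint of this kind is typically loaded with standard PyTorch calls. A minimal sketch, assuming the `.pth.tar` file stores a state dict (possibly wrapped under a `'state_dict'` key):

```python
import torch

def load_pretrained(model, ckpt_path='./experiments/ckps/42.pth.tar'):
    """Load a checkpoint into `model`, the matching architecture built by the test script."""
    checkpoint = torch.load(ckpt_path, map_location='cpu')
    # Some checkpoints wrap the weights under a 'state_dict' key; fall back to the raw object.
    state_dict = checkpoint.get('state_dict', checkpoint) if isinstance(checkpoint, dict) else checkpoint
    model.load_state_dict(state_dict)
    return model.eval()
```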

We provide 14M/42M/114M/285M/470M/600M pretrained models on [Google Drive](https://drive.google.com/drive/folders/1CQjyBryZ4F20Rutj7coF8HWFcedApUn2).
After downloading the pretrained models and setting `--model_selection` and `--resume` in `./experiments/scripts/test.sh`, run the following command to test the model:
```buildoutcfg
sh ./experiments/scripts/test.sh
```

The test result will be saved in `./retrain`. You can configure `--output` in `./examples/nas/cream/test.sh` to specify a different path.


94 changes: 94 additions & 0 deletions examples/nas/cream/Cream.md
@@ -0,0 +1,94 @@
# Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search

## Introduction
One-shot weight sharing methods have recently drawn great attention in neural architecture search because of their high efficiency and competitive performance. However, weight sharing across models has an inherent deficiency: insufficient training of subnetworks within the hypernetwork. To alleviate this problem, we present a simple yet effective architecture distillation method. The central idea is that subnetworks can learn collaboratively and teach each other throughout the training process, with the aim of boosting the convergence of individual models. We introduce the concept of the prioritized path, which refers to the architecture candidates exhibiting superior performance during training. Distilling knowledge from the prioritized paths boosts the training of the subnetworks. Since the prioritized paths change on the fly depending on their performance and complexity, the paths finally obtained are the cream of the crop. We directly select the most promising one from the prioritized paths as the final architecture, without using other complex search methods such as reinforcement learning or evolutionary algorithms. Experiments on ImageNet verify that this path distillation method improves the convergence rate and performance of the hypernetwork, as well as the training of the subnetworks. The discovered architectures outperform the recent MobileNetV3 and EfficientNet families under aligned settings. Moreover, experiments on object detection and a more challenging search space demonstrate the generality and robustness of the proposed method.

## Reproduction Results

## Examples

[Example code](https://github.com/microsoft/nni/tree/master/examples/nas/cream)

## Requirements
* python >= 3.6
* torch >= 1.2
* torchscope
* apex (optional; if used, make sure your nvcc CUDA version matches your PyTorch CUDA version)

## Data Preparation
You need to first download [ImageNet-2012](http://www.image-net.org/) to the folder `./data/imagenet` and move the validation set to the subfolder `./data/imagenet/val`. To move the validation set, you could use the following script: <https://raw.githubusercontent.com/soumith/imagenetloader.torch/master/valprep.sh>

Put the ImageNet data in `${Root}/data`. The directory layout should look like the following:
```buildoutcfg
${Root}/data/imagenet/train
${Root}/data/imagenet/val
...
```


## Quick Start

### I. Search

First, set up the environment for searching:
```
pip install -r ./examples/nas/cream/requirements.txt
```

To search for an architecture, you need to configure the parameters `flops_minimum` and `flops_maximum` to specify the desired range of model FLOPs, e.g., [0, 600] M FLOPs. You can set this interval by changing these two parameters in `./examples/nas/cream/supernet.sh`:
```buildoutcfg
--flops_minimum 0 # Minimum Flops of Architecture
--flops_maximum 600 # Maximum Flops of Architecture
```

After specifying the FLOPs range of the architectures you would like to search, you can start the search by running:
```buildoutcfg
sh ./experiments/scripts/supernet.sh
```

### II. Test
To test our trained models, you need to use `model_selection` in `./examples/nas/cream/test.sh` to specify which model to test:
```buildoutcfg
--model_selection 42 # test 42m model
--model_selection 470 # test 470m model
......
```

After specifying the FLOPs of the model, you need to set the path of the checkpoint to resume from in `./examples/nas/cream/test.sh`:
```buildoutcfg
--resume './experiments/ckps/42.pth.tar'
--resume './experiments/ckps/470.pth.tar'
......
```

We provide 14M/42M/114M/285M/470M/600M pretrained models on [Google Drive](https://drive.google.com/drive/folders/1CQjyBryZ4F20Rutj7coF8HWFcedApUn2).
After downloading the pretrained models and setting `--model_selection` and `--resume` in `./experiments/scripts/test.sh`, run the following command to test the model:
```buildoutcfg
sh ./experiments/scripts/test.sh
```

The test result will be saved in `./retrain`. You can configure `--output` in `./examples/nas/cream/test.sh` to specify a different path.


### PyTorch

```eval_rst
.. autoclass:: nni.nas.pytorch.cdarts.CdartsTrainer
:members:

.. autoclass:: nni.nas.pytorch.cdarts.RegularizedDartsMutator
:members:

.. autoclass:: nni.nas.pytorch.cdarts.DartsDiscreteMutator
:members:

.. autoclass:: nni.nas.pytorch.cdarts.RegularizedMutatorParallel
:members:
```
Empty file added examples/nas/cream/__init__.py
Empty file.
3 changes: 3 additions & 0 deletions examples/nas/cream/dataset/__init__.py
@@ -0,0 +1,3 @@
from dataset.loader import create_loader
from dataset.base_dataset import Dataset, AugMixDataset
from dataset.utils import resolve_data_config
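
A hypothetical usage sketch of these helpers. The example appears to be adapted from timm, so the calls below follow timm-style signatures; the actual signatures in this example may differ.

```python
from dataset import create_loader, Dataset, resolve_data_config

# Resolve input size and normalization settings (normally filled from the script's argparse args).
data_config = resolve_data_config({})

# Build an ImageFolder-style dataset and wrap it in a training loader.
train_dataset = Dataset('./data/imagenet/train')
train_loader = create_loader(
    train_dataset,
    input_size=data_config['input_size'],
    batch_size=128,
    is_training=True,
)
```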