Adding Imagenet Example #680
Conversation
training/imagenet/README.md
Outdated
## DeepSpeed Optimizations

Applying fp16 quantization and Zero stage 1 memory optimization we were able to reduce the required memory. The table bellow summarizes the results of running resnet 50 on one
node 16 V100 GPUs:
on a DGX-1 node (with 16 V100 GPUs)
Fixed it
training/imagenet/README.md
Outdated
------------------|-------------------

Furthermore, the memory optimization had no adverse impact on accuracy, a point illustrated by the graph below.

the image link is wrong.
Fixed it
training/imagenet/README.md
Outdated
Baseline | ? | -
Baseline with DS activated | 1.66 | -
DS + fp16 | 1.04 | ?
Ds + fp16 + Zero 1 | 0.81 | ?
Besides memory, how about the training speed?
Fixed the table. Did not measure the training speed. Should I repeat the experiments?
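The optimizations named in the table (fp16 plus ZeRO stage 1) correspond to settings in a DeepSpeed JSON config. As a hypothetical sketch (the filename `ds_config.json` and the batch size are illustrative, not taken from this PR):

```json
{
  "train_batch_size": 256,
  "fp16": {
    "enabled": true
  },
  "zero_optimization": {
    "stage": 1
  }
}
```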
training/imagenet/README.md
Outdated
ImageNet dataset is large and time-consuming to download. To get started quickly, run `main.py` using dummy data by "--dummy". It's also useful for training speed benchmark. Note that the loss or accuracy is useless in this case.

```bash
python main.py -a resnet18 --dummy
```
where is deepspeed?
Fixed it
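The dummy-data mode quoted above replaces real ImageNet samples with random tensors (the upstream pytorch examples repo does this with `torchvision.datasets.FakeData`). A minimal equivalent sketch in plain PyTorch, with an assumed `DummyImageNet` name:

```python
import torch
from torch.utils.data import Dataset, DataLoader

class DummyImageNet(Dataset):
    """Random tensors shaped like ImageNet samples: 3x224x224 images, 1000 classes."""

    def __init__(self, num_samples=32, num_classes=1000):
        self.num_samples = num_samples
        self.num_classes = num_classes

    def __len__(self):
        return self.num_samples

    def __getitem__(self, idx):
        image = torch.randn(3, 224, 224)   # fake image, same shape as a real sample
        label = idx % self.num_classes     # deterministic fake label (accuracy is meaningless)
        return image, label

# Batches have the usual (N, C, H, W) layout, so the training loop is unchanged.
loader = DataLoader(DummyImageNet(num_samples=32), batch_size=8)
images, labels = next(iter(loader))
```

This is why the loss and accuracy are useless in dummy mode: the labels carry no relation to the images, but the tensor shapes and data-loading cost are realistic enough for a speed benchmark.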
@@ -0,0 +1,2 @@
torch
deepspeed is also a requirement?
Definitely. Fixed the issue
training/imagenet/README.md
Outdated
Baseline | ? | -
Baseline with DS activated | 1.66 | -
DS + fp16 | 1.04 | ?
Ds + fp16 + Zero 1 | 0.81 | ?
The table format is not correct. Take a look at the rendered website.
Fixed it
This example enables DeepSpeed in the training implementation for a set of popular model architectures on the ImageNet dataset. The models include ResNet, AlexNet, and VGG; the
baseline implementation can be found in the pytorch examples GitHub repository. Enabling DeepSpeed makes it easy to run the code in a
distributed manner and to apply fp16 quantization together with ZeRO stage 1 memory reduction.
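Enabling DeepSpeed typically changes the launch from plain `python` to the `deepspeed` launcher, which handles the distributed setup. A hypothetical invocation, assuming the script exposes the conventional `--deepspeed`/`--deepspeed_config` flags (added via `deepspeed.add_config_arguments`) and a config file named `ds_config.json`:

```bash
deepspeed main.py -a resnet18 --dummy --deepspeed --deepspeed_config ds_config.json
```

The launcher spawns one process per GPU on the node, so no separate `torchrun` or `torch.distributed.launch` wrapper is needed.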