Add MLCube support for Object Detection Benchmark #501

davidjurado · 2021-07-23T15:52:16Z

Benchmark execution with MLCube

Project setup

# Create Python environment and install MLCube Docker runner 
virtualenv -p python3 ./env && source ./env/bin/activate && pip install pip==24.0 && pip install mlcube-docker

# Fetch the Object Detection workload
git clone https://github.com/mlcommons/training && cd ./training
git fetch origin pull/501/head:feature/object_detection && git checkout feature/object_detection
cd ./object_detection/mlcube

Dataset

The COCO dataset will be downloaded and extracted. Sizes of the dataset in each step:

Dataset Step	MLCube Task	Format	Size
Download (Compressed dataset)	download_data	Tar/Zip files	~20.5 GB
Extract (Uncompressed dataset)	download_data	Jpg/Json files	~21.2 GB
Total	(After all tasks)	All	~41.7 GB

Tasks execution

Parameters are defined at these files:

MLCube user parameters: mlcube/workspace/parameters.yaml
Project user parameters: pytorch/configs/e2e_mask_rcnn_R_50_FPN_1x.yaml
Project default parameters: pytorch/maskrcnn_benchmark/config/defaults.py

# Download COCO dataset. Default path = /workspace/data
mlcube run --task=download_data -Pdocker.build_strategy=always

# Run benchmark. Default paths = ./workspace/data
mlcube run --task=train -Pdocker.build_strategy=always

Demo execution

These tasks will use a demo dataset (39M) to execute a faster training workload for a quick demo (~12 min):

# Download subsampled dataset. Default path = /workspace/demo
mlcube run --task=download_demo -Pdocker.build_strategy=always

# Run benchmark. Default paths = ./workspace/demo and ./workspace/demo_output
mlcube run --task=demo -Pdocker.build_strategy=always

It's also possible to execute the two tasks in one single instruction:

mlcube run --task=download_demo,demo -Pdocker.build_strategy=always

Aditonal options

Parameters defined at mculbe/mlcube.yaml could be overridden using: --param=input

We are targeting pull-type installation, so MLCube images should be available on docker hub. If not, try this:

mlcube run ... -Pdocker.build_strategy=always

github-actions · 2021-07-23T15:52:30Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

johntran-nv · 2023-03-16T18:47:51Z

@mmarcinkiewicz are you the right person to review this one?

ShriyaPalsamudram · 2023-03-20T19:14:02Z

object_detection/README.md

+
+```bash
+mlcube run ... -Pdocker.build_strategy=always
+```


This is the benchmark README template. So can you please add sections that have been moved to README.old back?

Done, thanks for pointing this out, move mlcube explanation into the mlcube folder

ShriyaPalsamudram · 2023-03-20T19:15:37Z

object_detection/download_dataset.sh

+curl -O http://images.cocodataset.org/zips/train2017.zip
+echo "Extracting train2017.zip:"
+n_files=`unzip -l  train2017.zip| grep .jpg | wc -l`
+unzip train2017.zip  | { I=-1; while read; do printf "Progress: $((++I*100/$n_files))%%\r"; done; echo ""; }

 # TBD: MD5 verification
 # $md5sum *.zip *.tgz


Can you please add a checksum verification step to make sure changes do not affect the dataset?

Done, added the validation inside the download_dataset.sh file.

ShriyaPalsamudram · 2023-03-20T19:16:58Z

object_detection/mlcube/workspace/parameters.yaml

@@ -0,0 +1,5 @@
+SAVE_CHECKPOINTS: "True" # Instead of False use empty value


SAVE_CHECKPOINTS should be False by default since that code path is not well tested in the recent past.

ShriyaPalsamudram

Can you please share a log from an end-to-end training run using mlcube so it can be compared to the previous workflow?

nv-rborkar · 2024-03-08T03:37:28Z

@davidjurado can you please address Shriya's feedback. We can then merge this PR.

…bject_detection

Add download data task to MLCube

e238dd7

davidjurado added 6 commits July 26, 2021 11:58

Add unzipping progress

7f69f94

Add train task

a3bc71c

specify other parameter files location

648c70a

Add support to override parameters at command line

16f75fd

Update to MLCube config v2.0

efca6e0

Update readme

9395cfd

matthew-frank added object_detection object_detection benchmark MLCube labels Dec 2, 2022

johntran-nv requested a review from mmarcinkiewicz March 16, 2023 18:47

ShriyaPalsamudram reviewed Mar 20, 2023

View reviewed changes

Merge branch 'master' of github.com:mlcommons/training into feature/o…

52b7253

…bject_detection

davidjurado requested a review from a team as a code owner March 22, 2024 15:48

davidjurado added 7 commits March 22, 2024 15:56

Add MD5 verification

3a9fe5f

Fix SAVE_CHECKPOINTS flag

ef001bb

Fix MLCube readme

711ecaa

Add demo tasks

0db618f

update scripts

573f782

update demo download link

9f9ab23

Fix dependencies

489ed0f

davidjurado force-pushed the feature/object_detection branch from 2c969eb to 489ed0f Compare November 15, 2024 15:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MLCube support for Object Detection Benchmark #501

Add MLCube support for Object Detection Benchmark #501

davidjurado commented Jul 23, 2021 •

edited

Loading

github-actions bot commented Jul 23, 2021 •

edited

Loading

johntran-nv commented Mar 16, 2023

ShriyaPalsamudram Mar 20, 2023

davidjurado Mar 22, 2024

ShriyaPalsamudram Mar 20, 2023

davidjurado Mar 22, 2024

ShriyaPalsamudram Mar 20, 2023

davidjurado Mar 22, 2024

ShriyaPalsamudram left a comment

nv-rborkar commented Mar 8, 2024

		@@ -0,0 +1,5 @@
		SAVE_CHECKPOINTS: "True" # Instead of False use empty value

Add MLCube support for Object Detection Benchmark #501

Are you sure you want to change the base?

Add MLCube support for Object Detection Benchmark #501

Conversation

davidjurado commented Jul 23, 2021 • edited Loading

Benchmark execution with MLCube

Project setup

Dataset

Tasks execution

Demo execution

Aditonal options

github-actions bot commented Jul 23, 2021 • edited Loading

johntran-nv commented Mar 16, 2023

ShriyaPalsamudram Mar 20, 2023

Choose a reason for hiding this comment

davidjurado Mar 22, 2024

Choose a reason for hiding this comment

ShriyaPalsamudram Mar 20, 2023

Choose a reason for hiding this comment

davidjurado Mar 22, 2024

Choose a reason for hiding this comment

ShriyaPalsamudram Mar 20, 2023

Choose a reason for hiding this comment

davidjurado Mar 22, 2024

Choose a reason for hiding this comment

ShriyaPalsamudram left a comment

Choose a reason for hiding this comment

nv-rborkar commented Mar 8, 2024

davidjurado commented Jul 23, 2021 •

edited

Loading

github-actions bot commented Jul 23, 2021 •

edited

Loading