[WIP] workflow toml specification and initial implementation #1
Conversation
Codecov Report

```
@@           Coverage Diff            @@
##           master      #1    +/-   ##
========================================
  Coverage        0   0.00%
========================================
  Files           0       4     +4
  Lines           0      71    +71
========================================
- Misses          0      71    +71
```

Continue to review full report at Codecov.
Force-pushed from 7b21a19 to 962c9f7.
```toml
[stages.run]
metrics = ["time"]
out = ["results.csv"]
driver = "csv"
```
Currently, it outputs

```csv
id,time
dilate_juliaimages_morphology,0.623235
erode_juliaimages_morphology,0.627855
dilate_skimage_morphology,1.5773831250000003
```

and if I change to `metrics = ["time", "memory"]`, then it becomes

```csv
id,time,memory
dilate_juliaimages_morphology,0.636088,1280096
erode_juliaimages_morphology,0.641488,1280096
dilate_morphology_skimage,1.8287903700000008,
```
cc: @ashwani-rathee
I should make `metrics` an optional field that defaults to all results. Or just remove this field.
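
A minimal sketch of the "optional, defaulting to all results" reading on the Julia side, assuming Julia's stdlib TOML parser; the file name and variable names are illustrative, not the actual implementation:

```julia
using TOML

spec = TOML.parsefile("workflow.toml")  # hypothetical workflow file
run_stage = spec["stages"]["run"]

# If `metrics` is omitted, `nothing` signals "export every collected metric".
metrics = get(run_stage, "metrics", nothing)
```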
```python
import json
import timeit

# `img`, `dilation`, and `square` are assumed to be defined earlier in the benchmark script
count, time = timeit.Timer('dilation(img, square(3))', globals=globals()).autorange()

# export: print the metrics as a single JSON object on stdout
print(json.dumps({"time": 1e3 * time / count}))  # ms
```
The shell runner passes data via `stdout=IOBuffer()`.
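
A minimal sketch of that idea, assuming JSON3 for parsing; `run_shell_stage` is a hypothetical name, not the actual implementation:

```julia
using JSON3

# Run the benchmark script, capture whatever it prints to stdout,
# and parse that JSON payload as the metrics dictionary.
function run_shell_stage(cmd::Cmd)
    buf = IOBuffer()
    run(pipeline(cmd; stdout=buf))
    return JSON3.read(String(take!(buf)))
end

# e.g. run_shell_stage(`python benchmark.py`) yields an object like {"time": 1.57}
```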
```julia
using JSON3
using Statistics: median

# `rst` is the benchmark trial collected earlier; times are recorded in nanoseconds
Dict(
    "memory" => rst.memory,            # bytes
    "time"   => median(rst.times)/1e6, # ms
) |> JSON3.write
```
The Julia runner could pass a `Dict` directly, but I want to make this language-agnostic, so for consistency I choose to let it pass a JSON string.
Closed in favor of #8.
Key features I have in mind:

- support for multiple languages (`python`, `julia`, `shell`, and others)

My design:
The idea comes mainly from two designs: dvc yaml and datasets toml.
Example: benchmark framework
The initial purpose of this is to support a generic, wide-ranging benchmark framework. There are some key challenges to achieving this:

- a function `f` in framework A might not have a corresponding function in other frameworks

A natural idea is to separate the data-producing stage from the data-analysis stage. We can produce as many benchmark results as we want, as long as we properly tag them. Then, in the data-visualization stage, we use filters to collect the results we're interested in.
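
A minimal sketch of that tag-based filtering idea; the record layout and names here are assumptions, not the actual design:

```julia
# Hypothetical record: every benchmark result carries a name plus free-form tags.
struct BenchResult
    name::String
    tags::Vector{String}
    metrics::Dict{String,Float64}
end

# Keep only the results whose tags contain everything we ask for.
filter_results(results, wanted) =
    filter(r -> all(t -> t in r.tags, wanted), results)
```

For example, `filter_results(results, ["morphology"])` would collect every morphology benchmark regardless of which framework produced it.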
BenchmarkTools and PkgBenchmark use a tree design (nested groups) to organize the benchmark tasks. I reckon it a bad design because 1) it introduces an overly compact form and makes it very hard to adjust benchmark targets, and 2) it makes future result filtering much harder. Thus I choose a "name"+"tags" design; in the visualization and analysis stage:
- we can filter by tags when a function `f` has multiple implementations.

A unique ID is generated by joining "name" and "tags" (e.g., `join([name, sort(tags)...])`).
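
For instance, a minimal sketch of that ID scheme (the helper name and the `"_"` separator are assumptions); sorting the tags makes the ID independent of tag order, which matches ids like `dilate_juliaimages_morphology` above:

```julia
# Hypothetical helper: build a stable ID from a name plus sorted tags.
make_id(name::AbstractString, tags) = join([name, sort(tags)...], "_")

make_id("dilate", ["morphology", "juliaimages"])  # "dilate_juliaimages_morphology"
```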
cc: @ashwani-rathee