[Glow Runtime] Top Level Task #2045

qcolombet · 2018-11-17T00:10:15Z

This is the top level issue to track all the work we plan to do to make the glow runtime supports concurrent execution, pipelining, batching and so on.

At a high level, the idea for the runtime is to be able to:

Enqueue inputs: Run input0, then run input1 as soon as the previous run is done, etc.
Slice the inputs into batch size and transparently run them: Take N input and sequentially run them in batches of M (where M is the size of the compiled model and N the actual run size.)
Pipeline work across models: Run input1 on model M1, then run the result of M1 on M2 while running input2 on M1, etc.

Among other things, the glow runtime will have to:

Manage input/output queues for each model (and communication with the devices)
Manage incoming model
Keep track of data dependencies and schedule next tasks to be done
Split inputs
Pad inputs
Dispatch workload on device
Keep track of the status of devices
Also, somewhat orthogonal to the runtime, but related, glow will need to:
Determine what and where to run things (graph partitioning)

Right now, we started by splitting the compilation and runtime stages properly.
This work is tracked in:
#2040, #1967, #1953, #1951

gcatron · 2018-12-05T23:11:43Z

Adding #2125 to the list. Work is being done on the HostManager. This is part of the new runtime design. The design has five major components: HostManager, Partitioner, Provisioner, DeviceManager, and Executor.

Partitioner:
This component is responsible for breaking up the provided network into subnetworks that can be run on multiple devices. It does its partitioning based on hardware constraints and heuristics to optimize execution time. It outputs a DAG to be used by the other components.

DeviceManager:
The DeviceManager handles interactions with the device. The manager handles initializing the device, copying constants to the device and preparing the device for execution. It also handles unloading networks from the device.

Provisioner:
The Provisioner takes the output from the partitioner and assigns sub networks to specific devices updating the DAG with device assignments. The Provisioner handles the code generation part of compilation and calls into the device manager to load the subnetworks to the device.

Executor:
The Executor handles the execution of the network. It walks the DAG calling execution of each sub network in accordance with their dependencies.

HostManager:
The HostManager is the container for the other components. It serves as the interface externally, handling network init and run requests. The HostManager routes a request through the other components and holds the DAGs for each network.

gcatron mentioned this issue Dec 5, 2018

[WIP] Add host manager #2125

Closed

This was referenced Dec 6, 2018

[glow/runtime] First cut DeviceManager with CPU backend #2134

Closed

[WIP][DO NOT ACCEPT][Runtime] Add runtime Executor class #2133

Closed

This was referenced Dec 11, 2018

[WIP] Add Provisioner #2159

Closed

[WIP] Adding RuntimeTypes.h #2161

Closed

nickgg mentioned this issue Jan 9, 2019

DeviceManager interface and CPUDeviceManager #2249

Merged

gcatron mentioned this issue Jan 9, 2019

Separate Code Generation and Constant Collection during Compile #2254

Closed

This was referenced Jan 17, 2019

Add provisioner for new Runtime #2276

Closed

Add Host Manager #2284

Merged

gcatron mentioned this issue Jan 24, 2019

Added Provisioner and unittest #2293

Merged

This was referenced Jan 24, 2019

[Partitioner] First Graph Partitioning #2268

Merged

Graph Partitioning #2298

Open

gcatron mentioned this issue Mar 13, 2019

Added new compileFunctions method #2531

Merged

gcatron mentioned this issue Mar 22, 2019

Develop Benchmark for Glow Runtime #2579

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Glow Runtime] Top Level Task #2045

[Glow Runtime] Top Level Task #2045

qcolombet commented Nov 17, 2018

gcatron commented Dec 5, 2018 •

edited

Loading

[Glow Runtime] Top Level Task #2045

[Glow Runtime] Top Level Task #2045

Comments

qcolombet commented Nov 17, 2018

gcatron commented Dec 5, 2018 • edited Loading

gcatron commented Dec 5, 2018 •

edited

Loading