Merge windowsai (winml layering) into master #2956
Conversation
…g against kernel32.lib etc (#2346) add onecoreuap_apiset.lib in order to avoid linking against kernel32.lib etc and violating our OS layering requirements. We linked against onecoreuap_apiset.lib in VB so we will continue doing this, but I am still unsure why not to link against onecore instead since that is where we ship. However, since Sheil is the owner of this code we will wait to discuss with him before changing anything.
* update build instructions to include --build_shared_lib * fix line breaks
* Task 23998197: add winml_lib_core into onnxruntime.dll * PR feedback; build break on perf_test
#2382) this is a big PR. We are going to move it up to layer_dev, which is still an L3, so we can keep working there agile. We are moving this into the L3 so that Ryan can start doing integration testing. We will pause for a full code review and integration test results prior to going into the L2. >>>> raw comments from previous commits >>>
* LearningModelSession is cleaned up to use the adapter, and parts of binding are.
* moved everything into the winml adapter; made it all nano-COM, using WRL to construct objects on the ORT side. Base interfaces for everything for winml to call; cleaned up a bunch of winml to use the base interfaces.
* more pieces
* GetData across the ABI.
* renamed some namespaces; cleaned up OrtValue, Tensor, and custom ops. Everything *but* LearningModel should be clean.
* make sure it's building. winml.dll is still a monolith.
* model moved over. everything builds clean. step! * weak ref comment
* added a wrapper for RoGetActivationFactory to hook back into winml for creating winml objects. fixes model load.
* add option to enable winml telemetry * add option to enable winml telemetry * clean logs while developping * clean the log of GUID * compile onnxruntime_common with winml telemetry * use option for use_telemetry * rename option winml_use_telemetry to onnxruntime_use_telemetry * little change
* fixed some lifetime management. fixed the debug build. squeezenet passes using winmlrunner for CPU and GPU
* PR feedback.
* couple of fixes and coded getmutabledata()
* fixed 2 more heap corruptions
* Add opset and IR check. * Add test case for future opsets. #2371
found a leak in nvidia driver, but skipped it. all winmlapitests pass now
Overall LGTM. There are some tiny things that must get fixed.
@@ -84,6 +84,7 @@ For other system requirements and other dependencies, please see [this section](
|**Build Shared Library**|--build_shared_lib||
|**Build Python wheel**|--build_wheel||
|**Build C# and C packages**|--build_csharp||
|**Build WindowsML**|--use_winml<br>--use_dml<br>--build_shared_lib|WindowsML depends on DirectML and the OnnxRuntime shared library.|
Can we avoid having users type --build_shared_lib and just infer it when --use_winml is specified? It's just one less thing for users to remember.
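A minimal sketch of that suggestion, assuming an argparse-style build script (the parser below is illustrative, not ORT's actual build.py; only the flag names come from this PR):

```python
import argparse

# Hypothetical sketch: when --use_winml is passed, imply the flags it
# depends on, so users don't have to remember --build_shared_lib.
parser = argparse.ArgumentParser()
parser.add_argument("--use_winml", action="store_true")
parser.add_argument("--use_dml", action="store_true")
parser.add_argument("--build_shared_lib", action="store_true")

def resolve_args(argv):
    args = parser.parse_args(argv)
    if args.use_winml:
        # WinML consumes the onnxruntime shared library and DirectML.
        args.build_shared_lib = True
        args.use_dml = True
    return args
```

With this, `resolve_args(["--use_winml"])` yields a configuration with the shared library and DML builds enabled, while the explicit flags still work on their own.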
I haven't checked all the files under the winml/ folder. Please make sure license headers and copyright notices are added in all these new files.
* CR feedback * fix weird formatting on privacy readme * Add 'All rights reserved.' everywhere * readd all rights reserved to winml_provider_factory.h * remove extra space in comment * remove extra whitespace
Merge up to commit 4f4f4bc. There were several very large pull requests in public master: #2956 #2958 #2961

**BERT-Large, FP16, seq=128:** Batch = 66, Throughput = 189.049 ex/sec
**BERT-Large, FP16, seq=512:** Batch = 10, Throughput = 36.6335 ex/sec
**BERT-Large, FP32, seq=128:** Batch = 33, Throughput = 42.2642 ex/sec
**BERT-Large, FP32, seq=512:** Batch = 5, Throughput = 9.32792 ex/sec

**BERT-Large LAMB convergence:** ![image.png](https://aiinfra.visualstudio.com/530acbc4-21bc-487d-8cd8-348ff451d2ff/_apis/git/repositories/adc1028e-6f04-44b7-a3cf-cb157be4fb65/pullRequests/5567/attachments/image.png)

`$ python watch_experiment.py --subscription='4aaa645c-5ae2-4ae9-a17a-84b9023bc56a' --resource_group='onnxtraining' --workspace='onnxtraining' --remote_dir='logs/tensorboard/' --local_dir='D:/tensorboard/bert-large/fp16/lamb/seq128/lr3e-3/wr0.2843/master/' --run='BERT-ONNX_1581120364_71872cef'`

**E2E**: PASSED https://aiinfra.visualstudio.com/Lotus/_build/results?buildId=117300&view=results
This PR is for comment on bringing the newly layered Windows ML components into the master branch of the ONNX Runtime for Windows.
This is in anticipation of a beta release of these bits over the next couple of months. We will include more documentation on how to use them, how the layers work, and the relationships between WinML, ORT, and DML (which we introduced to master in ORT v1.0).
Some of the things we have done in this PR:
- Added a top level directory "/winml"
- Contributed all of the Windows inbox code from the Windows.AI.MachineLearning namespace into that directory, making it available under the MIT license.
- Started a layering effort to have the new Windows.AI.MachineLearning.dll consume the onnxruntime.dll C ABI that we introduced in v1.0.
- Added an "adapter" module that gets linked into the core onnxruntime.dll. This adapter is private to the WinML component and provides ABI functionality required for the layering effort. It is not a new public ABI and is not supported for developers to call.
- Made the WinML ABI fully available for public developers to call.
- You can now include both of these DLLs (WinML + ORT) as a redist component in projects that want to use the WinML ABI.
- Added cmakery for all of this work. There is now a "use_winml" build flag in addition to the "use_dml" build flag.
- Added Google Test unit tests for the newly added WinML ABI. These are under the top-level "winml/test" folder.
- And several other things :)
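The layering above means any client, not just Windows.AI.MachineLearning.dll, talks to the core engine only through the exported C ABI. A minimal sketch of that idea, using Python's ctypes as a stand-in client (the DLL path is a placeholder; OrtGetApiBase is the C-ABI entry point ORT v1.0 exports):

```python
import ctypes

def load_ort_api(dll_path="onnxruntime.dll"):
    """Load the ONNX Runtime shared library and fetch its C API entry point.

    Returns an opaque pointer to the OrtApiBase struct, or None if the
    shared library is not present on this machine.
    """
    try:
        ort = ctypes.CDLL(dll_path)
    except OSError:
        return None  # shared library not found; nothing to layer on top of
    # OrtGetApiBase() returns a pointer to the versioned C API table.
    ort.OrtGetApiBase.restype = ctypes.c_void_p
    return ort.OrtGetApiBase()
```

The point of the sketch is the direction of the dependency: the WinML layer sits on top of the C ABI and never reaches into onnxruntime internals directly.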
Enjoy!