Syncing a fork #1

leeexyz · 2019-03-09T11:20:16Z

Thanks for contributing to TVM! Please refer to guideline https://docs.tvm.ai/contribute/ for useful information and tips. After the pull request is submitted, please request code reviews from Reviewers.


* nms fixed for gpu, tested on cuda and opencl devices, ssd now can run fully on the gpu * sort updated to use virtual thread * typo fixed * fix lint * fix lint * add support when batch_size > 1 * intel graphics conv2d bugs fixed for inception_v3 * intel conv2d api updated, nn input size 4 condition added * review addressed * move conv_tags to attributes * opencl ctx fixed * nms_ir index simplified


* [Relay][Frontend] Add Caffe2 Support * [Relay][Frontend] Add Caffe2 Support (fix unused import) * [Relay][Frontend] Add Caffe2 Support (fix caffe2 model import) * [Relay][Frontend] Add Caffe2 Support (fix model install and reflect code reviews) * [Relay][Frontend] Add Caffe2 Support (fix caffe2 model import) * [Relay][Frontend] Add Caffe2 Support (fix caffe2 model import) * [Relay][Frontend] Add Caffe2 Support (fix caffe2 model import) * [Relay][Frontend] Add Caffe2 Support (fix caffe2 frontend import) * [Relay][Frontend] Add Caffe2 Support (rename function name in test_forward) * [Relay][Frontend] Add Caffe2 Support (fix caffe2 model import) * [Relay][Frontend] Add Caffe2 Support (fix caffe2 model import) * [Doc] Caffe2 frontend tutorial * [Doc] Caffe2 frontend tutorial * [Doc] Caffe2 frontend tutorial * [Relay][Frontend] Add Caffe2 Support (remove unused file)


* [RELAY][FRONTEND] Tensorflow frontend support. * * LSTM removed for a while. * * basic ops are good. * * nn wip * * wip * * python2.7 corrections. * * NN ops are good. * * e2e models working good * * all good except LSTM * * rebase, tutorials and CI trigger. * * CI errors. * * enable opt_level=3 * * Docstrings cleanup. testing.tf utils moved to relay from nnvm. * * tutorials update. * * LSTM work good now. * * Rebase * * CI error * * enable PTB. * * rebase. * * tutorials * Update python/tvm/relay/frontend/tensorflow.py Co-Authored-By: srkreddy1238 <[email protected]> * * review comments. * CI fix. * * review comments.


* [SCRIPT] Refactor grep for multiple patterns Tidy up the use of grep. Use -E rather than run multiple grep instances. * [SCRIPT] Refactor grep use in pipeline. Prefer to use stdin redirection rather than create a pipeline. * [SCRIPT] Refactor placement and cleanup of temporary files. Place temporary files in the conventional /tmp location. Avoid poisoning file name space by using $$. Ensure the temporary files get cleaned up, even when the script fails / exits early. * [SCRIPT] Improve robustness of task_lint.sh error handling. Ensure script failures are caught and propagated. Rather than trying to explicitly catch and propagate failures with explicit "|| exit" annotations, use the "set -e" idiom from docker/install scripts and have the shell catch and propagate errors in the general case and special case the grep instances where non zero exit is permitted and should be ignored.
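The script techniques named in these commit messages can be sketched roughly as follows. This is a minimal illustration, not the actual contents of task_lint.sh: the file name `lint_demo`, the sample log lines, and the `cleanup` function are all hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of the idioms described above; not the real task_lint.sh.
set -e  # abort on any unexpected non-zero exit status

# $$ expands to the shell's PID, so concurrent runs do not share a file name.
tmp_log="/tmp/lint_demo.$$.log"

cleanup() {
  rm -f "$tmp_log"
}
# The EXIT trap fires on normal completion and on an early exit caused by set -e.
trap cleanup EXIT

# Made-up sample input standing in for real lint output.
printf 'ok: a.py\nwarning: unused import\nerror: bad indent\n' > "$tmp_log"

# A single grep -E with alternation replaces several chained grep instances,
# and stdin redirection avoids a `cat file | grep ...` pipeline.
# grep exits with status 1 when nothing matches; that case is acceptable here,
# so it alone is exempted from set -e with `|| true`.
matches=$(grep -E 'warning|error' < "$tmp_log" || true)

echo "$matches"
```

The `|| true` is deliberately confined to the one command whose non-zero exit is expected, so every other failure still stops the script.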

* [DOCKER] Make all install .sh scripts directly executable. * [DOCKER] Use curl -L consistently. Make the use of the curl -L option in docker build scripts consistent. * [DOCKER] Drop use of --force-yes The --force-yes option is generally not recommended, it can leave systems in an undefined state. The use of --allow-* options is preferred. In this particular case the --force-yes option appears to serve no purpose. Dropping it. * [DOCKER] Drop superfluous repeated apt-get update. The "apt-get update && apt-get install" idiom is necessary and specific to Dockerfile. In shell the repeated apt-get update is superfluous. Drop the duplicates. * [DOCKER] Robustness -e -u -o pipefail The install scripts used to construct docker environments do not, in general, propagate errors. Some of the scripts use ad hoc && directives to chain together short sequences of commands but there are numerous failure modes which are silently ignored. This patch puts in place some consistent, basic, shell error trapping across all of the install scripts. Note this is a step forward towards more robust scripts but it is not a complete solution. * [DOCKER] Shallow clone. Use shallow clone to reduce bandwidth requirements of repeated docker (re)-builds. * [DOCKER] Use clone --branch rather than clone then checkout Use the git clone --branch idiom rather than git clone && git checkout. This paves the way for using --depth=1
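To illustrate why `-o pipefail` matters alongside `-e` and `-u`, the sketch below compares a failing pipeline with and without it. The variable names are invented for the demonstration, and the clone command in the trailing comment uses a placeholder URL and tag rather than anything from the TVM install scripts.

```shell
#!/usr/bin/env bash
# Hypothetical demonstration; not taken from the docker/install scripts.
# -e: exit on error, -u: error on unset variables,
# -o pipefail: a pipeline fails if any stage fails.
set -e -u -o pipefail

# Without pipefail, a pipeline's status is that of its last command,
# so the failure of `false` is masked by the succeeding `cat`.
# (The child bash does not inherit pipefail from this script.)
if bash -c 'false | cat'; then
  without_pipefail="failure hidden"
else
  without_pipefail="failure reported"
fi

# With pipefail, the same pipeline reports the failure.
if bash -c 'set -o pipefail; false | cat'; then
  with_pipefail="failure hidden"
else
  with_pipefail="failure reported"
fi

echo "without pipefail: $without_pipefail"
echo "with pipefail:    $with_pipefail"

# The clone idiom from the last two messages, shown with a placeholder
# URL and tag; it fetches one branch or tag without the full history:
#   git clone --depth=1 --branch v1.0 https://example.com/project.git
```

Commands tested inside an `if` condition are exempt from `set -e`, which is why the failing pipelines above do not terminate the script.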


* Add batch_dot and cpu schedule * Add relay support for batch_dot * Rename batch_dot to batch_matmul * nits * Add missing file * Put batch_matmul and dense x86 schedule in separate files * Fix pylint * Remove unused import * Add cuda schedule for batch_matmul * Add test case with larger batch size * Add batch_matmul in api doc * Fix quantize pass rounding error * Fix pylint and minor change * bug fix


…ded stack op (#2729) * add stack op frontend * concate moved * topi stack added * stack added * fix stack bugs and tested * conv2d alterlayout updated for relay * fix pylint * fix cmake warnings * cmake warnings fixed


…generating (apache#5962) * Code migration Start (#1) * Init commit: Code migration Start * Add loop_state.cc/h * Add ComputeDAG basic test * Split transform_step out & Update more UTs (apache#3) * Split transform_step out * Update GetProducers & GetConsumers * Update UTs * Add UT for CacheReadWrite & Some bug fix * Add search_task, measure and serialization (apache#4) * Add FollowSplit & FollowFusedSplit tests * Update dag.InferBound & its UT * Add search_task, measure and serialization * Update Serialization UT * Add MetaTileRewritePolicy (apache#5) * Add feature * Add cost_model, meta_tile_rewrite_policy * Add MetaTileRewritePolicy basic UT * Basic Python API for State (apache#6) * Add Basic Python API for State * Add UTs for State * Add Python API: Measure & Task (apache#7) * Update the return value of state operation * Add task * Copy measure.py & utils.py * Fix LocalBuilder * Fix LocalRunner * Add ansor.auto_schedule() API; First AutoSchedule working version(apache#8) * Add basic Python support for ansor.auto_schedule * Update AutoSchedule API * Bug fix for get the attach point of a fused iter * Update UT after infer bug fix * Bug fix & Add python serialization API (apache#10) * Delete C++ UT hack since Python is ready * Add ndarray.non_empty * Update Serialization python API * Improve code style, python wrapper and test cases (apache#11) * Update c++ code style and unit test * Update python State wrapper and test cases * fix unit tests * Add RPCRunner & OpenCL/CUDA test (apache#12) * Add RPCRunner & OpenCL search test * Add CUDA search test * Add RPCRunner test * rebase to upstream/master * Add Ansor basic tutorial (apache#13) * Add basic tutorial * migrate feature extraction (apache#14) * Add XGBModel & RPCRunnerWarpper (apache#15) * Add XGBModel & RPCRunnerWarpper * Revert "Add Parallel Granularity Mutation" * Migrate workload_registry.py (apache#16) * add workload registry * update * update * add task scheduler (apache#17) * Add conv2d cuda tutorial with 
workload registry (apache#18) * add tune_test.py (the old tune_wkl.py) (apache#19) * add tune_test.py (the old tune_wkl.py) * update * fix measure * fix for gpu * Code refine for tune_test.py & Add a pre load callback (apache#20) * Bug fix for tutorials * Add PreLoadMeasuredStates * Add search_callback support for task tuner * Code refine for tune_test.py * Update * Update * Update * Update * Bug fix * Add python custom sketch rule (apache#21) * Add custom sketch rule * Bug fix * Ansor Relay Integration (without layout rewrite) (apache#22) * relay integration * Add tune_op_subgraph.py & Some code clean for tune_network.py (apache#23) * Add single op tune scripts * Add tune subgraph support * Merge all op & all subgraph to one file * Rename file * add explicit_unroll_max_extent (apache#25) * Add Index simplification & API update (apache#26) * Add vectorized cooperative_fetching test * Update math simplify for vectorized CF * File rename * Update tune_network * API update * Update PreLoadMeasuredStates & Some bug fix (apache#27) * Add a threading wrapper to fix the test bug * Set default TVM_USE_AUTO_SCHEDULER to false * Update PreLoadMeasuredStates callback * Add tensorize step for loop_state (apache#31) * Add tensorize step * State python api update (apache#33) * Start to update api * Add compute_dag to state * API update * kernel layout rewrite (apache#28) * kernel layout rewrite * remove some hacks * add defuse_ops pass and move kernel_layout_rewrite pass after fuse_ops pass * set TVM_RELAY_DISABLE_BUILD_CACHE for task extraction and prepare_layout_rewrite * [cache flush] port cache flush to ansor (apache#32) * Improve relay integration (apache#34) * tmp checkpoint * Improve relay integration * Improve relay integration * Fix xgb error & Simplify dispatcher (apache#35) * Rename "MetaTileRewritePolicy" to "SketchPolicy". (apache#36) * Rename "MetaTileRewritePolicy" to "SketchPolicy". 
* Add a new class for auto_unroll_max_step, storage_offset in StageNode * fix tune_op_subgraph.py * rebase * Migrate all node::make to noderef's construct function (apache#37) * Start to move xxxnode::make to noderef() * Update * Update * Finish transform_step * Finish comute dag & auto schedule * Update * Update * Update * Update * Update * Code refine * Code refine * Code refine * Update * Update * Some lint fix & Recover the double constructor of tvm::PrimExpr (apache#39) * lint fix * clang-format-fix * pylint fix * Update * Recover the double constructor of tvm::PrimExpr * Fix pylint * pylint fix * pylint fix * Add MutateComputeLocation and MutateParallel in evolutionary search (apache#40) * Add MutateComputeLocation and MutateParallel in evolutionary search * fix lint * Improve loop state python API (stage_tensors -> stage_ops) (apache#41) * improve loop state python API (stage_tensors -> stage_ops) * fix * ComputeDAG bug fix & Add Custom TensorCore Matmul Example (apache#42) * Bug Fix * Sample example of Custom TensorCore Matmul * Rever Commits, Start to build minimum Ansor system * Code clean for minimum Ansor system * Bug fix & Delete AccessAnalyzer * Delete attachmap & Code clean * Doc update Update statenode::stages from vector to Array * Headfile update & Python doc update * clang-format fix * pylint fix * Update * Doc update * Update * Bug fix after code merge to the new master * clang-format fix * Update * Update * Update std::vector to Array; Update verbosity setting; Some commemts addressed * std::vector->Array & std::string->String * Add init_state to ComputeDAG * Update * Update some unordered_map to Map * clang-format fix * Comments addressed Delete ReplayAndInferBound Delete ReplaySteps & InferBoundCommon * Lint fix * Update * Update * Update * Update * Update * Update * Update * Update * Update * Rename ansor namespace to auto_schedule * Update * Rename ThreadPool to ParallelFor * Add parallel_for * Remove ThreadPool * Update 
python/tvm/auto_schedule/auto_schedule.py * trigger CI Co-authored-by: Lianmin Zheng <[email protected]> Co-authored-by: Minmin Sun (孙敏敏) <[email protected]> Co-authored-by: Zhao Wu <[email protected]>

leeexyz and others added 30 commits January 28, 2019 21:12

[DOC]Update doc in _api_internal.py and ir_pass.py (#2514)

174c92a

Optimize Linux shared library modules (*.so files) (#2445)

75f91c4

[Doc] TFLite frontend tutorial (#2508)

1120655

* TFLite frontend tutorial * Modify as suggestion

[BugFix] Copy intermediate result in debug runtime (#2520)

5a30a22

[Relay][Frontend] CoreML Support (#2476)

881a78b

* [Relay][Frontend] Add CoreML Support * pip install six in CI * remove triggering nnvm coreml test * set opt_level=2 for nnvm coreml test case

Remove an obsolete comment (#2527)

cde1f76

add min_repeat_ms to other CUDA tutorials (#2526)

ad8fc8c

[TFLite][Python 2] Solve TFLite frontend python 2 compatibility and M…

da972bd

…odify TFLite whl files path (#2529)

[OPT] Low-bit Quantization (#2116)

741b6bb

* [QUANTIZE] Quantization implementation. * Update. * Update. * Update. * Update.

Update rustc (#2524)

b6ae3e1

Add fallback for ApplyGraphBest (#2485)

b63182e

[RELAY] Copy subfunction arguments to output tuple field (#2537)

4d8ecb3

Fix typo in Evaluate inference time cost code (#2542)

3206719

[DOCS][RELAY] Sync up ops with code base (#2532)

b805c69

[COMMUNITY] @FrozenGene -> Reviewer (#2544)

7a18ed7

* [COMMUNITY] @FrozenGene -> Reviewer * Fix

[CI] Enable ANTLR in CPU env (#2548)

e012f81

[CI] Update rust format version (#2550)

18b2eba

[RUST][FRONTEND] Add rust frontend v0.1 (#2292)

e2970b2

[RELAY] TextPrinter: Use Map Format (#2553)

e0af5c2

print ast w/o metadata (#2533)

0f2e432

[VERSION] Move version script to the project root (#2556)

74b035a

Remove duplicate as Checks and CHECK value (#2531)

5b8ff8d

Misc refactor on graph runtime, layout node (#2557)

3a75b13

[Relay][OP] Add reverse_reshape (#2503)

40f7682

* Enable reverse in reshape * Fix lint and typo * Put reverse_reshape into a separate op * Fix pylint

[Python dep] Add missing dep pkg for relay (#2568)

7c12495

[Golang] bugfix #2517 (#2558)

389fbb5

[TVM][BUGFIX] Fix missing reduction init predicates (#2495)

f0ae8e3

* [TVM][BUGFIX] Fix reductions in split axes * A test case for the problem * Fix the fix: skip loops that are related to reduction AND are unrelated to axis

mshawcroft and others added 28 commits March 1, 2019 10:09

[Doc] Relay tutorial - Deploy the Pretrained Model on Raspberry Pi (#…

3e5a172

…2693)

Defined a common base class for TensorComputeOp and ComputeOp (#2587)

d546bb7

* Defined a common base class for TensorComputeOp and ComputeOp * Made changes requested by @ZihengJiang * added a testcase to assert that `tensorize` does not have any effect on TensorComputeOp ops.

[ARITH] Analyzer Infra, ConstIntBound, Modular (#2668)

1eb1dac

[EXPR] ir_operator.h->expr_operator.h Centralize const folder logic (#…

6d46060

…2719)

[RELAY][PASS] Common subexpression elimination (#2639)

1ca0393

[Tensorflow, NNVM, TOPI] Support for logical operators (#2453)

af69f87

[Relay][Frontend] Add a few mxnet ops in relay frontend (#2704)

c8373ec

[Relay][Frontend] Add slice axis op in mxnet converter (#2706)

2fa3a67

* Add slice axis op in mxnet converter * Fix lint

[DOCS] Fix tutorial (#2724)

25c50fc

* fix documents * delete e

[Relay] Higher order reverse mode automatic differentiation that work…

eae76b3

… with control flow (#2496) add test remove dead code stash do it add more test

Fix compilation on XCode 10 (#2731)

29e0000

[DOCKER] Pin pylint==1.9.4 (#2727)

115da9d

Docs: pip dependencies for testing (#2728)

fd9fa4b

[COMMUNITY] @sgrechanik-h -> Reviewer (#2732)

ada760a

use LLVM linker (#2713)

c8a3a59

* use LLVM linker * error message improved in case of filenotfound * linting error fixed

[RELAY][OP] Faster-RCNN Proposal OP (#2725)

fe06049

* [RELAY][OP] Proposal * Fix * Fix test

[Relay][Frontend][Bugfix] Fix bug in converting slice_axis when axis …

c2e60c6

…is negative (#2739) * bug fix * trigger ci

[VERSION] Update to 0.6.dev (#2736)

154e054

[RUNTIME][OPENCL] clFinish before releasing memory (#2737)

66cad62

[Bugfix][Relay][Frontend] Fix bug in mxnet converter for slice_like (#…

52d5cf8

…2744) * Fix bug in mxnet converter for slice_like * More tolerance for topi_conv2d_NCHWc

Improve NNVM to Relay conversion (#2734)

695647d

* Improve NNVM to Relay conversion * fix pylint * support __lshift_scalar__, abs, ceil, floor, and trunc to pass CI

[Relay] Add logical operators (#2743)

2239508

Fix vmlal.s16 code generation for int8 x int8 -> int32 (#2748)

a7e35fc

revert PR#2420 nms changes (#2747)

b0a0ae4

leeexyz merged commit 732d6aa into leeexyz:master Mar 9, 2019
