Commit
[TOPI] Make cumsum IR reusable, add thrust scan (apache#7303)
* import changes from scan branch

commit cf0d4fd
Author: Masahiro Masuda <[email protected]>
Date:   Fri Dec 25 10:12:01 2020 +0900

    get valid count test working

commit eb142d3
Author: Masahiro Masuda <[email protected]>
Date:   Fri Dec 25 07:22:00 2020 +0900

    integrate new cumsum change

commit f89684d
Author: Masahiro Masuda <[email protected]>
Date:   Fri Dec 25 06:56:46 2020 +0900

    remove ceil_div from nms

commit a2ad4de
Author: Masahiro Masuda <[email protected]>
Date:   Sun Dec 20 20:36:34 2020 +0900

    add api for returning reduction from ex scan output

commit b7f4ef7
Author: Masahiro Masuda <[email protected]>
Date:   Sun Dec 20 19:49:07 2020 +0900

    move ceil_div to utils

commit a9a57e3
Author: Masahiro Masuda <[email protected]>
Date:   Sun Dec 20 19:38:15 2020 +0900

    rename prefix_scan.py to scan.py

commit 03ed43f
Author: Masahiro Masuda <[email protected]>
Date:   Sat Dec 19 06:12:55 2020 +0900

    suppress cpplint

commit abceac9
Author: masa <[email protected]>
Date:   Fri Dec 18 20:36:24 2020 +0900

    support more data type

commit 3e7d1f8
Author: masa <[email protected]>
Date:   Fri Dec 18 20:09:51 2020 +0900

    1d thrust scan working

commit ac13b40
Author: masa <[email protected]>
Date:   Fri Dec 18 19:49:25 2020 +0900

    adding thrust scan support

commit 65634e8
Author: masa <[email protected]>
Date:   Fri Dec 18 19:01:11 2020 +0900

    add thrust scan python stub

commit 9876c90
Author: masa <[email protected]>
Date:   Fri Dec 18 20:55:14 2020 +0900

    introduce prefix_scan.py and move scan ir in nms.py

commit 667bdd3
Author: masa <[email protected]>
Date:   Fri Dec 18 15:06:18 2020 +0900

    make the scan loop exclusive

commit 480787b
Author: mbrookhart <[email protected]>
Date:   Thu Dec 17 10:01:11 2020 -0700

    Parallelize cumsum in get_valid_counts

* fix for 1d scan
* rename
* cast to out dtype
* do not run return reduction for inclusive scan
* remove another ceil_div definition
* adding scan test
* add scheduling for scan op, fixed scan 1d test
* pylint fix
* add doc string
* add more thrust scan test
* add dynamic get valid count test, including empty size tensor
* fix hard coded gpu targets for cpu only env
* try returning early if scan_size is 0
* another change for empty tensor and thrust path

Co-authored-by: masa <[email protected]>
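
For readers unfamiliar with the terms used in the messages above, here is a minimal pure-Python sketch of an exclusive scan that also returns the reduction (the total sum), with an early return for empty input. It is not the TOPI or Thrust implementation; the function name `exclusive_scan_with_reduction` and the `valid_flags` example are hypothetical and chosen only for illustration.

```python
from typing import List, Sequence, Tuple


def exclusive_scan_with_reduction(data: Sequence[int]) -> Tuple[List[int], int]:
    """Return (exclusive prefix sums, total sum) for a 1-D sequence.

    Hypothetical helper for illustration; not part of any library API.
    """
    # Empty input: nothing to scan, and the sum is the additive identity.
    if len(data) == 0:
        return [], 0

    output = []
    running = 0
    for value in data:
        # Exclusive scan: record the sum of all elements before this one.
        output.append(running)
        running += value

    # The reduction can be recovered from the exclusive scan output:
    # last exclusive-scan value plus the last input element.
    reduction = output[-1] + data[-1]
    return output, reduction


# Example: flags marking which entries are valid (1) or not (0).
valid_flags = [1, 0, 1, 1, 0]
positions, valid_count = exclusive_scan_with_reduction(valid_flags)
print(positions, valid_count)
```

In this sketch, `positions[i]` gives the output slot a valid entry `i` would be compacted into, and `valid_count` is the number of valid entries. An inclusive scan would not need the extra step, since its last element is already the total.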