
Copyfromto #30

Merged: 22 commits merged into master on May 14, 2017

Conversation

eric-haibin-lin (Owner):

Cast storage during CopyFromTo if the storage types don't match. Now we have a temporary way to test dot_backward.
Also added FResource for dot_backward. @reminisce
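
In user-facing terms the behavior amounts to a storage cast followed by a plain copy. A rough sketch with the later released Python API (mx.nd.cast_storage and NDArray.tostype postdate this PR, so this illustrates the semantics rather than the PR's C++ code):

    import mxnet as mx

    src = mx.nd.array([[0, 1], [2, 0]]).tostype('csr')  # sparse source
    dst = mx.nd.zeros((2, 2))                           # dense destination

    # Cast first when the storage types differ, then copy as usual.
    tmp = src if src.stype == dst.stype else mx.nd.cast_storage(src, stype=dst.stype)
    tmp.copyto(dst)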

@eric-haibin-lin (Owner, Author):

@reminisce I changed the CSR Python API argument order from (row_idx, indptr) to (indptr, row_idx) so that it is consistent with the backend order, to avoid confusion.
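
For reference, these are the CSR auxiliary arrays in the new backend-consistent order, illustrated with plain numpy (a generic sketch of the format, not this repo's constructor):

    import numpy as np

    # A 3x4 csr matrix: row i's values sit in data[indptr[i]:indptr[i+1]].
    indptr = np.array([0, 1, 2, 3])   # num_rows + 1 entries; now passed first
    idx    = np.array([0, 2, 1])      # column index of each stored value
    data   = np.array([1.0, 2.0, 3.0])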

Commits added to this PR include "sparse embedding unit test pass" and a merge commit noting:

Conflicts:
	src/operator/tensor/elemwise_unary_op.h
	tests/cpp/ndarray_test.cc
	tests/python/unittest/test_sparse_ndarray.py
	tests/python/unittest/test_sparse_operator.py

// Diff excerpt: a cheap hint for whether a sparse NDArray holds data yet.
auto stype = storage_type();
CHECK_NE(stype, kDefaultStorage);
if (stype == kRowSparseStorage || stype == kCSRStorage) {
  // An empty first aux array (row indices for row_sparse, indptr for csr)
  // is taken to mean nothing has been stored yet.
  return aux_shape(0).Size() == 0;

Collaborator:

For a csr, is it more correct to use indptr.Size() == num_rows + 1 and indptr[num_rows] == 0 to determine whether it's a zero csr? aux_shape(0).Size() == 0 for a csr means more like "this csr has not been initialized" rather than "this csr is a zero matrix".


eric-haibin-lin (Owner, Author):

Yes, that's why I commented that it's just a hint for whether the ndarray is initialized (an uninitialized one is treated as all zeros). I assume this function will be called quite often when implementing various sparse operators. indptr[num_rows] is more accurate, but it involves reading memory from the GPU, which is more expensive.
Usually a csr comes from user input and is non-zero, so this check works for most cases. Maybe I should rename it to storage_is_initialized to avoid confusion?


Collaborator:

Agree that renaming it to storage_is_initialized makes more sense.
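
The difference between the two checks, sketched in Python over a bare indptr array (the helper names are hypothetical, for illustration only):

    import numpy as np

    def csr_storage_is_initialized(indptr):
        # The cheap hint from the diff: has the indptr aux array been allocated?
        return indptr.size != 0

    def csr_is_zero_matrix(indptr):
        # The accurate check: indptr has num_rows + 1 entries and its last
        # entry counts the stored values, so 0 there means a zero matrix.
        # Reading it on GPU costs a device-memory read, hence the cheap hint.
        return indptr.size == 0 or indptr[-1] == 0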

// Diff excerpt: the new Chunk constructor for non-default storage
// (the first line is the tail of the preceding constructor).
      NDArrayStorageType storage_type_)
// Constructor for a non-default storage chunk
Chunk(NDArrayStorageType storage_type_, const TShape &storage_shape_, Context ctx_,
      bool delay_alloc_, int dtype, std::vector<int> aux_types_,

Collaborator:

std::vector<int> aux_types_ --> const std::vector<int>& aux_types_ (pass by const reference to avoid copying the vector)


eric-haibin-lin (Owner, Author):

good point

eric-haibin-lin merged commit 1103a9a into master on May 14, 2017.
eric-haibin-lin deleted the copyfromto branch on June 15, 2017.
Commits pushed on Apr 4, 2018 and on Jul 10, 2018 later referenced this pull request; the Jul 10 commit carried the following squashed commit messages:
* Test input a graph.

* Update foreach to execute the subgraph.

* print inputs/outputs in foreach.

* Remove print.

* add test code for foreach.

* exec foreach outside the engine.

* Implements forward of foreach.

* Add support for variable numbers of inputs and outputs.

* Add a python wrapper for foreach.

* Fix the order of inputs.

* add test with lstm.

* hide C version of foreach.

* fix a bug temporarily.

* Test free variables.

* change for the new interface of InputGraph attribute.

* Add attribute to the subgraph.

* Handle free variables.

* Get all input symbols of a subgraph.

* Fix shape, dtype and storage inference.

* reorganize the output of foreach.

* Add a gluon RNN unroll with symbol foreach.

* remove unnecessary print.

* have imperative and symbolic foreach.

* Fix an error after moving foreach.

* Fix imperative foreach

* Fix a minor problem.

* Use CachedOp to execute subgraph.

* update TODO.

* make foreach op use FStatefulComputeEx.

TODO we need to change stateful executor to handle subgraph.

* Add backward.

* Fix bugs.

* enable backward test in lstm.

* Fix a bug in foreach backward for free variables.

* change for the new CachedOp.

* Detect the backward computation.

* Fix bugs in foreach.

* fix tests.

* update tests.

* check state shape.

* enable nested foreach.

* remove print.

* fix a bug in test.

* handle infer storage type for backward.

* address comments.

* address comments.

* move some common functions out.

* address comments.

* fix lint.

* Fix lint.

* add doc.

* undo modification in imperative.h

* add doc and remove example code.

* fix lint.

* fix lint.

* Fix lint.

* make nd.foreach and sym.foreach consistent.

* fix compile error.

* address comments.

* update.

* check for loop only works for dense arrays.

* move control flow op out of nn/

* fix include.

* add a test in gluon.

* work for GPU.

* small fix.

* remove subgraph_name

* create loop state for reuse in the future.

* move code.

* Revert "remove subgraph_name"

This reverts commit 977f562.

* cut graph.

* rename new var nodes.

* Fix tests.

* Fix bugs caused by ctypes (#29)

* Add save/load json in testcases for foreach (#30)

* support subgraph in stateful executor.

* Fix compilation.

* fix a bug when a subgraph has variable nodes.

* Fix a bug of getting symbols.

* copy var nodes.

* Fix getting op states.

* fix lint error.

* address comments.

* fix lint error.

* simplify the execution of subgraph in the main thread.

* fix lint error.

* avoid waiting for computation in each iteration.

* reuse cached op for inference.

* share memory across mini-batches.

* reuse memory.

reuse memory between iterations in inference.
reuse memory between mini-batches in training.

* add tests for multiple batches.

* remove entry.

* add benchmark for foreach.

* benchmark large batch size.

* Fix the benchmark for GPU.

* address comments.

* update shape/dtype/storage inference.

* update contrib API docs.

* support nested foreach.

* use a single CachedOp for all iterations.

* use large dim.

* update benchmark.

* update benchmark.

* update benchmark.

* update benchmark.

* return symbol arrays correctly in MXSymbolCutSubgraph.

* return symbol arrays in MXSymbolGetInputSymbols.

* fix lint error.

* use cachedop to infer storage in backward.

* fix scala API.

* update comments.

* fix scala.

* fix test.

* fix attribute name.

* move benchmark.

* fix the mapping of operator inputs/outputs and subgraph inputs/outputs.

* add tests for dtype/shape inference.

* reorganize tests.

* fix a bug of cutting NodeEntry.

When two node entries refer to the same output of a node, we should
create only one var node for these two node entries.

* fix lint error.

* handle the case that outputs are inputs.

* handle the case that inputs aren't used.

* handle the case without output data.

* fix a bug in foreach backward.

* fix a bug when there isn't output data.

* Fix lint error.

* test diff Gluon RNN cells.

* test all symbol RNN cells.

* adjust the test precision.

* Fix a bug in getting a list of variable names.

We can't get a stable list of variable names from a hashtable: the order isn't guaranteed, and Python 2 and Python 3 produce different orders.

* fix lint error.

* Test 1D array.

* fix a bug when subgraph inputs and outputs share NDArray.

* fix.

* fix

* add comments.
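
For orientation, the end result of this commit series is the foreach control-flow operator: it runs a subgraph over the leading axis of its input while threading loop states through the iterations. A minimal usage sketch against the released imperative API, mx.nd.contrib.foreach (the interface at this exact commit may differ):

    import mxnet as mx

    def step(data, states):
        # Loop body: runs once per slice along axis 0, returns (output, new_states).
        out = data + states[0]
        return out, [out]

    data = mx.nd.arange(6).reshape((3, 2))   # 3 iterations over 2-vectors
    init_states = [mx.nd.zeros((2,))]
    outputs, final_states = mx.nd.contrib.foreach(step, data, init_states)
    print(outputs)  # cumulative sums of the rows, stacked along axis 0

Per-iteration outputs come back stacked along axis 0 and the final states as a list, mirroring the (output, new_states) contract of the body.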