This repository has been archived by the owner on Jul 1, 2024. It is now read-only.

Add sparse sum, mean and dot operator support #162

Merged - 8 commits into awslabs:dev on Sep 5, 2018

Conversation

@kalyc commented on Aug 28, 2018

Summary

Add sparse support for sum, mean and dot operators
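
As a rough illustration of what this enables at the backend level, here is a minimal sketch assuming the MXNet backend with this change is active; the exact call pattern used by the merged tests may differ, and K.variable accepting a SciPy sparse matrix directly is part of the assumption.

import numpy as np
from scipy import sparse
from keras import backend as K

# A small SciPy CSR matrix as the sparse input.
x_sparse = sparse.csr_matrix(np.array([[0., 1., 0.],
                                       [2., 0., 3.]], dtype=np.float32))

x = K.variable(x_sparse)                           # sparse Keras variable (assumed supported)
w = K.variable(np.ones((3, 4), dtype=np.float32))  # dense kernel

print(K.eval(K.sum(x, axis=0)))    # sparse-aware sum
print(K.eval(K.mean(x, axis=0)))   # sparse-aware mean
print(K.eval(K.dot(x, w)))         # sparse x dense dot product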

Related Issues

Continuing #159 - Missing sparse operators

PR Overview

  • [y] This PR requires new unit tests [y/n] (make sure tests are included)
  • [n] This PR requires to update the documentation [y/n] (make sure the docs are up-to-date)
  • [y] This PR is backwards compatible [y/n]
  • [n] This PR changes the current API [y/n]

@roywei left a comment

Thanks for the contribution. A few comments inline, and one additional comment:

  • Can we test this on an end-to-end model, for example the one in the sparse benchmark scripts?

@@ -1702,6 +1702,107 @@ def test_sparse_concat(self):
        assert k_s_d.shape == k_d.shape
        assert_allclose(k_s_d, k_d, atol=1e-05)

    @pytest.mark.skipif((K.backend() != 'mxnet'),
                        reason='Testing only for MXNet backend')
    def test_sparse_sum(self):

Why keep a duplicate test here? We can just use the separate file for testing, as @sandeep-krishnamurthy suggested.

Author: My bad, looks like I didn't commit the changed backend_test file. Will update. I have moved the sparse tests to the mxnet_sparse_test file.

@@ -576,6 +594,15 @@ def eval(x):
    return x


def _forward_pass(x):

Suggest moving this function together with the other internal helper functions if we plan to reuse it.

Author: Done.

raise NotImplementedError('MXNet Backend: Sparse operations are not supported yet.')
if hasattr(tensor, 'tocoo'):
return tensor.toarray()
elif isinstance(tensor, mx.sym.Symbol):

Why do we need to check for an MXNet symbol? MXNet symbols only exist in the mxnet_backend file. From a Keras user's perspective, they will not use an MXNet symbol during model construction/training.

Author: I checked for all possible tensor data structures that Keras-MXNet supports in the is_tensor method.
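
For context, hasattr(tensor, 'tocoo') is duck typing for SciPy sparse matrices, while the isinstance check catches raw MXNet symbols. A small standalone illustration (the describe helper is purely illustrative and not part of the backend):

import mxnet as mx
import numpy as np
from scipy import sparse

def describe(tensor):
    # SciPy sparse matrices (csr_matrix, coo_matrix, ...) all expose tocoo(),
    # so hasattr(tensor, 'tocoo') identifies them without naming SciPy types.
    if hasattr(tensor, 'tocoo'):
        return 'scipy sparse, dense shape {}'.format(tensor.toarray().shape)
    # Raw MXNet symbols can also reach the backend helpers, hence the
    # isinstance(tensor, mx.sym.Symbol) branch in the diff above.
    if isinstance(tensor, mx.sym.Symbol):
        return 'mxnet symbol {}'.format(tensor.name)
    return 'other: {}'.format(type(tensor).__name__)

print(describe(sparse.csr_matrix(np.eye(3))))   # scipy sparse, dense shape (3, 3)
print(describe(mx.sym.Variable('x')))           # mxnet symbol x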

@kalyc (Author) commented on Aug 29, 2018

Addressed comments. Testing on the benchmark model is not yet done.

@roywei left a comment

LGTM, looking forward to seeing a sparse example.

@sandeep-krishnamurthy left a comment

Nice work! A few suggested changes; please fix before merging.

    if isinstance(_forward_pass(tensor)[0], mx.ndarray.sparse.CSRNDArray) or \
            isinstance(_forward_pass(tensor)[0], mx.ndarray.sparse.RowSparseNDArray):
        return True
    elif hasattr(tensor, 'tocoo'):

Nit: a very minor perf improvement if you swap these if/elif branches; hasattr(..) is faster than isinstance plus _forward_pass, etc.

Author: Done.
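
For reference, the reordered check might look roughly like this; _forward_pass here is a hypothetical stand-in for the PR's helper, and the surrounding function body is assumed rather than copied from the merged code:

import mxnet as mx


def _forward_pass(sym):
    # Hypothetical stand-in: bind the symbol and run one forward pass so the
    # storage type of the resulting NDArray(s) can be inspected.
    return sym.simple_bind(mx.cpu(), grad_req='null').forward()


def is_sparse(tensor):
    # Cheap duck-typing check first: SciPy sparse matrices expose tocoo().
    if hasattr(tensor, 'tocoo'):
        return True
    # Only fall back to the more expensive forward pass for MXNet symbols.
    if isinstance(tensor, mx.sym.Symbol):
        out = _forward_pass(tensor)[0]
        return isinstance(out, (mx.ndarray.sparse.CSRNDArray,
                                mx.ndarray.sparse.RowSparseNDArray))
    return False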

sym._keras_shape = tuple([d if d != 0 else None for d in shape])
sym._mxnet_placeholder = True
sym._uses_learning_phase = False
print(sym)

I think you missed removing this print.

Author: Removed.


class TestMXNetSparse(object):

    @pytest.mark.skipif((K.backend() != 'mxnet'),

You can skip all tests in a file this way - https://github.com/awslabs/keras-apache-mxnet/blob/master/tests/keras/layers/wrappers_test.py#L15 - and avoid a skipif for each test.

Author: Done.
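
For reference, the module-level skip from the linked wrappers_test.py pattern looks roughly like this (test names and bodies are illustrative):

import pytest
from keras import backend as K

# One module-level marker skips every test in this file on non-MXNet backends,
# replacing a per-test @pytest.mark.skipif decorator.
pytestmark = pytest.mark.skipif(K.backend() != 'mxnet',
                                reason='Testing only for MXNet backend')


class TestMXNetSparse(object):

    def test_sparse_sum(self):
        assert True  # illustrative placeholder body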

x_r = np.array([0, 2, 2, 3], dtype=np.int64)
x_c = np.array([4, 3, 2, 3], dtype=np.int64)

x_sparse_matrix = sparse.csr_matrix((x_d, (x_r, x_c)), shape=(4, 5))

Lines 25 to 29 repeat across this file eight times or so. Doesn't it make sense to extract a common generate_test_sparse_tensor function instead of repeating the same code block?

Author: Refactored.
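
For illustration, the extracted helper might look like this; the function name follows the reviewer's suggestion, the row/column indices and shape come from the diff above, and the non-zero values are arbitrary example data:

import numpy as np
from scipy import sparse


def generate_test_sparse_tensor():
    # Shared fixture: build the 4x5 CSR matrix once instead of repeating the
    # same data/row/column arrays in every test.
    x_d = np.array([1, 7, 2, 3], dtype=np.float32)   # arbitrary example values
    x_r = np.array([0, 2, 2, 3], dtype=np.int64)     # row indices from the diff
    x_c = np.array([4, 3, 2, 3], dtype=np.int64)     # column indices from the diff
    return sparse.csr_matrix((x_d, (x_r, x_c)), shape=(4, 5))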

@kalyc (Author) commented on Sep 5, 2018

Addressed refactoring comments.

@sandeep-krishnamurthy commented

Thanks. LGTM. Going ahead with merging these operators. Please follow up with an end-to-end example using a sparse tensor (maybe think of having this end-to-end example work for your benchmarking as well).

@sandeep-krishnamurthy merged commit d961bf3 into awslabs:dev on Sep 5, 2018

@kalyc (Author) commented on Sep 5, 2018

Yes, will do - we will need to update the existing benchmark script to use these operators as well.
