
Benchmark iterator, avoid redundant queue, remove managers. #3119

Merged: 77 commits merged into allenai:master on Aug 29, 2019

Conversation

@brendan-ai2 (Contributor) commented on Aug 7, 2019

  • Adds a script to benchmark iterators (see the timing-loop sketch after this list).
    • Average speed
    • Introspects queues
  • Removes a bottleneck when `MultiprocessDatasetReader` and `MultiprocessIterator` are used in conjunction.
    • Specifically, removes a redundant queue that was populated by a single process.
  • Removes the use of multiprocessing managers, which have significant overhead.
  • Results from benchmarking the iterator for `training_config/bidirectional_language_model.jsonnet`:
    • original code, no multiprocessing: 0.047 s/b over 10000 batches
    • original code, workers = 1: 0.073 s/b over 10000 batches
    • original code, workers = 10: 0.078 s/b over 10000 batches
    • this PR (-queue), workers = 1: 0.073 s/b over 10000 batches
    • this PR (-queue), workers = 10: 0.046 s/b over 10000 batches
    • this PR (-queue, -manager), workers = 1: 0.063 s/b over 10000 batches
    • this PR (-queue, -manager), workers = 10: 0.020 s/b over 10000 batches
    • Notably, previously we did not see any benefit from scaling to multiple workers. Now we do, albeit worse than linearly. More work required there.
    • To be clear, this is testing the reader + iterator in isolation, not training.
  • Related issues: #2962 ("How can I make batch instances (converting input text fields to tensor objects) faster or efficiently?") and #1890 ("Unable to see the gains by using MultiProcessDatasetReader").
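
As a rough illustration of what the benchmark measures: at its core it is just a timing loop that pulls batches from the iterator and reports seconds per batch (the `s/b` figure above). The sketch below is a minimal, hypothetical version, not the script added in this PR; the real script also introspects the multiprocessing queues, and the reader/iterator names in the usage comment are illustrative.

```python
import time
from typing import Any, Iterable


def average_seconds_per_batch(batches: Iterable[Any], num_batches: int = 10000) -> float:
    """Pull up to `num_batches` batches and return the average seconds
    per batch (the `s/b` figure reported above)."""
    start = time.perf_counter()
    count = 0
    for _ in batches:
        count += 1
        if count == num_batches:
            break
    return (time.perf_counter() - start) / max(count, 1)


# Hypothetical usage with an AllenNLP reader + iterator (names illustrative):
#
#     reader = MultiprocessDatasetReader(base_reader, num_workers=10)
#     iterator = MultiprocessIterator(base_iterator, num_workers=10)
#     batches = iterator(reader.read(data_path), num_epochs=1)
#     print(average_seconds_per_batch(batches), "s/b")
```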

@brendan-ai2 (Contributor, Author)

Thanks, feedback addressed!

@joelgrus (Contributor) left a comment

looks good, modulo a comment on a non-obvious test

# Half of 100 files * 4 sentences / file
i = 0
for instance in reader.read(self.identical_files_glob):
    if i == 200:

@joelgrus (Contributor) commented on this snippet:

I would probably just add a comment here, along the lines of "i == 200 means this is the 201st instance, which is too many, so break out and fail the test"

@brendan-ai2 (Contributor, Author):

Good call, done
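
For readers skimming the thread, the resolved snippet presumably reads roughly like the hedged reconstruction below. `reader` and `self.identical_files_glob` are the test fixtures from the excerpt above, the comment wording follows the suggestion, and the `assert False` failure mechanism is a guess.

```python
# Half of 100 files * 4 sentences / file
i = 0
for instance in reader.read(self.identical_files_glob):
    if i == 200:
        # i == 200 means this is the 201st instance, which is too many,
        # so break out and fail the test.
        assert False, "read more instances than expected"
    i += 1
```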

@brendan-ai2 (Contributor, Author)

Thanks for the review! (I'm dragging my heels on merging due to some test discrepancies between TC and my local machine. I'll ping for another pass if the fix is anything major.)

@@ -33,13 +36,51 @@ def instances() -> Iterator[Instance]:

output_queue.put(index)

# We need to ensure we've gotten all the tensors out of this queue before

@brendan-ai2 (Contributor, Author):

@joelgrus, if you're interested, this comment describes the fix for the issue I alluded to earlier. Given all these edge cases I see why you preferred using a Manager in the first version of this code!
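
For readers unfamiliar with the gotcha being referenced: a `multiprocessing.Queue` hands items to a background feeder thread, so joining a worker process while items it has `put()` are still buffered can block indefinitely. The `output_queue.put(index)` line above suggests workers signal completion with a sentinel; the standalone sketch below illustrates the general drain-before-join pattern under that assumption and is not the PR's actual code.

```python
import multiprocessing


def worker(index, output_queue):
    # Produce some items, then signal completion by putting our own index.
    for item in range(1000):
        output_queue.put(("data", item))
    output_queue.put(index)


def run(num_workers: int = 4) -> None:
    output_queue = multiprocessing.Queue()
    processes = [
        multiprocessing.Process(target=worker, args=(i, output_queue))
        for i in range(num_workers)
    ]
    for process in processes:
        process.start()

    # Drain the queue until every worker has signalled completion.  Joining
    # first could hang, because a worker does not exit cleanly while items
    # it put are still sitting in the queue's internal buffer.
    finished = 0
    while finished < num_workers:
        item = output_queue.get()
        if isinstance(item, int):
            finished += 1
        # else: hand the ("data", ...) item to whoever consumes it

    for process in processes:
        process.join()


if __name__ == "__main__":
    run()
```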

https://eli.thegreenplace.net/2009/06/12/safely-using-destructors-in-python/.
"""
for process in self.processes:
    process.terminate()

@brendan-ai2 (Contributor, Author):

@joelgrus, last bug should be fixed now. It was a nasty race in the tests due to stray child processes. I ended up using `__del__` here for that. I know that using `with` is preferred in Python, but I'm not sure we have that as an option given that we're attempting to fit the `Iterable` interface. We aren't holding any circular refs here, so this should be safe, IIUC. Certainly it fixes the problem empirically. Any concerns?

@joelgrus (Contributor):

if it works, I have no concerns 😇

(I am somewhat far from this code at this point, and I'm not really a multiprocessing expert)
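
For context on the pattern discussed above: an object that only promises the `Iterable` interface cannot force callers into a `with` block, so cleanup of stray worker processes has to happen in `__del__`, which is safe as long as the object is not part of a reference cycle (see the linked article). Below is a rough, hypothetical sketch of that arrangement; it is not the PR's actual class, and the names are made up.

```python
import multiprocessing
from typing import Iterator


def _produce(output_queue):
    # Stand-in for a real worker: put a few items, then a None sentinel.
    for item in range(10):
        output_queue.put(item)
    output_queue.put(None)


class QueuedBatches:
    """Illustrative Iterable that owns worker processes for its lifetime."""

    def __init__(self, num_workers: int = 2) -> None:
        self.queue = multiprocessing.Queue()
        self.processes = [
            multiprocessing.Process(target=_produce, args=(self.queue,))
            for _ in range(num_workers)
        ]
        for process in self.processes:
            process.start()

    def __iter__(self) -> Iterator[int]:
        # Yield items until every worker has sent its None sentinel.
        finished = 0
        while finished < len(self.processes):
            item = self.queue.get()
            if item is None:
                finished += 1
            else:
                yield item

    def __del__(self) -> None:
        # Callers may abandon iteration partway through, and we can't rely
        # on a `with` block because we only expose the Iterable interface,
        # so terminate any workers still alive.  Safe because this object
        # holds no circular references; see
        # https://eli.thegreenplace.net/2009/06/12/safely-using-destructors-in-python/
        for process in getattr(self, "processes", []):
            if process.is_alive():
                process.terminate()
```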

@brendan-ai2 merged commit bbaf1fc into allenai:master on Aug 29, 2019
@brendan-ai2 (Contributor, Author)

Thanks for the review!

@brendan-ai2 mentioned this pull request on Sep 27, 2019
reiyw pushed a commit to reiyw/allennlp that referenced this pull request Nov 12, 2019