
Use pathos for multiprocessing #7002

Closed
wants to merge 10 commits into from
Conversation

nonhermitian
Contributor

Summary

This fixes multiprocessing issues on macOS for Python 3.8 and higher. It should also allow parallel processing on Windows systems, but I cannot test this, so I have left that flag set to False.

Details and comments

@nonhermitian nonhermitian requested a review from a team as a code owner September 9, 2021 17:17
requirements.txt (review comment, outdated and resolved)
Co-authored-by: Matthew Treinish <[email protected]>
@mtreinish
Member

I'd like to do some testing on this locally (I used to be able to reproduce the Python 3.9 Linux hangs quite reliably), but if this can fix the issues we've had, I'm definitely a fan of this. From a quick scan of the docs, it also looks like it opens up an option for distributing work between multiple machines (something like an alternative approach to #3466), which would be cool to build on. But I wasn't familiar with pathos before now, so I need to do some reading :)

@nonhermitian
Contributor Author

Ahh, it looks like pathos recycles the process ID, and thus the unique names that we use in places need to be tweaked.

@nonhermitian
Contributor Author

Yeah, the blocker is the way Schedules define their unique names. They are no longer unique with this change, whereas circuit names still are. So I need to figure out how to port the circuit unique-name scheme over.
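A minimal sketch of the collision being described (hypothetical helper names, not Qiskit's actual code): a name derived from the worker's process ID repeats once the pool reuses a worker, while a per-process counter stays unique.

```python
import itertools
import os

def pid_based_name(prefix):
    # Breaks under pathos: a recycled worker process keeps the same PID,
    # so two objects built in that worker get the same "unique" name.
    return f"{prefix}-{os.getpid()}"

_counter = itertools.count()

def counter_based_name(prefix):
    # A counter-based scheme (like the one circuits use) keeps names
    # distinct even when the same worker process handles many jobs.
    return f"{prefix}-{next(_counter)}"
```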

@kdk
Member

kdk commented Sep 9, 2021

It should also allow parallel processing on Windows systems, but I cannot test this, so I have left that flag set to False.

Is the CI coverage we have for Windows sufficient to green-light this, assuming it passes CI? If not, maybe we can reach out to some Windows users for additional testing.

@nonhermitian
Contributor Author

The issue is not that it doesn't work in the usual sense, but rather that parallelism usually causes a big slowdown on Windows due to the lack of a fork command. I think we need an actual Windows user to test serial vs. parallel transpilation over several circuits to see how it pans out.
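A hypothetical benchmark a Windows user could run to make that comparison concrete (function and harness names are illustrative, and a small CPU-bound task stands in for transpiling a circuit). On Windows only the "spawn" start method is available, so each worker pays interpreter-startup and re-import cost that fork avoids:

```python
import multiprocessing as mp
import time

def busy(n):
    # CPU-bound stand-in for transpiling one circuit.
    return sum(i * i for i in range(n))

def compare(method, jobs, n=200_000):
    """Time serial vs. parallel execution of `jobs` tasks under the
    given multiprocessing start method ("spawn" on Windows)."""
    t0 = time.perf_counter()
    serial = [busy(n) for _ in range(jobs)]
    t_serial = time.perf_counter() - t0

    t0 = time.perf_counter()
    with mp.get_context(method).Pool() as pool:
        parallel = pool.map(busy, [n] * jobs)
    t_parallel = time.perf_counter() - t0

    assert serial == parallel
    return t_serial, t_parallel

if __name__ == "__main__":
    # Under "spawn", each worker starts a fresh interpreter and
    # re-imports this module, so parallel can lose for small workloads.
    print(compare("spawn", jobs=4))
```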

@mtreinish
Member

The issue is not that it doesn't work in the usual sense, but rather that parallelism usually causes a big slowdown on Windows due to the lack of a fork command. I think we need an actual Windows user to test serial vs. parallel transpilation over several circuits to see how it pans out.

I can test this in my Windows VM tomorrow. Last time I looked at it (#3547 (comment)) there was an improvement, but there were the normal spawn caveats (https://qiskit.org/documentation/release_notes.html#release-notes-0-17-0-known-issues) that made it potentially annoying to enable by default, which is why we documented the issue and let users opt in.

@nonhermitian
Contributor Author

So the issue with the unique names was the way we check whether the process is the main one. I simplified this logic to just compare process IDs, and everything works fine now.
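A sketch of the simplified check being described (assumed shape, not the exact Qiskit code): record the main process's ID at import time and compare against it, instead of relying on process names that pathos reuses. Note this relies on fork-style workers inheriting the parent's module state.

```python
import os

# PID of whichever process first imported this module; with fork-based
# workers (Linux/macOS) child processes inherit this value unchanged.
_MAIN_PID = os.getpid()

def is_main_process():
    # In the parent, the PIDs match; in a forked worker, os.getpid()
    # returns the child's own PID, so this is False there.
    return os.getpid() == _MAIN_PID
```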

@nonhermitian
Contributor Author

The one caveat here is that, when routines are defined in an interpreter (e.g. IPython or Jupyter notebooks), custom functions to be executed in parallel must contain their import statements. For scripts this is not the case:

This worker pool leverages the parallelpython (pp) module, and thus has many of the limitations associated with that module. The function f and the sequences in args must be serializable. The maps in this worker pool have full functionality when run from a script, but may be somewhat limited when used in the python interpreter. Both imported and interactively-defined functions in the interpreter session may fail due to the pool failing to find the source code for the target function.

https://pathos.readthedocs.io/en/latest/pathos.html#id3
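Following the caveat quoted above, a function meant to run in a pathos pool from an interactive session would carry its own imports (the function name is illustrative, and the pool call is shown only as a comment since it assumes pathos is installed):

```python
def squared_floor(x):
    # Import inside the body, per the pathos caveat: the worker process
    # re-executes this line instead of relying on the session's globals.
    import math
    return math.floor(x) ** 2

# With pathos installed, usage would look like:
#   from pathos.multiprocessing import ProcessPool
#   ProcessPool(nodes=4).map(squared_floor, [1.5, 2.5, 3.5])
print([squared_floor(v) for v in [1.5, 2.5, 3.5]])  # serial check: [1, 4, 9]
```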

@nonhermitian
Contributor Author

Hmm, this looks like it hangs in the Py38 tests in a similar spot to what @mtreinish pointed out on Py39. Perhaps this is not something that actually helps much. Not sure why visualizations would cause a crash in parallel processing, though.

@mtreinish
Member

Hmm, this looks like it hangs in the Py38 tests in a similar spot to what @mtreinish pointed out on Py39. Perhaps this is not something that actually helps much. Not sure why visualizations would cause a crash in parallel processing, though.

I actually don't think it's the visualization tests. When I looked into this before, for #6188, it was getting stuck on the deserialization of sympy expressions in parallel calls. I most commonly saw this in the algorithms tests. But since the process hangs, the other test workers continue to run until they finish their tests.

We really need to add a per-test-method timeout fixture to make debugging these easier. It won't work on Windows (since signals behave differently there), but that's OK as long as it doesn't interrupt normal test execution.
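A sketch of such a fixture on Unix (hypothetical class name; SIGALRM-based, which is why it would be skipped on Windows):

```python
import signal

class TestTimeout:
    """Context manager that turns a hung test into a TimeoutError.
    Relies on SIGALRM, which Windows does not provide."""

    def __init__(self, seconds):
        self.seconds = seconds

    def _handler(self, signum, frame):
        raise TimeoutError(f"test exceeded {self.seconds}s")

    def __enter__(self):
        self._old = signal.signal(signal.SIGALRM, self._handler)
        signal.alarm(self.seconds)
        return self

    def __exit__(self, exc_type, exc, tb):
        signal.alarm(0)                       # cancel the pending alarm
        signal.signal(signal.SIGALRM, self._old)
        return False                          # let TimeoutError propagate
```

A test body would then run as `with TestTimeout(60): ...`, failing loudly instead of silently hanging the worker.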

@nonhermitian
Contributor Author

Closing, as this probably will not work.

3 participants