[WiP] Parallel Rubocop, fixes #117 #175

jurriaan · 2013-05-15T13:58:41Z

Changed a lot of things to get it to work correctly.
Including some hacks in the specs.

Feedback would be appreciated 😄

coveralls · 2013-05-15T14:00:00Z

Coverage decreased (-0%) when pulling ce6599dc0cd0e9327c7bb0d9a3309828cd10985c on jurriaan:parallel into 9f34175 on bbatsov:master.

jurriaan · 2013-05-15T14:03:12Z

It's not bug-free yet. For example, it does not display errors.
But it's almost 4 times as fast as the non-parallel version on my system.

bbatsov · 2013-05-15T14:07:27Z

lib/rubocop/cli.rb

+           inspect_file(file, source, config, report)
+         end
+        report
+      end.each { |report| report.display unless report.empty? }


I'd extract this into a separate display_report method to make the code a bit clearer.

Why not integrate it with the display_summary method?

It's easier to test things when they are decoupled; that's why I suggested a separate method.
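
To illustrate the decoupling suggested above, here is a minimal sketch. The Report struct and the display_reports method are hypothetical stand-ins and are not taken from the RuboCop code base; the point is only that a display step which takes its output stream as an argument can be tested without running any inspection.

```ruby
# Hypothetical stand-in for a per-file report; not RuboCop's actual Report class.
Report = Struct.new(:file, :offences) do
  def empty?
    offences.empty?
  end

  def display(io)
    io.puts "== #{file} =="
    offences.each { |offence| io.puts offence }
  end
end

# Display step pulled out of the inspection loop so it can be tested on its own.
# Reports without offences are skipped, as in the diff above.
def display_reports(reports, io = $stdout)
  reports.each { |report| report.display(io) unless report.empty? }
end
```

A spec can then pass a StringIO instead of $stdout and assert on the captured text.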

bbatsov · 2013-05-15T14:10:45Z

4 times speedup is most impressive! And we'll get another big boost when @whitequark releases Parser 2.0, since it contains a crucial performance optimization. I'll have a look at the code now. :-)

bbatsov · 2013-05-15T14:17:48Z

The code looks good to me.

coveralls · 2013-05-15T14:25:55Z

Coverage remained the same when pulling 4f9515ddebbba99f5801c772a178798258b58261 on jurriaan:parallel into 9f34175 on bbatsov:master.

coveralls · 2013-05-15T14:26:08Z

Coverage decreased (-5.02%) when pulling 4f9515ddebbba99f5801c772a178798258b58261 on jurriaan:parallel into 9f34175 on bbatsov:master.

jonas054 · 2013-05-15T14:28:45Z

lib/rubocop/cli.rb

-          # no more checking in the file.
-          report << syntax_cop
-          @total_offences += syntax_cop.offences.count
+         # In case of a syntax error we just report that error and do


Oops. Incorrect indentation, right? Should be two spaces.

Fixing it in next commit. :)

jurriaan · 2013-05-15T14:33:00Z

The run method has way too many lines. I could extract this block into a separate method. What do you think?

bbatsov · 2013-05-15T14:46:33Z

@jurriaan Sounds reasonable to me. Breaking big methods into smaller ones is rarely a bad idea :-)

coveralls · 2013-05-15T14:59:12Z

Coverage decreased (-0%) when pulling 021793848f1d43c120be256cd6ff1ccbdbd28e05 on jurriaan:parallel into 9f34175 on bbatsov:master.

coveralls · 2013-05-16T16:21:19Z

Coverage decreased (-0%) when pulling 70be472 on jurriaan:parallel into 273318c on bbatsov:master.

jurriaan · 2013-05-17T15:44:32Z

@bbatsov I'm having some trouble rebasing this on the latest master. Errors are now stored in an instance variable of CLI, which is a problem because this branch uses processes instead of threads (so it uses all cores, even with MRI Ruby). With Kernel#fork, instance variables are not shared between processes but copied, so if a cop fails in a child process, the failure is not reported correctly in the parent.

My previous solution was adding the errors to the reports. Should I change the current functionality so it does add them to the reports again? What do you think?
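
The copy-on-fork behaviour described above can be reproduced with a small sketch. The Inspector class is a hypothetical stand-in for CLI, and sending the errors back over a pipe with Marshal is just one possible way to return them to the parent; it is not the approach used in this branch. It assumes a platform where Kernel#fork is available (for example MRI on Linux or macOS).

```ruby
# Hypothetical stand-in for the CLI class; not RuboCop's actual code.
class Inspector
  attr_reader :errors

  def initialize
    @errors = []
  end

  # Does the work in a forked child. Whatever the child appends to @errors
  # ends up in the child's copy only, so the errors are serialized and sent
  # back to the parent through a pipe.
  def inspect_in_child(file)
    reader, writer = IO.pipe
    pid = fork do
      reader.close
      @errors << "An error occurred while inspecting #{file}"
      writer.write(Marshal.dump(@errors))
      writer.close
    end
    writer.close
    child_errors = Marshal.load(reader.read)
    reader.close
    Process.wait(pid)
    child_errors
  end
end

inspector = Inspector.new
returned = inspector.inspect_in_child('example.rb')
# inspector.errors is still empty in the parent; only `returned` holds the error.
```

Returning the errors as part of each per-file result, as with the reports, avoids relying on shared state in the first place.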

bbatsov · 2013-05-17T19:51:16Z

I think you should do some tests with threads on MRI - IO ops should be parallelized even there. You'd also do well to compare process vs thread performance on MRI/Rubinius and JRuby now that we support them all.

If we decide to use processes, storing the errors in the reports is a viable idea.

jurriaan · 2013-05-17T19:57:54Z

It's much faster if you use processes on MRI.
I tested it by running time rubocop on my rubocop branch.
On my system, MRI Ruby 2.0.0p0 takes 9.6 s with 8 threads and 4.6 s with 8 processes.

jurriaan · 2013-05-19T13:06:35Z

@bbatsov What do you think? I don't want to wait too long with rebasing.

bbatsov · 2013-05-19T13:10:31Z

@jurriaan I'm worried that spawning processes on JRuby (I guess this will spin up a few JVMs) might be very slow. Run a test on JRuby to make sure it performs OK, and then we can proceed.

Alternatively we might simply check the RUBY_ENGINE and select threads or processes based on it.
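
A minimal sketch of that alternative, assuming a hypothetical parallel_strategy helper (the name and the returned symbols are illustrative only). RUBY_ENGINE is a standard constant that is 'ruby' on MRI and 'jruby' on JRuby.

```ruby
# Hypothetical helper: choose how to parallelize based on the Ruby engine.
# JRuby gets threads, every other engine gets processes in this sketch.
def parallel_strategy(engine = RUBY_ENGINE)
  engine == 'jruby' ? :threads : :processes
end
```

Taking the engine name as a parameter keeps the choice easy to test on any interpreter.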

jurriaan · 2013-05-19T13:31:35Z

It's not possible to use threads at the moment, because @parser_tokens would be overwritten by concurrent threads.
Why not move the parsing out of the CLI class?
If you move the parsing-related code into its own class (one instance per file) and merge it with the report class, it would be much easier to run in parallel, and I think it would also be easier to maintain.
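
A rough sketch of the one-instance-per-file idea follows. FileInspection is a hypothetical class, and the whitespace split and line-length check are placeholders for real parsing and real cops. What it shows is that each thread works on its own object, so no shared instance variable gets overwritten.

```ruby
# Hypothetical per-file object: each instance owns its tokens and offences,
# so concurrent threads never write to shared state.
class FileInspection
  attr_reader :path, :tokens, :offences

  def initialize(path, source)
    @path = path
    @source = source
    @tokens = []
    @offences = []
  end

  def run
    # Whitespace splitting stands in for real parsing here.
    @tokens = @source.split
    @source.each_line.with_index(1) do |line, number|
      @offences << "#{path}:#{number}: line too long" if line.chomp.length > 79
    end
    self
  end
end

sources = { 'a.rb' => "puts 1\n", 'b.rb' => "x = #{'1' * 90}\n" }
threads = sources.map do |path, source|
  Thread.new { FileInspection.new(path, source).run }
end
# Thread#value joins each thread and returns the block's result.
inspections = threads.map(&:value)
```

Because every result travels with its own object, the same class would work whether the caller uses threads or processes.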

bbatsov · 2013-05-19T19:05:53Z

@jurriaan That's a reasonable idea. Go ahead and extract the parsing logic.

whitequark · 2013-05-19T19:12:14Z

BTW, I think that parallel gem uses threads on JRuby.

jurriaan · 2013-05-19T19:19:29Z

Correct, fork() doesn't work by default on JRuby.


bbatsov · 2013-05-20T09:23:05Z

@jurriaan That means you'll definitely need to extract parsing away.

jurriaan · 2013-05-20T09:24:06Z

@bbatsov I'll create a separate pull request for that when I have time :)


Parallel Rubocop

70be472

bbatsov closed this Jul 3, 2013

josh mentioned this pull request Dec 16, 2016

WIP Parallel Rubocop #3794

Closed

11 tasks
