How can I reuse the same chromium instance? #100

Richacinas · 2021-01-20T07:58:53Z

I would like to use the same Chromium instance every time, so only a new tab/page would be open for a new request.

Is that possible with Grover?

Thanks a lot

Richacinas · 2021-01-20T08:16:53Z

I have the feeling that my process is taking up so long because this library is creating a new Chromium instance over and over. Does it make sense?

abrom · 2021-01-20T08:43:10Z

Not really practical with how the initialisation of the Grover instance occurs (ie the content to be rendered passed through the initialiser) and somewhat goes against the direction the project has been moving.

In #53 the schmooze gem was removed due to the way it leaked Chromium processes (through how it uses the GC to clean up workers), however I can imagine it would be possible to use something like schmooze to achieve what you're after, I just don't see it happening within this project. Schmooze boots up a single NodeJS instance and would allow re-entrant calls, although TBH I'm not sure if it would support persisting the Puppeteer instances across calls. Another option would be to consider something like the Ruby Puppeteer port, but I found it wasn't as performant as calling to Chromium through NodeJS.

I do like to see Grover support as many use-cases as possible, but this feels like quite a significant side-step and not really something I see being in Grover.

drnic · 2021-07-06T04:09:06Z

#115 allows you to access a remote existing chromium. Helpful here?

feliperaul · 2021-11-02T12:25:40Z

@abrom Andrew, first of all, huge props for this amazing project.

I would greatly appreciate a way for Grover to use a long-running Puppeteer instance. The gem doesn't have to do all the work by itself, we could even use a systemd service or Ubuntu or something (and just provide the instructions in the readme for the ones that want to use it). I think that it would greatly reduce the time to generate PDFs since it would avoid the Chrome startup time every time someone clicks the "download PDF" link on our application.

abrom · 2021-11-02T13:26:46Z

Per my previous comments, I'm not convinced this is the path forward for Grover.. However some relatively simple tweaks would allow you to create a single Grover instance with a re-useable NodeJS/Puppeteer/Chromium setup.

See https://github.com/Studiosity/grover/compare/reuse-processor

The Grover instance method definitions need to change of course because you're going to want to pass different URL/options in per conversion!

grover = Grover.new
pdf1 = grover.to_pdf 'https://github.com'
pdf2 = grover.to_pdf 'https://www.google.com'

N.B

It uses a finaliser to clean up the NodeJS process so be aware that you're at the mercy of the Ruby garbage collector on that front RE process/memory cleanup..
I've wrapped the processor call in a Mutex to prevent any threading issues (given the interface can only be accessed by one thread at a time!).
I haven't tested this except for the cursory example above.. Use at your own risk!
This will not work with the middleware.. although you're welcome to tweak that if that's still something you need. You'd likely want to initialise the instance in the middleware initialiser, although I wouldn't be able to guarantee the behaviour with multi-threaded or multi-process web servers.. you'd need to make sure that any initialisation happens AFTER the thread/forked process is created!
I definitely have no intention to merge this and deploy as a part of the gem!

In terms of how much of an improvement it makes.. short answer.. some, but not as much as you'd think (if anything):

> grover = Grover.new
> start_time = Time.now; g.to_pdf 'https://www.google.com.au', path: '/tmp/foo.pdf'; end_time = Time.now; end_time - start_time
 => 3.06107 
> start_time = Time.now; g.to_pdf 'https://www.google.com.au', path: '/tmp/foo.pdf'; end_time = Time.now; end_time - start_time
 => 2.565773

Compared to the current code:

> start_time = Time.now; Grover.new('https://www.google.com.au').to_pdf('/tmp/foo.pdf'); end_time = Time.now; end_time - start_time
 => 3.108409 
> start_time = Time.now; Grover.new('https://www.google.com.au').to_pdf('/tmp/foo.pdf'); end_time = Time.now; end_time - start_time
 => 2.761781

... practically identical 😉

feliperaul · 2021-11-06T13:30:36Z

@abrom Thanks so much for the detailed answer, it would take me a long time to benchmark this.

abrom · 2021-11-06T14:32:39Z

Well the comparison was pretty simplistic. You'd really need to run it hundreds of times and compare not just the runtime but the memory usage etc too

abrom closed this as completed Jan 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How can I reuse the same chromium instance? #100

How can I reuse the same chromium instance? #100

Richacinas commented Jan 20, 2021

Richacinas commented Jan 20, 2021

abrom commented Jan 20, 2021

drnic commented Jul 6, 2021

feliperaul commented Nov 2, 2021

abrom commented Nov 2, 2021

feliperaul commented Nov 6, 2021

abrom commented Nov 6, 2021

How can I reuse the same chromium instance? #100

How can I reuse the same chromium instance? #100

Comments

Richacinas commented Jan 20, 2021

Richacinas commented Jan 20, 2021

abrom commented Jan 20, 2021

drnic commented Jul 6, 2021

feliperaul commented Nov 2, 2021

abrom commented Nov 2, 2021

feliperaul commented Nov 6, 2021

abrom commented Nov 6, 2021