Fix mem sort for HISAT2 output #158

apeltzer · 2019-02-18T13:41:33Z

Copying in the conversation from nf-core slack:

So we keep seeing some issues with the RNAseq workflow when sorting the hisat2 output - memory requested on the server seems to be higher than what is provided, thus the jobs get killed by the scheduler...
e.g. something like Excceed job memory limit 63348736 > 82914560
which happens here

rnaseq/main.nf

Line 676 in e837637

-@ ${task.cpus} $avail_mem \\

I'd suggest dropping this memory request, as I never really saw issues without it in other pipelines sorting BAM files...
Sarek e.g. does this somewhat similar thing to set it permanently to 2G https://github.com/SciLifeLab/Sarek/blob/88921e6944604d077afd8a5bc0a89b8e93937313/main.nf#L174
Suggestions ?

ewels · 2019-02-18T16:44:16Z

Wow, that's like 30% over... Are you sure that the final script is actually using the -m flag? Note that it can be empty depending on the config.

We should probably at least leave some overhead from that calculation though at the very least.

apeltzer · 2019-02-18T17:59:24Z

Hm, what I thought when looking at the command is that the -m <XYZ> flag isn't used at the moment. The command simply states the $avail_mem instead of using the -m $avail_mem value, which might cause some trouble too for big bam files? Maybe that is the real reason?

I'd also be happy to compute the avail_mem as is and then deduct something as you mentioned it - let me know and I'll adapt it, maybe trying first to just incorporate the -m switch ?

ewels · 2019-02-18T18:01:17Z

I think the -m flag is inside that variable, no? It's intentional - so that the -m flag isn't left empty if we don't know what memory the process has.

apeltzer · 2019-02-18T18:05:47Z

It is indeed - I changed to using - 100000000 as in hisat, star and other memory intensive processes to keep hopefully within the limits 👍 Sorry my bad, I didn't see it...

ewels · 2019-02-18T21:47:02Z

Nice! Is it possible for you to test this before merging to see if it fixes the problem? Or should we just merge?

apeltzer · 2019-02-18T22:02:06Z

I can let Silvia test tomorrow 👍

apeltzer · 2019-02-21T13:55:24Z

We're testing at the moment - will keep you posted on updates

apeltzer · 2019-02-22T12:26:58Z

Local testing revealed that this seems to work fine even for weird samples that were not running beforehand 👍

ewels · 2019-02-25T11:29:41Z

main.nf

@@ -671,11 +671,11 @@ if(params.aligner == 'hisat2'){
        file "where_are_my_files.txt"

        script:
-        def avail_mem = task.memory ? "-m ${task.memory.toBytes() / task.cpus}" : ''
+        def avail_mem = task.memory ? "-m ${(task.memory.toBytes() - 6000000000) / task.cpus}" : ''


Nice! Could you just add in some extra code that checks that this isn't a negative number / stupidly small? If so then it can just be a blank string and left unspecified again. I'm just thinking that on some small (eg. testing) envs, there may be under 6GB memory given to this process.

apeltzer added 2 commits February 18, 2019 14:39

Fix samtools sort taking too much memory

59eb9d3

Fix memory issues with hisat2 sorting

dbf783a

apeltzer requested a review from a team February 18, 2019 13:41

Fix avail_mem usage hopefully

52bef54

Deduct - 100000000 to fix memory issues hopefully

3663260

remove the -m part

92e53ee

Increase deduction

63354e1

apeltzer added 5 commits February 22, 2019 09:16

use more cores (1 i/o, 4 sorting)

1f8de20

Use safer defaults

60e6574

Revert to using 4 cpus

0d8da06

Better 5 down

3bdfa9f

Use 6GB down

a05c5bf

ewels reviewed Feb 25, 2019

View reviewed changes

Add check for enough memory to hisat2sort

afb19ad

ewels approved these changes Feb 28, 2019

View reviewed changes

ewels merged commit fa57e21 into nf-core:dev Feb 28, 2019

This was referenced Mar 15, 2019

samtools sort requests too much memory nf-core/methylseq#81

Closed

Leave some spare memory for samtools nf-core/methylseq#82

Merged

apeltzer deleted the fix-mem-sort branch June 8, 2019 14:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix mem sort for HISAT2 output #158

Fix mem sort for HISAT2 output #158

apeltzer commented Feb 18, 2019

ewels commented Feb 18, 2019

apeltzer commented Feb 18, 2019

ewels commented Feb 18, 2019

apeltzer commented Feb 18, 2019 •

edited

Loading

ewels commented Feb 18, 2019

apeltzer commented Feb 18, 2019

apeltzer commented Feb 21, 2019

apeltzer commented Feb 22, 2019

ewels Feb 25, 2019

apeltzer Feb 25, 2019

Fix mem sort for HISAT2 output #158

Fix mem sort for HISAT2 output #158

Conversation

apeltzer commented Feb 18, 2019

ewels commented Feb 18, 2019

apeltzer commented Feb 18, 2019

ewels commented Feb 18, 2019

apeltzer commented Feb 18, 2019 • edited Loading

ewels commented Feb 18, 2019

apeltzer commented Feb 18, 2019

apeltzer commented Feb 21, 2019

apeltzer commented Feb 22, 2019

ewels Feb 25, 2019

Choose a reason for hiding this comment

apeltzer Feb 25, 2019

Choose a reason for hiding this comment

apeltzer commented Feb 18, 2019 •

edited

Loading