problem parsing kallisto abundance.h5 #56

Closed
fruce-ki opened this issue May 9, 2018 · 4 comments

fruce-ki commented May 9, 2018

This issue seems to be unrelated to #55.

EDIT:
If any users experience this error, please let us know. As it stands, I am unable to pinpoint its cause. I only know that it is not caused by RATs.
/EDIT

h5read('./kallisto_quant/Hs_GRCh37.67.1/abundance.h5', '/aux/ids/') succeeds on my Mac but fails on both the login node and the HPC nodes.
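
For anyone trying to check whether their own system is affected, the following is a minimal sketch of a guarded read. It assumes only that rhdf5 is installed. `try_h5read` and its `reader` argument are hypothetical names introduced here for illustration and are not part of RATs or rhdf5. The sketch does not diagnose the cause, it only reports whether the read errored or came back empty.

```r
# Hypothetical helper: attempt to read a dataset from an HDF5 file and
# report a failure or an empty result instead of aborting the session.
# `reader` defaults to rhdf5::h5read and is only evaluated when needed.
try_h5read <- function(h5file, name, reader = rhdf5::h5read) {
  out <- tryCatch(
    reader(h5file, name),
    error = function(e) {
      message("Read of ", name, " failed: ", conditionMessage(e))
      NULL
    }
  )
  # length(NULL) is 0, so this covers both an error and empty content.
  if (length(out) == 0) {
    message("No content returned for ", name)
  }
  out
}

# Usage with the path from this issue (adjust to your own kallisto output):
# ids <- try_h5read('./kallisto_quant/Hs_GRCh37.67.1/abundance.h5', '/aux/ids')
```

If the same file reads correctly on one machine and not on another, attaching the message printed here together with the output of sessionInfo() would be useful input for this issue.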

Mac session:

R version 3.4.1 (2017-06-30)
Platform: x86_64-apple-darwin15.6.0 (64-bit)
Running under: macOS High Sierra 10.13.4

Matrix products: default
BLAS: /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
LAPACK: /Library/Frameworks/R.framework/Versions/3.4/Resources/lib/libRlapack.dylib

locale:
[1] en_GB.UTF-8/en_GB.UTF-8/en_GB.UTF-8/C/en_GB.UTF-8/en_GB.UTF-8

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
[1] rhdf5_2.22.0        data.table_1.10.4-3

loaded via a namespace (and not attached):
[1] zlibbioc_1.24.0 compiler_3.4.1  tools_3.4.1   

Ningal session:

R version 3.4.3 (2017-11-30)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: CentOS release 6.8 (Final)

Matrix products: default
BLAS: /homes/kfroussios/local_installs/miniconda3/envs/mybasics/lib/R/lib/libRblas.so
LAPACK: /homes/kfroussios/local_installs/miniconda3/envs/mybasics/lib/R/lib/libRlapack.so

locale:
 [1] LC_CTYPE=en_GB.UTF-8       LC_NUMERIC=C              
 [3] LC_TIME=en_GB.UTF-8        LC_COLLATE=en_GB.UTF-8    
 [5] LC_MONETARY=en_GB.UTF-8    LC_MESSAGES=en_GB.UTF-8   
 [7] LC_PAPER=en_GB.UTF-8       LC_NAME=C                 
 [9] LC_ADDRESS=C               LC_TELEPHONE=C            
[11] LC_MEASUREMENT=en_GB.UTF-8 LC_IDENTIFICATION=C       

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
[1] rhdf5_2.22.0         data.table_1.10.4-3  RevoUtils_10.0.8    
[4] RevoUtilsMath_10.0.1

loaded via a namespace (and not attached):
[1] zlibbioc_1.24.0 compiler_3.4.3 

HPC session:

R version 3.4.3 (2017-11-30)
Platform: x86_64-conda_cos6-linux-gnu (64-bit)
Running under: CentOS Linux 7 (Core)

Matrix products: default
BLAS/LAPACK: /homes/kfroussios/local_installs/miniconda3/envs/nodebasics/lib/R/lib/libRblas.so

locale:
 [1] LC_CTYPE=en_GB.UTF-8       LC_NUMERIC=C              
 [3] LC_TIME=en_GB.UTF-8        LC_COLLATE=en_GB.UTF-8    
 [5] LC_MONETARY=en_GB.UTF-8    LC_MESSAGES=en_GB.UTF-8   
 [7] LC_PAPER=en_GB.UTF-8       LC_NAME=C                 
 [9] LC_ADDRESS=C               LC_TELEPHONE=C            
[11] LC_MEASUREMENT=en_GB.UTF-8 LC_IDENTIFICATION=C       

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
[1] rhdf5_2.22.0      data.table_1.11.0

loaded via a namespace (and not attached):
[1] zlibbioc_1.24.0 compiler_3.4.3 

fruce-ki added the labels bug (RATs not behaving the way it is intended to.) and urgent (Requires immediate resolution.) May 9, 2018
fruce-ki self-assigned this May 9, 2018

fruce-ki commented May 9, 2018

It does not seem to be an issue with the version of either rhdf5 or data.table, as both are identical between the Mac, which works, and ningal, which does not.

The only discernible difference is R itself: 3.4.1 on the machine that works and 3.4.3 on those that do not.

I will have to test with the latest R.

fruce-ki added this to the 0.6.4 milestone May 9, 2018

fruce-ki commented May 9, 2018

The issue is NOT reproducible on macOS with R 3.4.3 and rhdf5 2.22.0.
Could it be another mangled configuration on our HPC? But then why does it affect only the kallisto files and not the salmon/wasabi ones?

fruce-ki added the label user input needed (Asking for input from the users and public.) and removed the label urgent (Requires immediate resolution.) May 9, 2018

fruce-ki commented May 9, 2018

In the meantime, the workaround is to use kallisto's h5dump subcommand to export the .h5 file to plaintext, which can then be parsed manually in a loop to create the lists of tables that RATs requires.
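
A minimal sketch of that parsing loop follows. It assumes the file has already been exported with `kallisto h5dump -o <output_dir> abundance.h5`, and that the export contains one `bs_abundance_<n>.tsv` file per bootstrap with `target_id` and `est_counts` columns. Please verify that layout against your own kallisto output. `read_kallisto_bootstraps` is a hypothetical helper written for illustration, not a RATs function, and the table it returns may still need reshaping into whatever structure RATs expects.

```r
# Hypothetical helper: collect the bootstrap estimates from the plaintext
# files written by `kallisto h5dump` into one table per sample.
read_kallisto_bootstraps <- function(dump_dir, value_col = "est_counts") {
  # Assumed file naming: bs_abundance_0.tsv, bs_abundance_1.tsv, ...
  pattern <- "^bs_abundance_([0-9]+)\\.tsv$"
  files <- list.files(dump_dir, pattern = pattern, full.names = TRUE)
  if (length(files) == 0) {
    stop("No bs_abundance_*.tsv files found in ", dump_dir)
  }

  # Order files numerically by bootstrap index (10 must come after 9).
  idx <- as.integer(sub(pattern, "\\1", basename(files)))
  ord <- order(idx)
  files <- files[ord]
  idx <- idx[ord]

  result <- NULL
  for (i in seq_along(files)) {
    tab <- read.delim(files[i], stringsAsFactors = FALSE)
    if (is.null(result)) {
      # The first file defines the transcript order of the output.
      result <- data.frame(target_id = tab$target_id,
                           stringsAsFactors = FALSE)
    }
    # Match on transcript ID in case the row order differs between files.
    rows <- match(result$target_id, tab$target_id)
    result[[paste0("bs_", idx[i])]] <- tab[[value_col]][rows]
  }
  result
}

# Usage, one call per sample directory (the path is a placeholder):
# boots <- read_kallisto_bootstraps("./kallisto_quant/sample1/h5dump_out")
```

Calling the helper once per sample and collecting the results in a list, for example with lapply(), gives one table of bootstrap estimates per sample.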

fruce-ki modified the milestone: 0.6.4 May 9, 2018
fruce-ki added a commit that referenced this issue May 10, 2018
fruce-ki changed the title from "crash - NULL content from parsing kallisto abundance.h5" to "problem parsing kallisto abundance.h5" May 10, 2018
@fruce-ki

The workaround has now been implemented as part of fish4rodents(). It is now possible to load the bootstrap data from kallisto's plaintext format instead of extracting from the abundance.h5 file.

This does not fix the underlying problem, but I do not believe it is caused by an error in RATs, so I consider this issue closed from our perspective.

fruce-ki added the label urgent (Requires immediate resolution.) and removed the label user input needed (Asking for input from the users and public.) May 10, 2018