You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was pretty stoked to use vRhyme for my viral binning protocol, but unfortunately haven't been successful in running the program without any errors.
I initially wasn't able to supply my own sorted bam files that come from minimap2 -x sr (which is X3-4 times faster and also more accurate than bowtie2 - would recommend adding this as a mapping option). It would give me the same error as Issue #26https://github.com/AnantharamanLab/vRhyme/issues/26 and would not produce the coverage table. I then gave up and thought to instead try out using the internal bowtie2 aligner but still ran into a different error.
This time I checked thelog_vRhyme_paired_reads.tsv and the pairings are correct. I also checked and the vRhyme_coverage_values.tsv file is not empty .
Despite that I get the following log and error:
Date: 2024-01-18 (y-m-d)
Start: 14:32:05 (h:m:s)
Program: vRhyme v1.1.0
Time (min) | Log
--------------------------------------------------------------------
0.0 Initializing and validating vRhyme parameters
0.01 Paired end read file(s) identified. Running bowtie2 on 3 set of paired files
5.13 Extracting coverage information from BAM files
5.82 Coverage extraction complete. Generating coverage table
5.82 Performing pairwise coverage comparisons
5.86 Running Prodigal on filtered sequences
5.95 Generating codon usage features
5.95 Generating nucleotide features
5.99 Performing pairwise distance calculations
6.0 Performing machine learning classification
workflow/software/vRhyme/vRhyme/vRhyme:16: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html
import pkg_resources
Traceback (most recent call last):
File "workingdir/vRhyme/vRhyme/vRhyme", line 960, in<module>
net_data = machine_stuff.machine_stuff(distances, presets, model_method, pairs_machine, cohen_machine, iterations, cohen_check)
File "workingdir/vRhyme/vRhyme/scripts/machine_stuff.py", line 73, in machine_stuff
model_ET = pickle.load(read_model_ET)
File "sklearn/tree/_tree.pyx", line 728, in sklearn.tree._tree.Tree.__setstate__
File "sklearn/tree/_tree.pyx", line 1434, in sklearn.tree._tree._check_node_ndarray
ValueError: node array from the pickle has an incompatible dtype:
- expected: {'names': ['left_child', 'right_child', 'feature', 'threshold', 'impurity', 'n_node_samples', 'weighted_n_node_samples', 'missing_go_to_left'], 'formats': ['<i8', '<i8', '<i8', '<f8', '<f8', '<i8', '<f8', 'u1'], 'offsets': [0, 8, 16, 24, 32, 40, 48, 56], 'itemsize': 64}
- got : [('left_child', '<i8'), ('right_child', '<i8'), ('feature', '<i8'), ('threshold', '<f8'), ('impurity', '<f8'), ('n_node_samples', '<i8'), ('weighted_n_node_samples', '<f8')]
Any idea on how we can resolve this? I was reading some blogs online saying it's related to the version of scikit-learn. If that is the case can you include the version of the software in the conda installation? This way we are guaranteed to fully reproduce your outcomes.
If you need my raw sequence files I'm happy to somehow send them to you. I can also send you the bam files generated from minimap2.
Best,
Erfan
The text was updated successfully, but these errors were encountered:
erfanshekarriz
changed the title
ValueError: node array from the pickle has an incompatible dtype
Scikit-learn version issueValueError: node array from the pickle has an incompatible dtypeJan 18, 2024
Please help me update the installation instructions in the READ.md file. I would also strongly recommend noting the versions of all software above to allow longterm stability and reproducibility.
erfanshekarriz
changed the title
Scikit-learn version issueValueError: node array from the pickle has an incompatible dtype
Scikit-learn version issue ValueError: node array from the pickle has an incompatible dtypeJan 18, 2024
Hello there.
I was pretty stoked to use vRhyme for my viral binning protocol, but unfortunately haven't been successful in running the program without any errors.
I initially wasn't able to supply my own sorted bam files that come from
minimap2 -x sr
(which is X3-4 times faster and also more accurate than bowtie2 - would recommend adding this as a mapping option). It would give me the same error as Issue #26https://github.com/AnantharamanLab/vRhyme/issues/26
and would not produce the coverage table. I then gave up and thought to instead try out using the internal bowtie2 aligner but still ran into a different error.This is the command I ran:
This time I checked the
log_vRhyme_paired_reads.tsv
and the pairings are correct. I also checked and thevRhyme_coverage_values.tsv
file is not empty .Despite that I get the following log and error:
Any idea on how we can resolve this? I was reading some blogs online saying it's related to the version of scikit-learn. If that is the case can you include the version of the software in the conda installation? This way we are guaranteed to fully reproduce your outcomes.
If you need my raw sequence files I'm happy to somehow send them to you. I can also send you the bam files generated from minimap2.
Best,
Erfan
The text was updated successfully, but these errors were encountered: