Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Latest STAR release 2.7.0d requires genome index rebuild #2693

Open
rdocking opened this issue Feb 22, 2019 · 7 comments
Open

Latest STAR release 2.7.0d requires genome index rebuild #2693

rdocking opened this issue Feb 22, 2019 · 7 comments

Comments

@rdocking
Copy link
Contributor

Hi there,

I recently upgraded by bcbio install (I had to use the method recommended at #2676 to get around conda issues). Testing the new install, I saw the following error on a small-RNA run:

EXITING because of FATAL ERROR: Genome version: 20201 is INCOMPATIBLE with running STAR version: 2.7.0d
SOLUTION: please re-generate genome from scratch with running version of STAR, or with version: 2.7.0d

It looks like the new version of STAR (released a few days ago) requires rebuilds of the genome indices:

https://github.com/alexdobin/STAR/releases

For now, I'm going to try to downgrade like so:

bcbio_conda install STAR=2.6.1d

Looks like this will require rebuilds of a lot of genome indices - thanks for looking into this!

@rdocking
Copy link
Contributor Author

Looks like downgrading to 2.6.1d resolves the issue - might make sense to pin this version until rebuilt indices are available. Looks like the 2.7.x series of STAR also has a bunch of new single-cell analysis stuff incorporated.

roryk added a commit to chapmanb/cloudbiolinux that referenced this issue Feb 22, 2019
Workaround for bcbio/bcbio-nextgen#2693 until we can update the STAR
references.
@roryk
Copy link
Collaborator

roryk commented Feb 22, 2019

Thanks for the heads up @rdocking. Pretty awesome that you opened the issue and found a fix within 15 minutes. I pinned to 2.6.1d for now, but will leave this issue open until we update the indices and move to 2.7+.

@rdocking
Copy link
Contributor Author

Sounds good - happy to help!

@roryk
Copy link
Collaborator

roryk commented Aug 25, 2019

I'm working on updating this to go along with the work on arriba, so this should be all set by the next release.

@naumenko-sa
Copy link
Contributor

naumenko-sa commented May 29, 2020

STAR is still pinned to 2.6.1d
https://github.com/chapmanb/cloudbiolinux/blob/master/contrib/flavor/ngs_pipeline_minimal/packages-conda.yaml#L153

New star 2.7.3.a supports among other features spliced/unspliced alignments in SingleCell data:
https://github.com/alexdobin/STAR/releases

We don't store STAR index in a bucket and we don't download it via recipe.
Instead, we are generating it with
bcbio_nextgen.py upgrade -u --data -genomes hg38 --aligners star --cores 10
https://github.com/chapmanb/cloudbiolinux/blob/master/cloudbio/biodata/genomes.py#L736

So we can just unpin STAR and recommend to delete the old index and generate a new one,
or am I missing something?

@naumenko-sa naumenko-sa mentioned this issue May 29, 2020
90 tasks
@smoe
Copy link
Contributor

smoe commented Feb 10, 2021

Trying the above triggers

def _get_data_dir():
    base_dir = os.path.realpath(os.path.dirname(os.path.dirname(os.path.realpath(sys.executable))))
    if "anaconda" not in os.path.basename(base_dir) and "virtualenv" not in os.path.basename(base_dir):
        raise ValueError("Cannot update data for bcbio-nextgen not installed by installer.\n"
                         "bcbio-nextgen needs to be installed inside an anaconda environment \n"
                         "located in the same directory as the `genomes` directory.")
    return os.path.dirname(base_dir)

which is unfortunate - yes, I have fun with a bare metal installation without conda and a recent STAR install.

Would you accept a patch that points to the data directory via an environment variable and falls back to the current "anaconda" check if that environment var is not set / that dir not existing?

@roryk
Copy link
Collaborator

roryk commented Feb 10, 2021

Hi @smoe,

It has been on my TODO list forever to add support for the newer versions of STAR, thanks for bringing this up again. I think I'm going to support both versions for a while with some deprecation warnings, just to ease the transition for folks since rebuidling the indices is going to cause pain.

Yes-- I think a patch with that behavior makes sense.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants