Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AAstringset inputs #5

Closed
TS404 opened this issue May 18, 2018 · 5 comments
Closed

AAstringset inputs #5

TS404 opened this issue May 18, 2018 · 5 comments
Assignees
Labels
enhancement New feature or request

Comments

@TS404
Copy link

TS404 commented May 18, 2018

Ideally, the functions get_phobius_file, get_signalp_file, and get_targetp_file should be a bit more flexible on input (ie accepting AAstringset format).

@missuse
Copy link
Owner

missuse commented May 18, 2018

AAstringset will be added as input format for all functions and the mentioned *_file functions will
be changed to accept all objects as the other functions . This will be deployed some time next week.

@missuse
Copy link
Owner

missuse commented May 20, 2018

Adding AAstringset as input to the functions has proven to require a design decision I am not sure how to make.
To be able to process objects of this class, I would need to rely on BiocGenerics package. It should go in Suggests. Since it is not available on CRAN I would need to specify Remotes. When there are Remotes specified the package can not pass the CRAN check. Therefore if I add it now, when I decide to push to CRAN I will have to remove it (not a good option). Or add it and never push to CRAN (not a good option).
There is a third option: to rely that every user that would provide objects of AAstringset to functions already has BiocGenerics (this is probably true in 99.99% cases).

Do you have any thoughts on the matter?

On a more bright side, functions get_signalp, get_targetp and get_phobius were added. They accept as input all objects like scan_ag.

Usage:

signalp_pred <- get_signalp(data = at_nsp[1:50,],
                            sequence,
                            Transcript.id)

@TS404
Copy link
Author

TS404 commented May 21, 2018

A proposed solution: It's relatively easy to convert from an AAstringset to seqinr list format. Perhaps each function could convert the sequence input to list of strings as its first step. That way it's fully compatible with BioConducor, without requiring BiocGenerics. Something like:

seq.AAstringset <- Biostrings::readAAStringset(file.fa)

seq.list <- lapply(lapply(seq.AAstringset,
                          as.character),
                   strsplit, "")

@missuse
Copy link
Owner

missuse commented May 21, 2018

I am not sure how.
The S4 generics such as as.vector and unlist that work on XStringSet-class are in BiocGenerics.

@missuse missuse added the enhancement New feature or request label Oct 14, 2021
@missuse missuse self-assigned this Oct 14, 2021
missuse pushed a commit that referenced this issue Oct 14, 2021
@missuse
Copy link
Owner

missuse commented Oct 14, 2021

I decided its ok to never publish on CRAN a long time ago. I have no excuse for not implementing this till now. If you encounter any bugs please let me know.

@missuse missuse closed this as completed Oct 14, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants