mmseqs taxonomy based on GTDB + NR viruses + NR eukaryotes #849
Comments
Essentially you need:
With all of that you can call:
for the tsv files you have to check that the second column (containing the accessions) in the |
Hi
I would like to taxonomically classify my protein sequences using the GTDB taxonomy combined with the NCBI taxonomy for NR viruses and NR eukaryotes.
Do you have any suggestions on how I could build an mmseqs database consisting of these three databases and two taxonomies?
My current approach would be to create *.dmp files for GTDB according to your description and merge them with the *.dmp files of the NR, restricted to viruses and eukaryotes.
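
To make the merging idea concrete, here is a minimal sketch of how two NCBI-style `nodes.dmp` files could be combined without taxid collisions. It is not an MMseqs2 feature or the maintainers' method. The function names (`parse_dmp_line`, `format_dmp_line`, `shift_nodes`, `merge_nodes`) are hypothetical, and it assumes that both taxonomies use taxid `1` as their root and that fields are separated by `\t|\t` as in the NCBI taxdump format.

```python
SEP = "\t|\t"   # field separator used in NCBI-style .dmp files
END = "\t|"     # terminator at the end of every .dmp line


def parse_dmp_line(line):
    """Split one .dmp line into its fields."""
    line = line.rstrip("\n")
    if line.endswith(END):
        line = line[: -len(END)]
    return line.split(SEP)


def format_dmp_line(fields):
    """Join fields back into one .dmp line."""
    return SEP.join(fields) + END + "\n"


def shift_nodes(lines, offset, root_id="1"):
    """Shift all taxids in nodes.dmp lines by `offset`.

    The root node itself is dropped (the merged tree keeps the root of the
    first taxonomy), and its direct children are attached to `root_id`.
    """
    shifted = []
    for line in lines:
        fields = parse_dmp_line(line)
        taxid, parent = fields[0], fields[1]
        if taxid == root_id:
            continue  # assumption: both taxonomies share root taxid 1
        fields[0] = str(int(taxid) + offset)
        fields[1] = root_id if parent == root_id else str(int(parent) + offset)
        shifted.append(format_dmp_line(fields))
    return shifted


def merge_nodes(primary, secondary):
    """Append `secondary` nodes to `primary`, offset past the largest primary taxid."""
    offset = max(int(parse_dmp_line(line)[0]) for line in primary)
    return list(primary) + shift_nodes(secondary, offset)


if __name__ == "__main__":
    # Toy data only; real files would be read from disk.
    ncbi_subset = ["1\t|\t1\t|\tno rank\t|\n", "2\t|\t1\t|\tsuperkingdom\t|\n"]
    gtdb = ["1\t|\t1\t|\tno rank\t|\n", "2\t|\t1\t|\tdomain\t|\n", "3\t|\t2\t|\tphylum\t|\n"]
    for merged_line in merge_nodes(ncbi_subset, gtdb):
        print(merged_line, end="")
```

The same offset would have to be applied to the taxid column of the shifted taxonomy's `names.dmp` and to its accession-to-taxid mapping, otherwise the sequences would point at the wrong nodes.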