Skip to content

Genes are misassigned to "Envelope glycoprotein gp160" by eggnog #1159

@alexcritschristoph

Description

@alexcritschristoph

Are you using the latest release?
Yes

Describe the bug

I believe there is some weird data issue in the Eggnog distribution associated with funannotate. In my genome I have the gene "Parasitic phase-specific protein PSP-1" / "RTA3" (sequence at bottom, for curiosity).

But here "RTA3" gets matched to "Envelope glycoprotein gp160":
https://raw.githubusercontent.com/nextgenusfs/gene2product/master/ncbi_cleaned_gene_products.txt
(Search for RTA3 in that file)

So this causes the gene to incorrectly be annotated as Envelope glycoprotein gp160.

It looks like there are 12 of these faulty annotations in that file.

I don't see any examples in NCBI of this issue in submitted genomes. I am new to eukaryotic genome annotation for NCBI, and maybe the correct thing to do is just not used the eggnog annotations? I'll default to that for now.

What command did you issue?
funannotate annotate --eggnog

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions