What are the -PA suffix after gene names?
This notation represents translations. All translations are named after the gene, with addition of the suffix -Px, 'x' being a letter. Transcripts will have a "-Rx" suffix. This is based on the FlyBase notation.
E.g., given the AGP000123 gene in A. gambiae,
- the 1st transcript is "AGAP000123-RA" and the cognate translation is "AGAP000123-PA",
- the 2nd transcript is "AGAP000123-RB" and the cognate translation is "AGAP000123-PB",
- the 3rd transcript is "AGAP000123-RC" and the cognate translation is "AGAP000123-PC",
- etc.
It makes is easy to identify the gene given its translation identifier as you only need to remove the suffix to get the gene stable identifier.
If a transcript/translation is removed (because it was proved it doesn't exist), its ID will be remove but will NOT be reassigned. Thus the succession of letters might be interrupted.