The hypervariable Tpr multigene family of Theileria parasites, defined by a conserved, membrane-associated, C-terminal domain, includes several copies with defined orthology between species

Citation

Palmateer, N.C., Munro, J.B., Nagaraj, S., Crabtree, J., Pelle, R., Tallon, L., Nene, V., Bishop, R. and Silva, J.C. 2023. The hypervariable Tpr multigene family of Theileria parasites, defined by a conserved, membrane-associated, C-terminal domain, includes several copies with defined orthology between species. Journal of Molecular Evolution 91(6): 897–911.

Abstract/Description

Multigene families often play an important role in host-parasite interactions. One of the largest multigene families in Theileria parva, the causative agent of East Coast fever, is the T. parva repeat (Tpr) gene family. The function of the putative Tpr proteins remains unknown. The initial publication of the T. parva reference genome identified 39 Tpr family open reading frames (ORFs) sharing a conserved C-terminal domain. Twenty-eight of these are clustered in a central region of chromosome 3, termed the “Tpr locus”, while others are dispersed throughout all four nuclear chromosomes. The Tpr locus contains three of the four assembly gaps remaining in the genome, suggesting the presence of additional, as yet uncharacterized, Tpr gene copies. Here, we describe the use of long-read sequencing to attempt to close the gaps in the reference assembly of T. parva (located among multigene families clusters), characterize the full complement of Tpr family ORFs in the T. parva reference genome, and evaluate their evolutionary relationship with Tpr homologs in other Theileria species. We identify three new Tpr family genes in the T. parva reference genome and show that sequence similarity among paralogs in the Tpr locus is significantly higher than between genes outside the Tpr locus. We also identify sequences homologous to the conserved C-terminal domain in five additional Theileria species. Using these sequences, we show that the evolution of this gene family involves conservation of a few orthologs across species, combined with gene gains/losses, and species-specific expansions.

Permanent link to cite or share this item

External link to download this item

Author ORCID identifiers

Contributes to SDGs

SDG 2 - Zero hunger

CGIAR Action Areas

Share

Review Status

Peer Review

Language

en

AGROVOC Keywords

Subjects

Access Rights

Open Access Open Access

Usage Rights

CC-BY-4.0

Attention