The phylogenetic diversity of eukaryotic transcription

Author: Coulson Richard M. R.   Ouzounis Christos A.  

Publisher: Oxford University Press

ISSN: 1362-4962

Source: Nucleic Acids Research, Vol.31, Iss.2, 2003-01, pp. : 653-660

Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.

Previous Menu Next

Abstract

Eukaryotic transcription is a highly regulated process involving interactions between large numbers of proteins. To analyse the phylogenetic distribution of the components of this process, six crown eukaryote group genomes were queried with a reference set of transcription‐associated (TA) pro teins. On average, one in 10 proteins encoded by these genomes were found to be homologous to sequences in the reference set. Analysis of families identified using an accurate sequence clustering algorithm and containing both TA proteins and eukaryotic sequences showed that in two‐thirds of the families the homologues originate from a single kingdom. Furthermore, in only 15% of the fungal‐specific clusters are the homologues present in both budding and fission yeast, as compared with the metazoan‐specific clusters where 53% of the homologues originate from two or more species. Families whose members comprise general transcription factor or RNA polymerase subunits exhibit a low degree of taxon specificity, suggesting that the transcription initiation complex is highly conserved. This contrasts with transcriptional regulator families, that are primarily taxon‐specific, indicating proteins controlling gene activation exhibit considerable sequence diversity across the eukaryotic domain.