TY - JOUR
T1 - Large-Scale Molecular Evolutionary Analysis Uncovers a Variety of Polynucleotide Kinase Clp1 Family Proteins in the Three Domains of Life
AU - Saito, Motofumi
AU - Sato, Asako
AU - Nagata, Shohei
AU - Tamaki, Satoshi
AU - Tomita, Masaru
AU - Suzuki, Haruo
AU - Kanai, Akio
N1 - Funding Information:
The authors thank Ms. Megumi Tsurumaki for her technical support during the study and all the members of the RNA Group at the Institute for Advanced Biosciences of Keio University, Japan, and for their insightful discussions. This work was supported in part by a JSPS KAKENHI Grant-in-Aid for Scientific Research (C) (grant number 17K07517), a JSPS KAKENHI Grant-in-Aid for Scientific Research on Innovative Areas “Hadean Bioscience” (grant number 26106003) (to A.K.), and research funds from the Yamagata Prefectural Government and Tsuruoka City, Japan. The funding bodies played no role in the study design, the data collection or analysis, the decision to publish, or the preparation of the manuscript.
Publisher Copyright:
© 2019 The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
PY - 2019/9/25
Y1 - 2019/9/25
N2 - Clp1, a polyribonucleotide 5′-hydroxyl kinase in eukaryotes, is involved in pretRNA splicing and mRNA 3′-end formation. Enzymes similar in amino acid sequence to Clp1, Nol9, and Grc3, are present in some eukaryotes and are involved in prerRNA processing. However, our knowledge of how these Clp1 family proteins evolved and diversified is limited. We conducted a large-scale molecular evolutionary analysis of the Clp1 family proteins in all living organisms for which protein sequences are available in public databases. The phylogenetic distribution and frequencies of the Clp1 family proteins were investigated in complete genomes of Bacteria, Archaea and Eukarya. In total, 3,557 Clp1 family proteins were detected in the three domains of life, Bacteria, Archaea, and Eukarya. Many were from Archaea and Eukarya, but a few were found in restricted, phylogenetically diverse bacterial species. The domain structures of the Clp1 family proteins also differed among the three domains of life. Although the proteins were, on average, 555 amino acids long (range, 196-2,728), 122 large proteins with >1,000 amino acids were detected in eukaryotes. These novel proteins contain the conserved Clp1 polynucleotide kinase domain and various other functional domains. Of these proteins, >80% were from Fungi or Protostomia. The polyribonucleotide kinase activity of Thermus scotoductus Clp1 (Ts-Clp1) was characterized experimentally. Ts-Clp1 preferentially phosphorylates single-stranded RNA oligonucleotides (Km value for ATP, 2.5 μM), or single-stranded DNA at higher enzyme concentrations. We propose a comprehensive assessment of the diversification of the Clp1 family proteins and the molecular evolution of their functional domains.
AB - Clp1, a polyribonucleotide 5′-hydroxyl kinase in eukaryotes, is involved in pretRNA splicing and mRNA 3′-end formation. Enzymes similar in amino acid sequence to Clp1, Nol9, and Grc3, are present in some eukaryotes and are involved in prerRNA processing. However, our knowledge of how these Clp1 family proteins evolved and diversified is limited. We conducted a large-scale molecular evolutionary analysis of the Clp1 family proteins in all living organisms for which protein sequences are available in public databases. The phylogenetic distribution and frequencies of the Clp1 family proteins were investigated in complete genomes of Bacteria, Archaea and Eukarya. In total, 3,557 Clp1 family proteins were detected in the three domains of life, Bacteria, Archaea, and Eukarya. Many were from Archaea and Eukarya, but a few were found in restricted, phylogenetically diverse bacterial species. The domain structures of the Clp1 family proteins also differed among the three domains of life. Although the proteins were, on average, 555 amino acids long (range, 196-2,728), 122 large proteins with >1,000 amino acids were detected in eukaryotes. These novel proteins contain the conserved Clp1 polynucleotide kinase domain and various other functional domains. Of these proteins, >80% were from Fungi or Protostomia. The polyribonucleotide kinase activity of Thermus scotoductus Clp1 (Ts-Clp1) was characterized experimentally. Ts-Clp1 preferentially phosphorylates single-stranded RNA oligonucleotides (Km value for ATP, 2.5 μM), or single-stranded DNA at higher enzyme concentrations. We propose a comprehensive assessment of the diversification of the Clp1 family proteins and the molecular evolution of their functional domains.
KW - comprehensive identification
KW - experimental verification
KW - large protein
KW - molecular evolution
KW - multidomain protein
KW - protein family
UR - http://www.scopus.com/inward/record.url?scp=85072942663&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85072942663&partnerID=8YFLogxK
U2 - 10.1093/gbe/evz195
DO - 10.1093/gbe/evz195
M3 - Article
C2 - 31513263
AN - SCOPUS:85072942663
SN - 1759-6653
VL - 11
SP - 2713
EP - 2726
JO - Genome biology and evolution
JF - Genome biology and evolution
IS - 10
ER -