TY - JOUR
T1 - Transcription-associated mutagenesis increases protein sequence diversity more effectively than does random mutagenesis in Escherichia coli
AU - Kim, Hyunchul
AU - Lee, Baek Seok
AU - Tomita, Masaru
AU - Kanai, Akio
PY - 2010
Y1 - 2010
N2 - Background: During transcription, the nontranscribed DNA strand becomes single-stranded DNA (ssDNA), which can form secondary structures. Unpaired bases in the ssDNA are less protected from mutagens and hence experience more mutations than do paired bases. These mutations are called transcription-associated mutations. Transcription-associated mutagenesis is increased under stress and depends on the DNA sequence. Therefore, selection might significantly influence protein-coding sequences in terms of the transcription-associated mutability per transcription event under stress to improve the survival of Escherichia coli. Methodology/Principal Findings: The mutability index (MI) was developed by Wright et al. to estimate the relative transcription-associated mutability of bases per transcription event. Using the most stable fold of each ssDNA that have an average length n, MI was defined as (the number of folds in which the base is unpaired)/n ×(highest -ΔG of all n folds in which the base is unpaired), where ΔG is the free energy. The MI values show a significant correlation with mutation data under stress but not with spontaneous mutations in E. coli. Protein sequence diversity is preferred under stress but not under favorable conditions. Therefore, we evaluated the selection pressure on MI in terms of the protein sequence diversity for all the protein-coding sequences in E. coli. The distributions of the MI values were lower at bases that could be substituted with each of the other three bases without affecting the amino acid sequence than at bases that could not be so substituted. Start codons had lower distributions of MI values than did nonstart codons. Conclusions/Significance: Our results suggest that the majority of protein-coding sequences have evolved to promote protein sequence diversity and to reduce gene knockout under stress. Consequently, transcription-associated mutagenesis increases protein sequence diversity more effectively than does random mutagenesis under stress. Nonrandom transcription-associated mutagenesis under stress should improve the survival of E. coli.
AB - Background: During transcription, the nontranscribed DNA strand becomes single-stranded DNA (ssDNA), which can form secondary structures. Unpaired bases in the ssDNA are less protected from mutagens and hence experience more mutations than do paired bases. These mutations are called transcription-associated mutations. Transcription-associated mutagenesis is increased under stress and depends on the DNA sequence. Therefore, selection might significantly influence protein-coding sequences in terms of the transcription-associated mutability per transcription event under stress to improve the survival of Escherichia coli. Methodology/Principal Findings: The mutability index (MI) was developed by Wright et al. to estimate the relative transcription-associated mutability of bases per transcription event. Using the most stable fold of each ssDNA that have an average length n, MI was defined as (the number of folds in which the base is unpaired)/n ×(highest -ΔG of all n folds in which the base is unpaired), where ΔG is the free energy. The MI values show a significant correlation with mutation data under stress but not with spontaneous mutations in E. coli. Protein sequence diversity is preferred under stress but not under favorable conditions. Therefore, we evaluated the selection pressure on MI in terms of the protein sequence diversity for all the protein-coding sequences in E. coli. The distributions of the MI values were lower at bases that could be substituted with each of the other three bases without affecting the amino acid sequence than at bases that could not be so substituted. Start codons had lower distributions of MI values than did nonstart codons. Conclusions/Significance: Our results suggest that the majority of protein-coding sequences have evolved to promote protein sequence diversity and to reduce gene knockout under stress. Consequently, transcription-associated mutagenesis increases protein sequence diversity more effectively than does random mutagenesis under stress. Nonrandom transcription-associated mutagenesis under stress should improve the survival of E. coli.
UR - http://www.scopus.com/inward/record.url?scp=77956283292&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77956283292&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0010567
DO - 10.1371/journal.pone.0010567
M3 - Article
C2 - 20479947
AN - SCOPUS:77956283292
SN - 1932-6203
VL - 5
JO - PloS one
JF - PloS one
IS - 5
M1 - e10567
ER -