TY - JOUR
T1 - Measures of Compositional Strand Bias Related to Replication Machinery and its Applications
AU - Arakawa, Kazuharu
AU - Tomita, Masaru
PY - 2012/3
Y1 - 2012/3
N2 - The compositional asymmetry of complementary bases in nucleotide sequences implies the existence of a mutational or selectional bias in the two strands of the DNA duplex, which is commonly shaped by strand-specific mechanisms in transcription or replication. Such strand bias in genomes, frequently visualized by GC skew graphs, is used for the computational prediction of transcription start sites and replication origins, as well as for comparative evolutionary genomics studies. The use of measures of compositional strand bias in order to quantify the degree of strand asymmetry is crucial, as it is the basis for determining the applicability of compositional analysis and comparing the strength of the mutational bias in different biological machineries in various species. Here, we review the measures of strand bias that have been proposed to date, including the ΔGC skew, the B 1 index, the predictability score of linear discriminant analysis for gene orientation, the signal-to-noise ratio of the oligonucleotide bias, and the GC skew index. These measures have been predominantly designed for and applied to the analysis of replication-related mutational processes in prokaryotes, but we also give research examples in eukaryotes.
AB - The compositional asymmetry of complementary bases in nucleotide sequences implies the existence of a mutational or selectional bias in the two strands of the DNA duplex, which is commonly shaped by strand-specific mechanisms in transcription or replication. Such strand bias in genomes, frequently visualized by GC skew graphs, is used for the computational prediction of transcription start sites and replication origins, as well as for comparative evolutionary genomics studies. The use of measures of compositional strand bias in order to quantify the degree of strand asymmetry is crucial, as it is the basis for determining the applicability of compositional analysis and comparing the strength of the mutational bias in different biological machineries in various species. Here, we review the measures of strand bias that have been proposed to date, including the ΔGC skew, the B 1 index, the predictability score of linear discriminant analysis for gene orientation, the signal-to-noise ratio of the oligonucleotide bias, and the GC skew index. These measures have been predominantly designed for and applied to the analysis of replication-related mutational processes in prokaryotes, but we also give research examples in eukaryotes.
KW - Bacterial replication
KW - GC skew
KW - Nucleotide composition bias
KW - Replication-related mutations
UR - http://www.scopus.com/inward/record.url?scp=84856413643&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84856413643&partnerID=8YFLogxK
U2 - 10.2174/138920212799034749
DO - 10.2174/138920212799034749
M3 - Review article
C2 - 22942671
AN - SCOPUS:84856413643
SN - 1389-2029
VL - 13
SP - 4
EP - 15
JO - Current Genomics
JF - Current Genomics
IS - 1
ER -