TY - JOUR
T1 - Introns and reading frames
T2 - Correlation between splicing sites and their codon positions
AU - Tomita, Masaru
AU - Shimizu, Nobuyoshi
AU - Brutlag, Douglas L.
PY - 1996/11
Y1 - 1996/11
N2 - Computer analyses of the entire GenBank database were conducted to examine correlation between splicing sites and codon positions in reading frames. Intron insertion patterns (i.e., splicing site locations with respect to codon positions) have been analyzed for all of the 64 codons of all the eukaryote taxonomic groups: primates, rodents, mammals, vertebrates, invertebrates, and plants. We found that reading frames are interrupted by an intron at a codon boundary (as opposed to the middle of a codon) significantly more often than expected. This observation is consistent with the exon shuffling hypothesis, because exons that end at codon boundaries can be concatenated without causing a frame shift and thus are evolutionarily advantageous. On the other hand, when introns interrupt at the middles of codons, they exist in between the first and second bases much more frequently than between the second and third bases, despite the fact that boundaries between the first and second bases of codons are generally far more important than those between the second and third bases. The reason for this is not clear and yet to be explained. We also show that the length of an exon is a multiple of 3 more frequently than expected. Furthermore, the total length of two consecutive exons is also more frequently a multiple of 3. All the observations above are consistent with results recently published by Long, Rosenberg, and Gilbert (1995).
AB - Computer analyses of the entire GenBank database were conducted to examine correlation between splicing sites and codon positions in reading frames. Intron insertion patterns (i.e., splicing site locations with respect to codon positions) have been analyzed for all of the 64 codons of all the eukaryote taxonomic groups: primates, rodents, mammals, vertebrates, invertebrates, and plants. We found that reading frames are interrupted by an intron at a codon boundary (as opposed to the middle of a codon) significantly more often than expected. This observation is consistent with the exon shuffling hypothesis, because exons that end at codon boundaries can be concatenated without causing a frame shift and thus are evolutionarily advantageous. On the other hand, when introns interrupt at the middles of codons, they exist in between the first and second bases much more frequently than between the second and third bases, despite the fact that boundaries between the first and second bases of codons are generally far more important than those between the second and third bases. The reason for this is not clear and yet to be explained. We also show that the length of an exon is a multiple of 3 more frequently than expected. Furthermore, the total length of two consecutive exons is also more frequently a multiple of 3. All the observations above are consistent with results recently published by Long, Rosenberg, and Gilbert (1995).
KW - exon
KW - intron
KW - sequence analysis
KW - splicing
UR - http://www.scopus.com/inward/record.url?scp=0029858773&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0029858773&partnerID=8YFLogxK
U2 - 10.1093/oxfordjournals.molbev.a025687
DO - 10.1093/oxfordjournals.molbev.a025687
M3 - Article
C2 - 8896374
AN - SCOPUS:0029858773
SN - 0737-4038
VL - 13
SP - 1219
EP - 1223
JO - Molecular Biology and Evolution
JF - Molecular Biology and Evolution
IS - 9
ER -