TY - JOUR
T1 - Prediction of gene structures from RNA-seq data using dual decomposition
AU - Inatsuki, Tatsumu
AU - Sato, Kengo
AU - Sakakibara, Yasubumi
N1 - Funding Information:
This work was supported in part by a Grant-in-Aid for Scientific Research (A) (KAKENHI) (No.23241066) from the Japan Society for the Promotion of Science (JSPS) to Y.S. and K.S., a Grant-in-Aid for Scientific Research on Innovative Areas (KAKENHI) (No.221S0002) from the Ministry of Education, Culture, Sports, Science and Technology to Y.S., and a Grant-in-Aid for Scientific Research (C) (KAKENHI) (No.25330348) from the JSPS to K.S.
Publisher Copyright:
© 2016 Information Processing Society of Japan.
PY - 2016/3
Y1 - 2016/3
N2 - Numerous computational algorithms for predicting protein-coding genes from genomic sequences have been developed, and hidden Markov models (HMMs) have frequently been used to model gene structures. For eukaryotes, more complex gene structures such as introns make gene prediction much harder due to isoforms of transcripts by alternative splicing machinery. We develop a novel gene prediction method for eukaryote genomes that extends the traditional HMM-based gene prediction model by incorporating comprehensive evidence of transcripts by using RNA sequencing (RNA-seq) technology. We formulate gene prediction as an integer programming problem, and solve it by the dual decomposition technique. To confirm the utility of the proposed algorithm, computational experiments on benchmark datasets were conducted. The results show that our algorithm efficiently and effectively employs RNA-seq data in gene structure prediction.
AB - Numerous computational algorithms for predicting protein-coding genes from genomic sequences have been developed, and hidden Markov models (HMMs) have frequently been used to model gene structures. For eukaryotes, more complex gene structures such as introns make gene prediction much harder due to isoforms of transcripts by alternative splicing machinery. We develop a novel gene prediction method for eukaryote genomes that extends the traditional HMM-based gene prediction model by incorporating comprehensive evidence of transcripts by using RNA sequencing (RNA-seq) technology. We formulate gene prediction as an integer programming problem, and solve it by the dual decomposition technique. To confirm the utility of the proposed algorithm, computational experiments on benchmark datasets were conducted. The results show that our algorithm efficiently and effectively employs RNA-seq data in gene structure prediction.
KW - Dual decomposition
KW - Gene structure prediction
KW - Hidden markov models
KW - Lagrangian relaxation
KW - RNA-seq
UR - http://www.scopus.com/inward/record.url?scp=84975090081&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84975090081&partnerID=8YFLogxK
U2 - 10.2197/ipsjtbio.9.1
DO - 10.2197/ipsjtbio.9.1
M3 - Article
AN - SCOPUS:84975090081
SN - 1882-6679
VL - 9
SP - 1
EP - 6
JO - IPSJ Transactions on Bioinformatics
JF - IPSJ Transactions on Bioinformatics
ER -