TY - JOUR
T1 - Functional data analysis of the dynamics of gene regulatory networks
AU - Ando, Tomohiro
AU - Imoto, Seiya
AU - Miyano, Satoru
PY - 2004/1/1
Y1 - 2004/1/1
N2 - A new method for constructing gene networks from microarray time-series gene expression data is proposed in the context of Bayesian network approach. An essential point of Bayesian network modeling is the construction of the conditional distribution of each random variable. When estimating the conditional distributions from gene expression data, a common problem is that gene expression data contain multiple missing values. Unfortunately, many methods for constructing conditional distributions require a complete gene expression value and may lose effectiveness even with a few missing value. Additionally, they treat microarray time-series gene expression data as static data, although time can be an important factor that affects the gene expression levels. We overcome these difficulties by using the method of functional data analysis. The proposed network construction method consists of two stages. Firstly, discrete microarray time-series gene expression values are expressed as a continuous curve of time. To account for the time dependency of gene expression measurements and the noisy nature of the microarray data, P-spline nonlinear regression models are utilized. After this preprocessing step, the conditional distribution of each random variable is constructed based on functional linear regression models. The effectiveness of the proposed method is investigated through Monte Carlo simulations and the analysis of Saccharomyces cerevisiae gene expression data.
AB - A new method for constructing gene networks from microarray time-series gene expression data is proposed in the context of Bayesian network approach. An essential point of Bayesian network modeling is the construction of the conditional distribution of each random variable. When estimating the conditional distributions from gene expression data, a common problem is that gene expression data contain multiple missing values. Unfortunately, many methods for constructing conditional distributions require a complete gene expression value and may lose effectiveness even with a few missing value. Additionally, they treat microarray time-series gene expression data as static data, although time can be an important factor that affects the gene expression levels. We overcome these difficulties by using the method of functional data analysis. The proposed network construction method consists of two stages. Firstly, discrete microarray time-series gene expression values are expressed as a continuous curve of time. To account for the time dependency of gene expression measurements and the noisy nature of the microarray data, P-spline nonlinear regression models are utilized. After this preprocessing step, the conditional distribution of each random variable is constructed based on functional linear regression models. The effectiveness of the proposed method is investigated through Monte Carlo simulations and the analysis of Saccharomyces cerevisiae gene expression data.
KW - Bayesian networks
KW - Functional data analysis
KW - P-spline
KW - Smoothing
KW - Time-series gene expression data
UR - http://www.scopus.com/inward/record.url?scp=22944434182&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=22944434182&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-30478-4_7
DO - 10.1007/978-3-540-30478-4_7
M3 - Conference article
AN - SCOPUS:22944434182
SN - 0302-9743
VL - 3303
SP - 69
EP - 83
JO - Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
JF - Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)
T2 - International Symposium KELSI 2004: Knowledge Exploration in Life Science Informatics
Y2 - 25 November 2004 through 26 November 2004
ER -