TY - JOUR
T1 - A Bayesian data combination approach for repeated durations under unobserved missing indicators
T2 - Application to interpurchase-timing in marketing
AU - Igari, Ryosuke
AU - Hoshino, Takahiro
N1 - Funding Information:
We would like to appreciate the referees for very constructive and insightful comments that substantially improved our paper. We also thank that Intage Inc. provided us the dataset we used for the empirical analysis. This work was supported by JPSP KAKENHI ( 15J03947 , 26285151 and 17K18568 ) and Yoshida Hideo Memorial Foundation provided for the authors.
Publisher Copyright:
© 2018 Elsevier B.V.
PY - 2018/10
Y1 - 2018/10
N2 - Intermittent missingness in repeated duration analysis is common in applied studies, but has not been rigorously considered in statistics. Under intermittent missingness, whether any missing events exist between two observed events is unknown. In other words, the missing indicators are never observed. Thus, if there exist any missing events between two observed events, researchers observe only the cumulative duration of the two or more events. A quasi-Bayes estimation method that utilizes population-level information is used to appropriately estimate the parameters under unobserved intermittent missingness. The proposed model consists of the following: (1) a latent variable model, (2) a latent missing indicator model separating the true and composite durations, (3) mixtures of duration models, and (4) moment restriction from population-level information to deal with nonignorable intermittent missingness. A new estimation procedure is used to simultaneously combine likelihood and the objective function of GMM with the latent variables; this is called Bayesian data combination. The model is applied to the interpurchase duration in database marketing using the purchase history data of Japan; these data capture the purchase incidences and stores.
AB - Intermittent missingness in repeated duration analysis is common in applied studies, but has not been rigorously considered in statistics. Under intermittent missingness, whether any missing events exist between two observed events is unknown. In other words, the missing indicators are never observed. Thus, if there exist any missing events between two observed events, researchers observe only the cumulative duration of the two or more events. A quasi-Bayes estimation method that utilizes population-level information is used to appropriately estimate the parameters under unobserved intermittent missingness. The proposed model consists of the following: (1) a latent variable model, (2) a latent missing indicator model separating the true and composite durations, (3) mixtures of duration models, and (4) moment restriction from population-level information to deal with nonignorable intermittent missingness. A new estimation procedure is used to simultaneously combine likelihood and the objective function of GMM with the latent variables; this is called Bayesian data combination. The model is applied to the interpurchase duration in database marketing using the purchase history data of Japan; these data capture the purchase incidences and stores.
KW - Dirichlet process mixture model
KW - Intermittent missingness
KW - Latent variable modeling
KW - Population-level information
KW - Quasi-Bayes
UR - http://www.scopus.com/inward/record.url?scp=85047632446&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85047632446&partnerID=8YFLogxK
U2 - 10.1016/j.csda.2018.04.001
DO - 10.1016/j.csda.2018.04.001
M3 - Article
AN - SCOPUS:85047632446
SN - 0167-9473
VL - 126
SP - 150
EP - 166
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
ER -