TY - JOUR
T1 - Gene-based Hardy–Weinberg equilibrium test using genotype count data
T2 - application to six types of cancers
AU - Nishino, Jo
AU - Miya, Fuyuki
AU - Kato, Mamoru
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Background: An alternative approach to investigate associations between genetic variants and disease is to examine deviations from the Hardy–Weinberg equilibrium (HWE) in genotype frequencies within a case population, instead of case–control association analysis. The HWE analysis requires disease cases and demonstrates a notable ability in mapping recessive variants. Allelic heterogeneity is a common phenomenon in diseases. While gene-based case–control association analysis successfully incorporates this heterogeneity, there are no such approaches for HWE analysis. Therefore, we proposed a gene-based HWE test (gene-HWT) by aggregating single-nucleotide polymorphism (SNP)-level HWE test statistics in a gene to address allelic heterogeneity. Results: This method used only genotype count data and publicly available linkage disequilibrium information and has a very low computational cost. Extensive simulations demonstrated that gene-HWT effectively controls the type I error at a low significance level and outperforms SNP-level HWE test in power when there are multiple causal variants within a gene. Using gene-HWT, we analyzed genotype count data from a genome-wide association study of six cancer types in Japanese individuals and suggest DGKE and ANO3 as potential germline factors in colorectal cancer. Furthermore, FSTL4 was suggested through a combined analysis across the six cancer types, with particularly notable associations observed in colorectal and prostate cancers. Conclusions: These findings indicate the potential of gene-HWT to elucidate the genetic basis of complex diseases, including cancer.
AB - Background: An alternative approach to investigate associations between genetic variants and disease is to examine deviations from the Hardy–Weinberg equilibrium (HWE) in genotype frequencies within a case population, instead of case–control association analysis. The HWE analysis requires disease cases and demonstrates a notable ability in mapping recessive variants. Allelic heterogeneity is a common phenomenon in diseases. While gene-based case–control association analysis successfully incorporates this heterogeneity, there are no such approaches for HWE analysis. Therefore, we proposed a gene-based HWE test (gene-HWT) by aggregating single-nucleotide polymorphism (SNP)-level HWE test statistics in a gene to address allelic heterogeneity. Results: This method used only genotype count data and publicly available linkage disequilibrium information and has a very low computational cost. Extensive simulations demonstrated that gene-HWT effectively controls the type I error at a low significance level and outperforms SNP-level HWE test in power when there are multiple causal variants within a gene. Using gene-HWT, we analyzed genotype count data from a genome-wide association study of six cancer types in Japanese individuals and suggest DGKE and ANO3 as potential germline factors in colorectal cancer. Furthermore, FSTL4 was suggested through a combined analysis across the six cancer types, with particularly notable associations observed in colorectal and prostate cancers. Conclusions: These findings indicate the potential of gene-HWT to elucidate the genetic basis of complex diseases, including cancer.
KW - Allelic heterogeneity
KW - Cancer-related genes
KW - Gene-based analysis
KW - Genome-wide association study
KW - Hardy–Weinberg equilibrium test
KW - Recessive variants
UR - http://www.scopus.com/inward/record.url?scp=85218199130&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85218199130&partnerID=8YFLogxK
U2 - 10.1186/s12864-025-11321-6
DO - 10.1186/s12864-025-11321-6
M3 - Article
C2 - 39930364
AN - SCOPUS:85218199130
SN - 1471-2164
VL - 26
JO - BMC Genomics
JF - BMC Genomics
IS - 1
M1 - 124
ER -