TY - JOUR
T1 - Tutorial
T2 - a statistical genetics guide to identifying HLA alleles driving complex disease
AU - Sakaue, Saori
AU - Gurajala, Saisriram
AU - Curtis, Michelle
AU - Luo, Yang
AU - Choi, Wanson
AU - Ishigaki, Kazuyoshi
AU - Kang, Joyce B.
AU - Rumker, Laurie
AU - Deutsch, Aaron J.
AU - Schönherr, Sebastian
AU - Forer, Lukas
AU - LeFaive, Jonathon
AU - Fuchsberger, Christian
AU - Han, Buhm
AU - Lenz, Tobias L.
AU - de Bakker, Paul I.W.
AU - Okada, Yukinori
AU - Smith, Albert V.
AU - Raychaudhuri, Soumya
N1 - Publisher Copyright:
© 2023, Springer Nature Limited.
PY - 2023/9
Y1 - 2023/9
N2 - The human leukocyte antigen (HLA) locus is associated with more complex diseases than any other locus in the human genome. In many diseases, HLA explains more heritability than all other known loci combined. In silico HLA imputation methods enable rapid and accurate estimation of HLA alleles in the millions of individuals that are already genotyped on microarrays. HLA imputation has been used to define causal variation in autoimmune diseases, such as type I diabetes, and in human immunodeficiency virus infection control. However, there are few guidelines on performing HLA imputation, association testing, and fine mapping. Here, we present a comprehensive tutorial to impute HLA alleles from genotype data. We provide detailed guidance on performing standard quality control measures for input genotyping data and describe options to impute HLA alleles and amino acids either locally or using the web-based Michigan Imputation Server, which hosts a multi-ancestry HLA imputation reference panel. We also offer best practice recommendations to conduct association tests to define the alleles, amino acids, and haplotypes that affect human traits. Along with the pipeline, we provide a step-by-step online guide with scripts and available software (https://github.com/immunogenomics/HLA_analyses_tutorial). This tutorial will be broadly applicable to large-scale genotyping data and will contribute to defining the role of HLA in human diseases across global populations.
AB - The human leukocyte antigen (HLA) locus is associated with more complex diseases than any other locus in the human genome. In many diseases, HLA explains more heritability than all other known loci combined. In silico HLA imputation methods enable rapid and accurate estimation of HLA alleles in the millions of individuals that are already genotyped on microarrays. HLA imputation has been used to define causal variation in autoimmune diseases, such as type I diabetes, and in human immunodeficiency virus infection control. However, there are few guidelines on performing HLA imputation, association testing, and fine mapping. Here, we present a comprehensive tutorial to impute HLA alleles from genotype data. We provide detailed guidance on performing standard quality control measures for input genotyping data and describe options to impute HLA alleles and amino acids either locally or using the web-based Michigan Imputation Server, which hosts a multi-ancestry HLA imputation reference panel. We also offer best practice recommendations to conduct association tests to define the alleles, amino acids, and haplotypes that affect human traits. Along with the pipeline, we provide a step-by-step online guide with scripts and available software (https://github.com/immunogenomics/HLA_analyses_tutorial). This tutorial will be broadly applicable to large-scale genotyping data and will contribute to defining the role of HLA in human diseases across global populations.
UR - http://www.scopus.com/inward/record.url?scp=85165888860&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85165888860&partnerID=8YFLogxK
U2 - 10.1038/s41596-023-00853-4
DO - 10.1038/s41596-023-00853-4
M3 - Review article
C2 - 37495751
AN - SCOPUS:85165888860
SN - 1754-2189
VL - 18
SP - 2625
EP - 2641
JO - Nature Protocols
JF - Nature Protocols
IS - 9
ER -