journal article Open Access Dec 31, 2021

Statistical models and computational tools for predicting complex traits and diseases

Abstract
Predicting individual traits and diseases from genetic variants is critical to fulfilling the promise of personalized medicine. The genetic variants from genome-wide association studies (GWAS), including variants well below GWAS significance, can be aggregated into highly significant predictions across a wide range of complex traits and diseases. The recent arrival of large-sample public biobanks enables highly accurate polygenic predictions based on genetic variants across the whole genome. Various statistical methodologies and diverse computational tools have been introduced and developed to computed the polygenic risk score (PRS) more accurately. However, many researchers utilize PRS tools without a thorough understanding of the underlying model and how to specify the parameters for the best performance. It is advantageous to study the statistical models implemented in computational tools for PRS estimation and the formulas of parameters to be specified. Here, we review a variety of recent statistical methodologies and computational tools for PRS computation.
Topics

No keywords indexed for this article. Browse by subject →

References
54
[10]
PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses

Shaun Purcell, Benjamin Neale, Kathe Todd-Brown et al.

The American Journal of Human Genetics 10.1086/519795
[16]
Maier (2015)
[17]
Improved polygenic prediction by Bayesian multiple regression on summary statistics

Luke R. Lloyd-Jones, Jian Zeng, Julia Sidorenko et al.

Nature Communications 10.1038/s41467-019-12653-0
[18]
Regression Shrinkage and Selection Via the Lasso

Robert Tibshirani

Journal of the Royal Statistical Society Series B:... 10.1111/j.2517-6161.1996.tb02080.x
[20]
Regularization and Variable Selection Via the Elastic Net

Hui Zou, Trevor Hastie

Journal of the Royal Statistical Society Series B:... 10.1111/j.1467-9868.2005.00503.x
[33]
Locke (2015)
[35]
PRSice: Polygenic Risk Score software

Jack Euesden, Cathryn M. Lewis, Paul F. O’Reilly

Bioinformatics 10.1093/bioinformatics/btu848
[38]
(2015)
[39]
(2018)

Showing 50 of 54 references

Metrics
16
Citations
54
References
Details
Published
Dec 31, 2021
Vol/Issue
19(4)
Pages
e36
License
View
Cite This Article
Wonil Chung (2021). Statistical models and computational tools for predicting complex traits and diseases. Genomics & Informatics, 19(4), e36. https://doi.org/10.5808/gi.21053
Related

You May Also Like