TY - JOUR
T1 - Multiple polygenic score approach in colorectal cancer risk prediction
AU - Jiang, Shangqing Joyce
AU - Thomas, Minta
AU - Rosenthal, Elisabeth A.
AU - Phipps, Amanda I.
AU - Sakoda, Lori C.
AU - van Duijnhoven, Franzel J.B.
AU - Pellatt, Andrew J.
AU - Avery, Christy L.
AU - Berndt, Sonja I.
AU - Bishop, D. Timothy
AU - Castellví-Bel, Sergi
AU - Chan, Andrew T.
AU - Grant, Robert C.
AU - Gignoux, Chris
AU - Gsur, Andrea
AU - Gunter, Marc J.
AU - Haiman, Christopher A.
AU - Hoffmeister, Michael
AU - Jarvik, Gail P.
AU - Jenkins, Mark A.
AU - Keku, Temitope O.
AU - Küry, Sébastien
AU - Lee, Jeffrey K.
AU - Marchand, Loic Le
AU - Moreno, Victor
AU - Newcomb, Polly A.
AU - Newton, Christina C.
AU - Ogino, Shuji
AU - Palmer, Julie R.
AU - Pearlman, Rachel
AU - Qu, Conghui
AU - Schoen, Robert E.
AU - Um, Caroline Y.
AU - Van Guelpen, Bethany
AU - Visvanathan, Kala
AU - Vymetalkova, Veronika
AU - White, Emily
AU - Woods, Michael O.
AU - Platz, Elizabeth A.
AU - Brenner, Hermann
AU - Corley, Douglas A.
AU - Vogelaar, Iris Landorp
AU - Hsu, Li
AU - Peters, Ulrike
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/10/30
Y1 - 2025/10/30
N2 - Recent studies have demonstrated that for various diseases, incorporating polygenic risk scores (PRSs) for other traits and diseases into the PRS-based risk prediction model may improve predictive performance – known as Multiple Polygenic Score (MPS) approach. We aimed to examine whether the MPS approach improves colorectal cancer (CRC) risk prediction. We included 2,187 non-CRC PRSs from the polygenic Score (PGS) Catalog and used machine learning (ML) models to select the most predictive non-CRC PRSs, utilizing individual-level data from 31,257 CRC cases and 33,408 controls. An independent dataset from the Genetic Epidemiology Research in Adult Health and Aging (GERA) cohort (4,852 cases and 67,939 controls) was randomly split into subsets for model estimation and validation. The model combined MPS with two existing CRC-PRSs based on known loci and genome-wide genotyping. We then assessed model performance by calculating the area under the receiver operating curve (AUC) in the validation set and performed 1,000 bootstrapped iterations to evaluate AUC improvements. The ML model selected 337 non-CRC PRSs predictive of CRC risk. Adding MPS to the CRC-PRSs significantly improved AUC by 0.017 (95% CI: 0.011–0.022, p < 0.0001) when combined with known-loci CRC-PRS, 0.005 (95% CI: 0.002–0.007, p = 0.0005) with genome-wide CRC-PRS, and 0.004 (95% CI: 0.002–0.006, p = 0.0005) with both the known loci and genome-wide CRC-PRSs. These findings demonstrate MPS’s potential to refine CRC risk prediction models and highlight opportunities for further advancements in risk prediction.
AB - Recent studies have demonstrated that for various diseases, incorporating polygenic risk scores (PRSs) for other traits and diseases into the PRS-based risk prediction model may improve predictive performance – known as Multiple Polygenic Score (MPS) approach. We aimed to examine whether the MPS approach improves colorectal cancer (CRC) risk prediction. We included 2,187 non-CRC PRSs from the polygenic Score (PGS) Catalog and used machine learning (ML) models to select the most predictive non-CRC PRSs, utilizing individual-level data from 31,257 CRC cases and 33,408 controls. An independent dataset from the Genetic Epidemiology Research in Adult Health and Aging (GERA) cohort (4,852 cases and 67,939 controls) was randomly split into subsets for model estimation and validation. The model combined MPS with two existing CRC-PRSs based on known loci and genome-wide genotyping. We then assessed model performance by calculating the area under the receiver operating curve (AUC) in the validation set and performed 1,000 bootstrapped iterations to evaluate AUC improvements. The ML model selected 337 non-CRC PRSs predictive of CRC risk. Adding MPS to the CRC-PRSs significantly improved AUC by 0.017 (95% CI: 0.011–0.022, p < 0.0001) when combined with known-loci CRC-PRS, 0.005 (95% CI: 0.002–0.007, p = 0.0005) with genome-wide CRC-PRS, and 0.004 (95% CI: 0.002–0.006, p = 0.0005) with both the known loci and genome-wide CRC-PRSs. These findings demonstrate MPS’s potential to refine CRC risk prediction models and highlight opportunities for further advancements in risk prediction.
UR - https://www.scopus.com/pages/publications/105020397713
U2 - 10.1038/s41598-025-21956-w
DO - 10.1038/s41598-025-21956-w
M3 - Article
C2 - 41168411
AN - SCOPUS:105020397713
SN - 2045-2322
VL - 15
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 38006
ER -