A deep-learning workflow to predict upper tract urothelial carcinoma protein-based subtypes from H&E slides supporting the prioritization of patients for molecular testing

Miriam Angeloni, Thomas van Doeveren, Sebastian Lindner, Patrick Volland, Jorina Schmelmer, Sebastian Foersch, Christian Matek, Robert Stoehr, Carol I. Geppert, Hendrik Heers, Sven Wach, Helge Taubert, Danijel Sikic, Bernd Wullich, Geert J.L.H. van Leenders, Vasily Zaburdaev, Markus Eckstein, Arndt Hartmann, Joost L. Boormans, Fulvia Ferrazzi*Veronika Bahlinger*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

16 Downloads (Pure)

Abstract

Upper tract urothelial carcinoma (UTUC) is a rare and aggressive, yet understudied, urothelial carcinoma (UC). The more frequent UC of the bladder comprises several molecular subtypes, associated with different targeted therapies and overlapping with protein-based subtypes. However, if and how these findings extend to UTUC remains unclear. Artificial intelligence-based approaches could help elucidate UTUC's biology and extend access to targeted treatments to a wider patient audience. Here, UTUC protein-based subtypes were identified, and a deep-learning (DL) workflow was developed to predict them directly from routine histopathological H&E slides. Protein-based subtypes in a retrospective cohort of 163 invasive tumors were assigned by hierarchical clustering of the immunohistochemical expression of three luminal (FOXA1, GATA3, and CK20) and three basal (CD44, CK5, and CK14) markers. Cluster analysis identified distinctive luminal (N = 80) and basal (N = 42) subtypes. The luminal subtype mostly included pushing, papillary tumors, whereas the basal subtype diffusely infiltrating, non-papillary tumors. DL model building relied on a transfer-learning approach by fine-tuning a pre-trained ResNet50. Classification performance was measured via three-fold repeated cross-validation. A mean area under the receiver operating characteristic curve of 0.83 (95% CI: 0.67–0.99), 0.8 (95% CI: 0.62–0.99), and 0.81 (95% CI: 0.65–0.96) was reached in the three repetitions. High-confidence DL-based predicted subtypes showed significant associations (p < 0.001) with morphological features, i.e. tumor type, histological subtypes, and infiltration type. Furthermore, a significant association was found with programmed cell death ligand 1 (PD-L1) combined positive score (p < 0.001) and FGFR3 mutational status (p = 0.002), with high-confidence basal predictions containing a higher proportion of PD-L1 positive samples and high-confidence luminal predictions a higher proportion of FGFR3-mutated samples. Testing of the DL model on an independent cohort highlighted the importance to accommodate histological subtypes. Taken together, our DL workflow can predict protein-based UTUC subtypes, associated with the presence of targetable alterations, directly from H&E slides.

Original languageEnglish
Article numbere12369
JournalJournal of Pathology: Clinical Research
Volume10
Issue number2
DOIs
Publication statusPublished - Mar 2024

Bibliographical note

Publisher Copyright:
© 2024 The Authors. The Journal of Pathology: Clinical Research published by The Pathological Society of Great Britain and Ireland and John Wiley & Sons Ltd.

Fingerprint

Dive into the research topics of 'A deep-learning workflow to predict upper tract urothelial carcinoma protein-based subtypes from H&E slides supporting the prioritization of patients for molecular testing'. Together they form a unique fingerprint.

Cite this