GestaltMatcher Database - A global reference for facial phenotypic variability in rare human diseases

Tzung-Chien Hsieh, Hellen Lesmann, Alexander Hustinx, Shahida Moosa, Elaine Marchi, Maria Del Pilar Caro Martin, Ibrahim Abdelrazek, Jean Tori Pantel, Hannah Klinkhammer, Merle Ten Hagen, Meow-Keong Thong, Rifhan Azwani Binti Mazlan, Sok Kun Tae, Tom Kamphans, Wolfgang Meiswinkel, Behnam Javanmardi, Alexej Knaus, Annette Uwineza, Cordula Knopp, Tinatin TkemaladzeMiriam Elbracht, Larissa Mattern, Rami Abou Jamra, Clara Velmans, Vincent Strehlow, Maureen Jacob, Angela Peron, Cristina Dias, Beatriz Nunes, Thainá Vilella, Isabel Pinheiro, Chong Kim, Maria Melaragno, Hannah Weiland, Sophia Kaptain, Karolina Chwiałkowska, Miroslaw Kwasniewski, Ramy Saad, Sarah Wiethoff, Himanshu Goel, Clara Tang, Anna Hau, Tahsin Stefan Barakat, Przemysław Panek, Amira Nabil, Julia Suh, Frederik Braun, Israel Gomy, Luisa Averdunk, Ekanem Ekure, Gaber Bergant, Borut Peterlin, Claudio Graziano, Nagwa Gaboon, Moisés Fiesco-Roa, Alessandro Spinelli, Nina-Maria Wilpert, Prasit Phowthongkum, Nergis Güzel, Tobias Haack, Rana Bitar, Andreas Tzschach, Agusti Rodriguez-Palmero, Theresa Brunet, Sabine Rudnik-Schöneborn, Silvina Contreras-Capetillo, Ava Oberlack, Carole Samango-Sprouse, Teresa Sadeghin, Margaret Olaya, Konrad Platzer, Artem Borovikov, Franziska Schnabel, Lara Heuft, Vera Herrmann, Nour Elkhateeb, Sheetal Kumar, Katalin Komlosi, Khoushoua Mohamed, Silvia Kalantari, Fabio Sirchia, Antonio Martinez-Monseny, Matthias Höller, Amal Mohamed, Amaia Lasa-Aranzasti, John Sayer, Nadja Ehmke, Magdalena Danyel, Henrike Sczakiel, Sarina Schwartzmann, Felix Boschann, Max Zhao, Ronja Adams, Lara Einicke, Kee Seang Chew, Choy Chen Kam, Miray Karakoyun, Ben Pode-Shakked, Aviva Eliyahu, Rachel Rock, Teresa Carrion, Odelia Chorin, Yuri Zarate, Marcello Martinez, Mert Karakaya, Moon Ley Tung, Bharatendu Chandra, Aimé Lumaka, Marwan Shinawi, Patrick Blackburn, Tianyun Wang, Tim Niehues, Ping Hu, Rebekah Waikel, Suzanna Ledgister Hanchard, Gehad Elmakkawy, Sylvia Safwat, Frédéric Ebstein, Elke Krüger, Sébastien Küry, Stephane Bezieau, Annabelle Arlt, Felix Marbach, Dong Li, Lucie Dupuis, Roberto Mendoza-Londono, Sofia Douzgou Houge, Denisa Weis, Christopher Mak, Nursel Elcioglu, Ayca Aykut, Peli Şimşek-Kiper, Nina Bögershausen, Bernd Wollnik, Heidi Beate Bentzen, Ingo Kurth, Christian Netzer, Aleksandra Jezela-Stanek, Koen Devriendt, Karen Gripp, Martin Mücke, Alain Verloes, Christian Schaaf, Christoffer Nellåker, Benjamin Solomon, Markus Nöthen, Ebtesam Abdalla, Gholson Lyon*, Peter Krawitz, Hulya Kayserili, Louiza Toutouna, Axel Schmidt, Regina Roth, Dagmar Wieczorek, Eric Olinger

*Corresponding author for this work

Research output: Contribution to journalArticleAcademic

22 Downloads (Pure)

Abstract

The most important factor that complicates the work of dysmorphologists is the significant phenotypic variability of the human face. Next-Generation Phenotyping (NGP) tools that assist clinicians with recognizing characteristic syndromic patterns are particularly challenged when confronted with patients from populations different from their training data. To that end, we systematically analyzed the impact of genetic ancestry on facial dysmorphism. For that purpose, we established the GestaltMatcher Database (GMDB) as a reference dataset for medical images of patients with rare genetic disorders from around the world. We collected 10,980 frontal facial images - more than a quarter previously unpublished - from 8,346 patients, representing 581 rare disorders. Although the predominant ancestry is still European (67%), data from underrepresented populations have been increased considerably via global collaborations (19% Asian and 7% African). This includes previously unpublished reports for more than 40% of the African patients. The NGP analysis on this diverse dataset revealed characteristic performance differences depending on the composition of training and test sets corresponding to genetic relatedness. For clinical use of NGP, incorporating non-European patients resulted in a profound enhancement of GestaltMatcher performance. The top-5 accuracy rate increased by +11.29%. Importantly, this improvement in delineating the correct disorder from a facial portrait was achieved without decreasing the performance on European patients. By design, GMDB complies with the FAIR principles by rendering the curated medical data findable, accessible, interoperable, and reusable. This means GMDB can also serve as data for training and benchmarking. In summary, our study on facial dysmorphism on a global sample revealed a considerable cross ancestral phenotypic variability confounding NGP that should be counteracted by international efforts for increasing data diversity. GMDB will serve as a vital reference database for clinicians and a transparent training set for advancing NGP technology.

Original languageEnglish
JournalResearch Square
DOIs
Publication statusPublished - 10 Jun 2024

Fingerprint

Dive into the research topics of 'GestaltMatcher Database - A global reference for facial phenotypic variability in rare human diseases'. Together they form a unique fingerprint.

Cite this