Preprint
GestaltMatcher Database - A global reference for facial phenotypic variability in rare human diseases
Research square
06/10/2024
DOI: 10.21203/rs.3.rs-4438861/v1
PMCID: PMC11188141
PMID: 38903062
Abstract
The most important factor that complicates the work of dysmorphologists is the significant phenotypic variability of the human face. Next-Generation Phenotyping (NGP) tools that assist clinicians with recognizing characteristic syndromic patterns are particularly challenged when confronted with patients from populations different from their training data. To that end, we systematically analyzed the impact of genetic ancestry on facial dysmorphism. For that purpose, we established the GestaltMatcher Database (GMDB) as a reference dataset for medical images of patients with rare genetic disorders from around the world. We collected 10,980 frontal facial images - more than a quarter previously unpublished - from 8,346 patients, representing 581 rare disorders. Although the predominant ancestry is still European (67%), data from underrepresented populations have been increased considerably via global collaborations (19% Asian and 7% African). This includes previously unpublished reports for more than 40% of the African patients. The NGP analysis on this diverse dataset revealed characteristic performance differences depending on the composition of training and test sets corresponding to genetic relatedness. For clinical use of NGP, incorporating non-European patients resulted in a profound enhancement of GestaltMatcher performance. The top-5 accuracy rate increased by +11.29%. Importantly, this improvement in delineating the correct disorder from a facial portrait was achieved without decreasing the performance on European patients. By design, GMDB complies with the FAIR principles by rendering the curated medical data findable, accessible, interoperable, and reusable. This means GMDB can also serve as data for training and benchmarking. In summary, our study on facial dysmorphism on a global sample revealed a considerable cross ancestral phenotypic variability confounding NGP that should be counteracted by international efforts for increasing data diversity. GMDB will serve as a vital reference database for clinicians and a transparent training set for advancing NGP technology.The most important factor that complicates the work of dysmorphologists is the significant phenotypic variability of the human face. Next-Generation Phenotyping (NGP) tools that assist clinicians with recognizing characteristic syndromic patterns are particularly challenged when confronted with patients from populations different from their training data. To that end, we systematically analyzed the impact of genetic ancestry on facial dysmorphism. For that purpose, we established the GestaltMatcher Database (GMDB) as a reference dataset for medical images of patients with rare genetic disorders from around the world. We collected 10,980 frontal facial images - more than a quarter previously unpublished - from 8,346 patients, representing 581 rare disorders. Although the predominant ancestry is still European (67%), data from underrepresented populations have been increased considerably via global collaborations (19% Asian and 7% African). This includes previously unpublished reports for more than 40% of the African patients. The NGP analysis on this diverse dataset revealed characteristic performance differences depending on the composition of training and test sets corresponding to genetic relatedness. For clinical use of NGP, incorporating non-European patients resulted in a profound enhancement of GestaltMatcher performance. The top-5 accuracy rate increased by +11.29%. Importantly, this improvement in delineating the correct disorder from a facial portrait was achieved without decreasing the performance on European patients. By design, GMDB complies with the FAIR principles by rendering the curated medical data findable, accessible, interoperable, and reusable. This means GMDB can also serve as data for training and benchmarking. In summary, our study on facial dysmorphism on a global sample revealed a considerable cross ancestral phenotypic variability confounding NGP that should be counteracted by international efforts for increasing data diversity. GMDB will serve as a vital reference database for clinicians and a transparent training set for advancing NGP technology.
Details
- Title: Subtitle
- GestaltMatcher Database - A global reference for facial phenotypic variability in rare human diseases
- Creators
- Hellen Lesmann - University Hospital BonnAlexander Hustinx - University Hospital BonnShahida Moosa - Stellenbosch UniversityHannah Klinkhammer - University Hospital BonnElaine Marchi - New York State Office for People With Developmental DisabilitiesMaria Del Pilar Caro MartinIbrahim Abdelrazek - Alexandria UniversityJean Tori Pantel - RWTH Aachen UniversityMerle Ten Hagen - University of BonnMeow-Keong ThongRifhan Azwani Binti Mazlan - University of MalayaSok Kun Tae - University of MalayaTom KamphansWolfgang MeiswinkelJing-Mei LiBehnam Javanmardi - University Hospital BonnAnnette Uwineza - University of KigaliCordula KnoppAlexej Knaus - University of BonnTinatin Tkemaladze - Tbilisi State Medical UniversityMiriam ElbrachtLarissa Mattern - RWTH Aachen UniversityRami Abou Jamra - University Medical CenterClara Velmans - University Hospital CologneVincent Strehlow - University Medical CenterMaureen Jacob - TUM KlinikumAngela Peron - Meyer Children's HospitalCristina Dias - King's College LondonBeatriz Nunes - Morpho (United States)Thainá Vilella - Morpho (United States)Isabel Pinheiro - Morpho (United States)Chong Kim - Universidade de São PauloMaria Melaragno - Universidade Federal de São PauloHannah Weiland - University of BonnSophia Kaptain - University of BonnKarolina Chwiałkowska - Medical University of BiałystokMiroslaw KwasniewskiRamy SaadSarah Wiethoff - University College LahoreHimanshu Goel - University of Newcastle AustraliaClara Tang - Rome FoundationAnna HauTahsin Stefan Barakat - Erasmus MCPrzemysław PanekAmira Nabil - Alexandria UniversityJulia Suh - RWTH Aachen UniversityFrederik Braun - Institute of Human GeneticsIsrael Gomy - Universidade de São PauloLuisa Averdunk - Heinrich Heine University DüsseldorfEkanem Ekure - University of LagosGaber Bergant - Ljubljana University Medical CentreBorut Peterlin - Ljubljana University Medical CentreClaudio GrazianoNagwa GaboonMoisés Fiesco-Roa - Instituto Nacional de PediatriaAlessandro SpinelliNina-Maria Wilpert - Charité - Universitätsmedizin BerlinPrasit Phowthongkum - King Chulalongkorn Memorial HospitalNergis GüzelTobias Haack - University of TübingenRana Bitar - Shaikh Khalifa Medical CityAndreas Tzschach - University Medical Center FreiburgAgusti Rodriguez-PalmeroTheresa Brunet - Technical University of MunichSabine Rudnik-SchönebornSilvina Contreras-CapetilloAva Oberlack - TUM KlinikumCarole Samango-SprouseTeresa Sadeghin - Micro Focus (United States)Margaret Olaya - Micro Focus (United States)Konrad PlatzerArtem Borovikov - Research Centre for Medical GeneticsFranziska Schnabel - Institute of Human GeneticsLara Heuft - Institute of Human GeneticsVera HerrmannRenske Oegema - University Medical Center UtrechtNour Elkhateeb - Cambridge University Hospitals NHS Foundation TrustSheetal Kumar - University Hospital BonnKatalin Komlosi - University of BonnKhoushoua MohamedSilvia Kalantari - University of PaviaFabio Sirchia - University of PaviaAntonio Martinez-Monseny - Children's HospitalMatthias Höller - Institute of Human GeneticsAmal MohamedAmaia Lasa-AranzastiJohn Sayer - Newcastle UniversityNadja Ehmke - Charité - Universitätsmedizin BerlinMagdalena Danyel - Charité - Universitätsmedizin BerlinHenrike Sczakiel - Humboldt-Universität zu BerlinSarina Schwartzmann - Charité - Universitätsmedizin BerlinFelix Boschann - Charité - Universitätsmedizin BerlinMax Zhao - Charité - Universitätsmedizin BerlinRonja AdamsLara Einicke - Charité - Universitätsmedizin BerlinDenise Horn - Charité - Universitätsmedizin BerlinKee Seang Chew - University of MalayaChoy Chen Kam - University of MalayaMiray Karakoyun - Ege UniversityBen Pode-ShakkedAviva Eliyahu - Sheba Medical CenterRachel Rock - Edmond and Lily Safra Children's HospitalTeresa Carrion - Hospital Universitario Son EspasesOdelia Chorin - Sheba Medical CenterYuri Zarate - University of Arkansas for Medical SciencesMarcello MartinezMert Karakaya - University of CologneAimé LumakaMarwan Shinawi - Washington University in St. LouisPatrick Blackburn - St. Jude Children's Research HospitalTianyun Wang - Peking UniversityTim Niehues - Helios Klinikum KrefeldPing Hu - National Human Genome Research InstituteRebekah Waikel - National Human Genome Research InstituteSuzanna Ledgister Hanchard - National Human Genome Research InstituteGehad Elmakkawy - Alexandria UniversitySylvia Safwat - Alexandria UniversityFrédéric Ebstein - Nantes UniversitéElke Krüger - Universität GreifswaldSébastien Küry - Centre Hospitalier Universitaire de NantesStephane Bezieau - Centre Hospitalier Universitaire de NantesAnnabelle Arlt - University of BonnFelix Marbach - Heidelberg UniversityDong Li - Children's Hospital of PhiladelphiaLucie Dupuis - SickKids FoundationRoberto Mendoza-LondonoSofia Douzgou HougeDenisa Weis - Johannes Kepler University of LinzBrian ChungChristopher MakNursel Elcioglu - Marmara UniversityAyca Aykut - Ege UniversityPeli Şimşek-KiperNina Bögershausen - Universitätsmedizin GöttingenBernd Wollnik - Universitätsmedizin GöttingenHeidi Beate Bentzen - University of OsloIngo Kurth - RWTH Aachen UniversityChristian Netzer - University of CologneAleksandra Jezela-StanekKoen DevriendtKaren Gripp - Community Health Systems - Dupont HospitalMartin Mücke - RWTH Aachen UniversityAlain Verloes - Hôpital Robert-DebréChristian Schaaf - Institute of Human GeneticsChristoffer Nellåker - University of OxfordBenjamin Solomon - National Human Genome Research InstituteMarkus Nöthen - University of BonnEbtesam Abdalla - Alexandria UniversityGholson LyonPeter Krawitz - University Hospital BonnHulya Kayserili - Koç UniversityLouiza Toutouna - University Medical Center FreiburgAxel SchmidtRegina RothDagmar Wieczorek - Heinrich Heine University DüsseldorfEric Olinger - Cliniques Universitaires Saint-LucBharatendu Chandra - University of Iowa, Stead Family Department of PediatricsMoon Ley Tung - University of IowaTzung-Chien Hsieh
- Resource Type
- Preprint
- Publication Details
- Research square
- DOI
- 10.21203/rs.3.rs-4438861/v1
- PMID
- 38903062
- PMCID
- PMC11188141
- Language
- English
- Date posted
- 06/10/2024
- Academic Unit
- Stead Family Department of Pediatrics; Medical Genetics and Genomics
- Record Identifier
- 9984643755402771
Metrics
67 Record Views