Abstract
BACKGROUND: Congenital Central Hypoventilation Syndrome (CCHS) has devastating consequences if not diagnosed promptly. Despite identification of the disease-defining gene PHOX2B and a facial phenotype, CCHS remains underdiagnosed. This study aimed to incorporate automated techniques on facial photos to screen for CCHS in a diverse pediatric cohort to improve early case identification and assess a facial phenotype-PHOX2B genotype relationship. METHODS: Facial photos of children and young adults with CCHS were control-matched by age, sex, race/ethnicity. After validating landmarks, principal component analysis (PCA) was applied with logistic regression (LR) for feature attribution and machine learning models for subject classification and assessment by PHOX2B pathovariant. RESULTS: Gradient-based feature attribution confirmed a subtle facial phenotype and models were successful in classifying CCHS: neural network performed best (median sensitivity 90% (IQR 84%, 95%)) on 179 clinical photos (versus LR and XGBoost, both 85% (IQR 75-76%, 90%)). Outcomes were comparable stratified by PHOX2B genotype and with the addition of publicly available CCHS photos (n = 104) using PCA and LR (sensitivity 83-89% (IQR 67-76%, 92-100%). CONCLUSIONS: Utilizing facial features, findings suggest an automated, accessible classifier may be used to screen for CCHS in children with the phenotype and support providers to seek PHOX2B testing to improve the diagnostics. IMPACT: Facial landmarking and principal component analysis on a diverse pediatric and young adult cohort with PHOX2B pathovariants delineated a distinct, subtle CCHS facial phenotype. Automated, low-cost machine learning models can detect a CCHS facial phenotype with a high sensitivity in screening to ultimately refer for disease-defining PHOX2B testing, potentially addressing gaps in disease underdiagnosis and allow for critical, timely intervention.