Member, Maternal & Child Health Research Institute (MCHRI)
Doctor of Science, Julius Maximilians Univsitat (2018)
Diplom, Julius Maximilians Univsitat (2011)
The incidence of autism spectrum disorder (ASD) has been rising, however ASD-risk biomarkers remain lacking. We previously identified the presence of maternal autoantibodies to fetal brain proteins specific to ASD, now termed maternal autoantibody-related (MAR) ASD. The current study aimed to create and validate a serological assay to identify ASD-specific maternal autoantibody patterns of reactivity against eight previously identified proteins (CRMP1, CRMP2, GDA, NSE, LDHA, LDHB, STIP1, and YBOX) that are highly expressed in developing brain, and determine the relationship of these reactivity patterns with ASD outcome severity. We used plasma from mothers of children diagnosed with ASD (n?=?450) and from typically developing children (TD, n?=?342) to develop an ELISA test for each of the protein antigens. We then determined patterns of reactivity a highly significant association with ASD, and discovered several patterns that were ASD-specific (18% in the training set and 10% in the validation set vs. 0% TD). The three main patterns associated with MAR ASD are CRMP1?+?GDA (ASD%?=?4.2 vs. TD%?=?0, OR 31.04, p?=?<0.0001), CRMP1?+?CRMP2 (ASD%?=?3.6 vs. TD%?=?0, OR 26.08, p?=?0.0005) and NSE?+?STIP1 (ASD%?=?3.1 vs. TD%?=?0, OR 22.82, p?=?0.0001). Additionally, we found that maternal autoantibody reactivity to CRMP1 significantly increases the odds of a child having a higher Autism Diagnostic Observation Schedule (ADOS) severity score (OR 2.3; 95% CI: 1.358-3.987, p?=?0.0021). This is the first report that uses machine learning subgroup discovery to identify with 100% accuracy MAR ASD-specific patterns as potential biomarkers of risk for a subset of up to 18% of ASD cases in this study population.
View details for DOI 10.1038/s41380-020-00998-8
View details for PubMedID 33483694
View details for Web of Science ID 000525432600082
The dense network of interconnected cellular signalling responses that are quantifiable in peripheral immune cells provides a wealth of actionable immunological insights. Although high-throughput single-cell profiling techniques, including polychromatic flow and mass cytometry, have matured to a point that enables detailed immune profiling of patients in numerous clinical settings, the limited cohort size and high dimensionality of data increase the possibility of false-positive discoveries and model overfitting. We introduce a generalizable machine learning platform, the immunological Elastic-Net (iEN), which incorporates immunological knowledge directly into the predictive models. Importantly, the algorithm maintains the exploratory nature of the high-dimensional dataset, allowing for the inclusion of immune features with strong predictive capabilities even if not consistent with prior knowledge. In three independent studies our method demonstrates improved predictions for clinically relevant outcomes from mass cytometry data generated from whole blood, as well as a large simulated dataset. The iEN is available under an open-source licence.
View details for DOI 10.1038/s42256-020-00232-8
View details for PubMedID 33294774
View details for PubMedCentralID PMC7720904
High-throughput single-cell analysis technologies produce an abundance of data that is critical for profiling the heterogeneity of cellular systems. We introduce VoPo (https://github.com/stanleyn/VoPo), a machine learning algorithm for predictive modeling and comprehensive visualization of the heterogeneity captured in large single-cell datasets. In three mass cytometry datasets, with the largest measuring hundreds of millions of cells over hundreds of samples, VoPo defines phenotypically and functionally homogeneous cell populations. VoPo further outperforms state-of-the-art machine learning algorithms in classification tasks, and identified immune-correlates of clinically-relevant parameters.
View details for DOI 10.1038/s41467-020-17569-8
View details for PubMedID 32719375
Worldwide, preterm birth (PTB) is the single largest cause of deaths in the perinatal and neonatal period and is associated with increased morbidity in young children. The cause of PTB is multifactorial, and the development of generalizable biological models may enable early detection and guide therapeutic studies.To investigate the ability of transcriptomics and proteomics profiling of plasma and metabolomics analysis of urine to identify early biological measurements associated with PTB.This diagnostic/prognostic study analyzed plasma and urine samples collected from May 2014 to June 2017 from pregnant women in 5 biorepository cohorts in low- and middle-income countries (LMICs; ie, Matlab, Bangladesh; Lusaka, Zambia; Sylhet, Bangladesh; Karachi, Pakistan; and Pemba, Tanzania). These cohorts were established to study maternal and fetal outcomes and were supported by the Alliance for Maternal and Newborn Health Improvement and the Global Alliance to Prevent Prematurity and Stillbirth biorepositories. Data were analyzed from December 2018 to July 2019.Blood and urine specimens that were collected early during pregnancy (median sampling time of 13.6 weeks of gestation, according to ultrasonography) were processed, stored, and shipped to the laboratories under uniform protocols. Plasma samples were assayed for targeted measurement of proteins and untargeted cell-free ribonucleic acid profiling; urine samples were assayed for metabolites.The PTB phenotype was defined as the delivery of a live infant before completing 37 weeks of gestation.Of the 81 pregnant women included in this study, 39 had PTBs (48.1%) and 42 had term pregnancies (51.9%) (mean [SD] age of 24.8 [5.3] years). Univariate analysis demonstrated functional biological differences across the 5 cohorts. A cohort-adjusted machine learning algorithm was applied to each biological data set, and then a higher-level machine learning modeling combined the results into a final integrative model. The integrated model was more accurate, with an area under the receiver operating characteristic curve (AUROC) of 0.83 (95% CI, 0.72-0.91) compared with the models derived for each independent biological modality (transcriptomics AUROC, 0.73 [95% CI, 0.61-0.83]; metabolomics AUROC, 0.59 [95% CI, 0.47-0.72]; and proteomics AUROC, 0.75 [95% CI, 0.64-0.85]). Primary features associated with PTB included an inflammatory module as well as a metabolomic module measured in urine associated with the glutamine and glutamate metabolism and valine, leucine, and isoleucine biosynthesis pathways.This study found that, in LMICs and high PTB settings, major biological adaptations during term pregnancy follow a generalizable model and the predictive accuracy for PTB was augmented by combining various omics data sets, suggesting that PTB is a condition that manifests within multiple biological systems. These data sets, with machine learning partnerships, may be a key step in developing valuable predictive tests and intervention candidates for preventing PTB.
View details for DOI 10.1001/jamanetworkopen.2020.29655
View details for PubMedID 33337494