Multi-scale data fusion
The main focus of the Gevaert lab is biomedical data fusion: the development of machine learning methods for biomedical decision support using multi-scale biomedical data. Our earlier work focused on Bayesian and kernel methods for studying breast and ovarian cancer. Subsequent work concerned the development of methods for multi-omics data fusion. This resulted in MethylMix, which identifies differentially methylated, methylation-driven genes, and AMARETTO, a computational method that integrates DNA methylation, copy number and gene expression data to identify cancer modules. Additionally, the lab works on multi-scale data fusion by linking omics data with cellular and tissue-level phenotypes. This led to key contributions in the field of imaging genomics (radiogenomics), involving work in lung cancer and brain tumors. The work in imaging genomics is focused on developing a framework for non-invasive personalized medicine. In summary, the Gevaert lab has an interdisciplinary focus on developing novel algorithms for multi-scale biomedical data fusion.
Multi-omics data fusion
The lab has developed multiple frameworks for biomedical data fusion. Initially, we developed an extensive framework using Bayesian algorithms. Next, we included algorithms using support vector machines and kernel methods. We then expanded our work into regularized regression approaches to build data fusion methods for multi-omics data. More recently, we developed AMARETTO and CaMoDi, two algorithms for multi-omics data fusion. They model gene expression, DNA copy number and DNA methylation data, represent the result as cancer modules, and have been shown to outperform existing methods. Currently, we are linking multi-omics data across scales with cellular and imaging phenotypes, extending our work towards multi-scale biomedical data fusion.
Molecular profiles of tumors are now used to determine prognosis and to guide therapy. For example, the presence of a mutation in the EGFR gene will most likely lead to anti-EGFR therapy. Recently, an image phenotype was discovered that acts as a biomarker of EGFR mutation. This finding points to the possibilities of an emerging field called radiogenomics, defined as directly linking imaging features to underlying molecular properties, and it shows the potential of image signatures that reflect molecular properties of diseases. When these molecular properties are actionable, non-invasive precision medicine becomes a reality.
Read more about an example studying glioblastoma here.
Our goal is to study the creation of comprehensive and expressive metadata for biomedical datasets to facilitate data discovery, data interpretation, and data reuse. By facilitating the submission of high-quality metadata describing biomedical experiments, our framework enables experiments to be verified and disparate data to be integrated. It is imperative to make the authoring of high-quality metadata a manageable task. In our work, we adopt a data-driven approach to value set recommendation. We use the metadata that investigators have already defined to infer the attributes that are commonly used in metadata submissions. We then use these data to recommend values for newly entered metadata fields that need to be filled in during data submission. Similarly, we use our methods to enhance the metadata of existing repositories.
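As a rough illustration of the data-driven idea described above, the sketch below counts which values earlier submissions used for each attribute and suggests the most frequent ones for a new entry. It is a minimal sketch under stated assumptions, not the lab's actual method: the function names, the record layout (one dictionary per submission) and the lowercase normalization are all hypothetical choices made for this example.

```python
from collections import Counter, defaultdict


def build_value_index(records):
    """Count how often each value was used for each attribute.

    `records` is assumed to be a list of dicts, one per earlier submission
    (a hypothetical layout chosen for this sketch).
    """
    index = defaultdict(Counter)
    for record in records:
        for attribute, value in record.items():
            normalized = value.strip().lower()  # assumed normalization step
            if normalized:  # skip empty entries
                index[attribute][normalized] += 1
    return index


def recommend_values(index, attribute, top_n=3):
    """Return up to `top_n` values most often used for `attribute`."""
    counts = index.get(attribute, Counter())
    return [value for value, _ in counts.most_common(top_n)]


# Made-up example submissions, for illustration only.
records = [
    {"tissue": "Lung", "disease": "adenocarcinoma"},
    {"tissue": "lung", "disease": "glioblastoma"},
    {"tissue": "brain", "disease": "glioblastoma"},
    {"tissue": "", "disease": "Glioblastoma "},
]

index = build_value_index(records)
print(recommend_values(index, "tissue"))
print(recommend_values(index, "disease", top_n=1))
```

A frequency count is the simplest possible recommender. A fuller approach could also take into account the values already entered for other attributes in the same submission.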
More about our recent results in this area here.
- Brain tumors: gliomas and glioblastoma
- Head and neck cancer
- Hepatocellular carcinoma