We are pleased to invite you to the next seminar within our traditional Seminar series in Statistics and Data Science
Speaker: Zhi Zhao, Postdoc at Radiumhospitalet, Oslo University Hospital
Title: Multivariate Bayesian variable selection in high-dimensional settings
When? TUESDAY, 24.05.2021, 14:15-15:15
Where? Erling Svedrups plass and Zoom https://uio.zoom.us/j/68227870839?pwd=SDRobDVrNU9xUDVyM3Zvc0wyeUwrQT09
Abstract:
Precision cancer medicine aims to determine the optimal treatment for each patient. In-vitro cancer drug sensitivity screens combined with multi-omics characterization of the cancer cells has become an important tool to achieve this aim. Analyzing such pharmacogenomic studies requires flexible and efficient joint statistical models for associating drug sensitivity with high-dimensional multi-omics data. We propose a structured multivariate Bayesian variable selection modelling framework for sparse identification of omics features associated with multiple correlated drug responses. We have provided an efficient implementation of a class of models in the BayesSUR R package (https://CRAN.R-project.org/package=BayesSUR). BayesSUR allows the specification of the models in a modular way, where the user chooses among three priors for variable selection and among three priors for covariance selection separately. Since many anti-cancer drugs are designed for specific molecular targets, our approach can make use of known structure between responses and predictors, e.g. molecular pathways and related omics features targeted by specific drugs, via a Markov-random-field (MRF) prior for the latent variable selection indicators of the coefficient matrix in sparse seemingly unrelated regression (SUR). The structure information included in the MRF prior can improve the model performance, i.e. variable selection and response prediction, compared to other common priors. The proposed approach is validated by simulation studies and applied to data from the Genomics of Drug Sensitivity in Cancer database, which includes pharmacological profiling and multi-omics characterization of a large set of heterogeneous cell lines. Finally, as an alternative to the SUR setup of the Bayesian models, we also suggest Gaussian copula models for multivariate responses of diverse types for identifying important variables from high-dimensional covariates.
Welcome!
Best regards,
Sven Ove Samuelsen & Aliaksandr Hubin