Decomposing feature-level variation with Covariate Gaussian Process Latent Variable Models

Published on May 24, 2019 in ICML (International Conference on Machine Learning)
Kaspar Märtens5
Estimated H-index: 5
(University of Oxford),
Kieran R. Campbell13
Estimated H-index: 13
(UBC: University of British Columbia),
Christopher Yau28
Estimated H-index: 28
(University of Birmingham)
The interpretation of complex high-dimensional data typically requires the use of dimensionality reduction techniques to extract explanatory low-dimensional representations. However, in many real-world problems these representations may not be sufficient to aid interpretation on their own, and it would be desirable to interpret the model in terms of the original features themselves. Our goal is to characterise how feature-level variation depends on latent low-dimensional representations, external covariates, and non-linear interactions between the two. In this paper, we propose to achieve this through a structured kernel decomposition in a hybrid Gaussian Process model which we call the Covariate Gaussian Process Latent Variable Model (c-GPLVM). We demonstrate the utility of our model on simulated examples and applications in disease progression modelling from high-dimensional gene expression data in the presence of additional phenotypes. In each setting we show how the c-GPLVM can extract low-dimensional structures from high-dimensional data sets whilst allowing a breakdown of feature-level variability that is not present in other commonly used dimensionality reduction approaches.
📖 Papers frequently viewed together
4 Citations
6 Citations
Cited By5
#1Haitao Liu (DUT: Dalian University of Technology)H-Index: 14
#2Yew-Soon Ong (NTU: Nanyang Technological University)H-Index: 59
Last. Xiaofang Wang (DUT: Dalian University of Technology)H-Index: 5
view all 4 authors...
Abstract null null For a learning task, Gaussian process (GP) is interested in learning the statistical relationship between inputs and outputs, since it offers not only the prediction mean but also the associated variability. The vanilla GP however is hard to learn complicated distribution with the property of, e.g., heteroscedastic noise, multi-modality and non-stationarity, from massive data due to the Gaussian marginal and the cubic complexity. To this end, this article studies new scalable ...
Using observational data to estimate the effect of a treatment is a powerful tool for decision-making when randomized experiments are infeasible or costly. However, observational data often yields biased estimates of treatment effects, since treatment assignment can be confounded by unobserved variables. A remedy is offered by deconfounding methods that adjust for such unobserved confounders. In this paper, we develop the Sequential Deconfounder, a method that enables estimating individualized t...
1 Citations
Oct 24, 2020 in IROS (Intelligent Robots and Systems)
#1Juan Antonio Delgado-Guerrero (CSIC: Spanish National Research Council)H-Index: 1
#2Adrià Colomé (CSIC: Spanish National Research Council)H-Index: 7
Last. Carme Torras (CSIC: Spanish National Research Council)H-Index: 33
view all 3 authors...
In the next few years, the amount and variety of context-aware robotic manipulator applications is expected to increase significantly, especially in household environments. In such spaces, thanks to programming by demonstration, non-expert people will be able to teach robots how to perform specific tasks, for which the adaptation to the environment is imperative, for the sake of effectiveness and users safety. These robot motion learning procedures allow the encoding of such tasks by means of pa...
1 CitationsSource