
08.09.18 Branching gene expression in response to infection analysed with Gaussian processes

last modified Oct 09, 2018 05:12 PM
Penfold et al. from the Surani lab apply statistical methods to understand patterns of change in gene expression following perturbations such as infection.

Fig. 1 (extract): Fitting a branching process with different Gaussian processes

Branch-recombinant Gaussian processes for analysis of perturbations in biological time series

Penfold CA et al. (2018) Bioinformatics Volume 34, Issue 17, pp i1005–i1013. DOI: 10.1093/bioinformatics/bty603


Abstract from the paper


A common class of behaviour encountered in the biological sciences involves branching and recombination. During branching, a statistical process bifurcates resulting in two or more potentially correlated processes that may undergo further branching; the contrary is true during recombination, where two or more statistical processes converge. A key objective is to identify the time of this bifurcation (branch or recombination time) from time series measurements, e.g. by comparing a control time series with perturbed time series. Gaussian processes (GPs) represent an ideal framework for such analysis, allowing for nonlinear regression that includes a rigorous treatment of uncertainty. Currently, however, GP models only exist for two-branch systems. Here, we highlight how arbitrarily complex branching processes can be built using the correct composition of covariance functions within a GP framework, thus outlining a general framework for the treatment of branching and recombination in the form of branch-recombinant Gaussian processes (B-RGPs).


We first benchmark the performance of B-RGPs compared to a variety of existing regression approaches, and demonstrate robustness to model misspecification. B-RGPs are then used to investigate the branching patterns of Arabidopsis thaliana gene expression following inoculation with the hemibiotrophic bacterium, Pseudomonas syringae DC3000, and a disarmed mutant strain, hrpA. By grouping genes according to the number of branches, we could naturally separate out genes involved in basal immune response from those subverted by the virulent strain, and show enrichment for targets of pathogen protein effectors. Finally, we identify two early branching genes, WRKY11 and WRKY17, and show that genes that branched at similar times to WRKY11/17 were enriched for W-box binding motifs, and overrepresented for genes differentially expressed in WRKY11/17 knockouts, suggesting that branch time could be used for identifying direct and indirect binding targets of key transcription factors.
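
To make the idea of composing covariance functions concrete, here is a minimal Python sketch of a two-branch system. It is an illustration under stated assumptions, not the authors' implementation. It assumes the control series follows a latent function f(t) and the perturbed series follows f(t) + g(t), where g is switched on by a smooth sigmoid gate around a candidate branch time. The squared-exponential kernel, the sigmoid gate, and all function and parameter names (se_kernel, branch_gate, two_branch_cov, steepness, noise) are hypothetical choices made for this example.

```python
import numpy as np


def se_kernel(a, b, variance=1.0, lengthscale=1.0):
    """Squared-exponential covariance between time vectors a and b."""
    d = a[:, None] - b[None, :]
    return variance * np.exp(-0.5 * (d / lengthscale) ** 2)


def branch_gate(t, branch_time, steepness=20.0):
    """Sigmoid gate: close to 0 before the branch time, close to 1 after it."""
    return 1.0 / (1.0 + np.exp(-steepness * (t - branch_time)))


def two_branch_cov(t_control, t_perturbed, branch_time):
    """Joint covariance of [control, perturbed] observations.

    Assumed model: control = f(t), perturbed = f(t) + g(t), where g has a
    gated kernel so that it contributes almost nothing before the branch time.
    """
    k_cc = se_kernel(t_control, t_control)
    k_cp = se_kernel(t_control, t_perturbed)
    k_pp = se_kernel(t_perturbed, t_perturbed)
    gate = branch_gate(t_perturbed, branch_time)
    # Gated deviation kernel: s(t) * k(t, t') * s(t')
    k_dev = gate[:, None] * se_kernel(t_perturbed, t_perturbed) * gate[None, :]
    return np.block([[k_cc, k_cp], [k_cp.T, k_pp + k_dev]])


def log_marginal_likelihood(y, cov, noise=0.1):
    """Zero-mean GP log marginal likelihood with i.i.d. Gaussian noise."""
    n = y.size
    chol = np.linalg.cholesky(cov + noise ** 2 * np.eye(n))
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y))
    return (-0.5 * y @ alpha
            - np.log(np.diag(chol)).sum()
            - 0.5 * n * np.log(2.0 * np.pi))


def score_branch_times(t, y_control, y_perturbed, candidates):
    """Score each candidate branch time by its log marginal likelihood."""
    y = np.concatenate([y_control, y_perturbed])
    return np.array([
        log_marginal_likelihood(y, two_branch_cov(t, t, tb))
        for tb in candidates
    ])
```

Under these assumptions, a branch time could be estimated by evaluating score_branch_times over a grid of candidates and taking the highest-scoring one. More complex branching or recombination structures would, in the same spirit, add further gated terms to the joint covariance.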

Availability and implementation


