On the Stability and Reproducibility of Data Science Pipelines

10:15-10:35, January 28 @ 4BC

Talk/ Overview

Modern data science pipelines involve a complex array of operations, with many sources of stochastic behaviour, some controlled, some uncontrolled. In pharma, an exemplar scenario is that of data-driven biomarker selection. This talk will discuss statistical methods to quantify the stability and hence *reproducibility* of results coming from such pipelines.

Talk/ Speakers

Gavin Brown

University of Manchester

AMLD / Global partners