On the Stability and Reproducibility of Data Science Pipelines

10:15-10:35, January 28 @ 4BC

Talk/ Overview

Modern data science pipelines involve a complex array of operations, with many sources of stochastic behaviour, some controlled, some uncontrolled. In pharma, an exemplar scenario is that of data-driven biomarker selection. This talk will discuss statistical methods to quantify the stability and hence *reproducibility* of results coming from such pipelines.
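
As a rough illustration of what quantifying selection stability can mean, the sketch below scores how consistently a pipeline picks the same biomarkers across repeated runs, using the mean pairwise Jaccard similarity of the selected sets. This is an assumption chosen for simplicity, not necessarily the measure presented in the talk; the function names and the gene labels are hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two sets; treated as 1.0 when both are empty."""
    a, b = set(a), set(b)
    union = a | b
    return 1.0 if not union else len(a & b) / len(union)


def selection_stability(selections):
    """Mean pairwise Jaccard similarity across repeated selection runs.

    A value of 1.0 means every run selected exactly the same features;
    values near 0.0 mean the runs barely overlap.
    """
    pairs = list(combinations(selections, 2))
    if not pairs:
        raise ValueError("need at least two selection runs")
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


# Hypothetical biomarker sets selected on three resamples of the same data.
runs = [
    {"geneA", "geneB", "geneC"},
    {"geneA", "geneB", "geneD"},
    {"geneA", "geneB", "geneC"},
]
stability = selection_stability(runs)
print(f"stability = {stability:.3f}")
```

In practice the runs would come from rerunning the selection step on bootstrap resamples or with different random seeds. Simple overlap scores like this one are sensitive to how many features are selected, so they are best read as a first diagnostic rather than a calibrated statistic.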

Talk/ Speakers

Gavin Brown

University of Manchester

Talk/ Slides

Download the slides for this talk. Download (PDF, 6369.69 MB)

Talk/ Highlights

22:28

On the Stability and Reproducibility of Data Science Pipelines

With Gavin Brown. Published March 12, 2020

AMLD / Global partners