Algorithm Validation for Data Science
Data Science (DS) algorithms interpret outcomes of empirical experiments with random influences. Often, such algorithms are cascaded to long processing pipelines especially in biomedical applications. The validation of such pipelines poses an open question since data compression of the input should preserve as much information as possible to distinguish between possible outputs. Starting with a minimum description length argument for model selection we motivate a localization criterion as a lower bound that achieves information theoretical optimality. Uncertainty in the input causes a rate distortion tradeoff in the output when the DS algorithm is adapted by learning. We present design choices for algorithm selection and sketch a theory of validation. The concept is demonstrated in neuroscience applications of diffusion tensor imaging for tractography and brain parcellation.
Date: 21 September 2022, 11:00 (Wednesday, -2nd week, Michaelmas 2022)
Venue: Richard Doll Building, Old Road Campus OX3 7LF
Venue Details: Lecture Theatre
Speaker: Professor Joachim M. Buhmann (Department of Computer Science, ETH Zurich, Switzerland)
Organising department: Big Data Institute (NDPH)
Part of: BDI seminars
Booking required?: Not required
Audience: Members of the University only
Editor: Graham Bagley