Episode 01 - Mechanistic Machine Learning

Update: 2020-07-10

Description

This is the first episode of the Flush to Data podcast. We start with a discussion on mechanistic modelling and machine learning and venture into models for emulation, uncertainty quantification, and data quality. Bonus material includes a discussion on aspects of current scientific practice, including the lack of hypothesis testing, the evaluation of novelty, and the challenges with a generalist approach.

Hosts: Jörg Rieckermann and Kris Villez
Guest: Juan Pablo Carbjal

Links:
* Juan Pablo's web page: https://sites.google.com/site/juanpicarbajal/
* Article relating Gaussian processes and Kalman filter: www.jstor.org/stable/2984861
* BBC podcast on Gauss: https://www.bbc.co.uk/programmes/b09gbnfj
* Using Lake Zurich as a heat sink: Unfortunately, we could not back-track the original source, despite considerable effort. If anyone of the listeners happens to know how to access the original source we would be grateful for a notice. The best we could find was documentation of related projects by Eawag: https://thermdis.eawag.ch/ and [1]. These show that ecological consequences have indeed been assessed in detail.
* Goodhart's law: https://en.wikipedia.org/wiki/Goodhart's_law
* An invitation to reproducible computational research: https://doi.org/10.1093/biostatistics/kxq028
* Science in the age of selfies: https://doi.org/10.1073/pnas.1609793113

References:
[1] Wüest, A. (2012). Potential zur Wärmeenergienutzung aus dem Zürichsee. Machbarkeit. Wärmeentzug (Heizen) und Einleitung von Kühlwasser. Kastanienbaum: Eawag. DORA-Link

Episode guide:
[0:00:00 ] Who is Juan Pablo Carbajal?
[0:03:10 ] Mechanistic modelling versus artificial intelligence
[0:07:08 ] Who is Juan Pablo Carbajal? (ctd.)
[0:09:26 ] Cross-fertilization between robotics and wastewater engineering
[0:15:05 ] Emulation: using models to approximate other models
[0:21:22 ] Incorporating common sense and prior knowledge into data-driven models
[0:31:31 ] Equivalence between Gaussian processes and Kalman filter
[0:33:50 ] Utility of emulation
[0:40:15 ] Utility of quantified uncertainty
[0:44:50 ] Intermezzo
[0:49:04 ] What can models say about data quality
[1:02:15 ] How to communicate about data quality?
[1:10:10 ] Preparing engineers for the future
[1:15:23 ] Thank you and goodbye!

Bonus material:
[1:16:40 ] Interpretable machine learning models
[1:22:33 ] Hypothesis testing
[1:26:14 ] Critical assessment of novelty
[1:30:50 ] Barriers to the generalist approach
[1:35:48 ] Thank you and goodbye!

Comments

In Channel

[Episode 07] Linda Åmand

2023-05-1601:29:35

[Episode 06] Ali Gagnon

2021-12-2001:09:08

[Episode 05] Lina Belia

2021-02-2850:54

Episode 04 - Open Science

2020-11-2301:31:36

Episode 03 - Integrated assessment - Bonus track

2020-09-0124:51

Episode 03 - Integrated Assessment

2020-09-0148:33

Episode 02 - Micropollutants and Learning from Sewers on Society

2020-07-1101:13:58

Episode 01 - Mechanistic Machine Learning

2020-07-1001:36:09

Episode 00 - Trailer

2020-07-0702:27

00:00

Episode 01 - Mechanistic Machine Learning

Kris Villez and Jörg Rieckermann

#box-pro-ellipsis-176125707037961{-webkit-line-clamp:2;}Episode 01 - Mechanistic Machine Learning

Episode 01 - Mechanistic Machine Learning

Kris Villez and Jörg Rieckermann

Episode 01 - Mechanistic Machine Learning