Deep Dive workshop
Statistical perspectives on machine learning for health applications
Wednesday 27 August 9.00
Organizer: Line Clemmensen, University of Copenhagen
Our session aims to bring a statistical perspective on data science with applications in healthcare. Healthcare is considered a high risk application in EU’s AI act, and therefore statistical rigor is of utmost importance. The use of machine learning in health science research is rapidly increasing, and AI tools are also entering clinical practice. This stresses the importance of understanding the specific challenges embedded into health data, and how they interplay with machine learning approaches. In this session, we introduce a number of cases and topics where statistical thinking may provide new insights into ML for health applications.
We will bring together researchers and practitioners to discuss methods and best practices for accurate modeling and evaluation of ML/AI in health applications. The methodology and discussions are supported by examples with computer vision to detect skin cancer, risk of cardiovascular disease, and medical image analysis.
The session will be organized as a mix of invited (3 x 30 minutes) and contributed (3 x 15 minutes) talks, as well as facilitated discussions at tables (2 x 20 minutes). The invited talks provide introductory overviews to general themes, while the contributed talks can zoom in on a more specific topic or piece of research. The table discussions will focus on how the talks relate to the participants’ own data science practice and experiences.
The invited speakers are:
The contributed speakers are:
We expect an audience of around 50-70 people in the room. The target audience is any participant with an interest in or working with data science for healthcare applications, including but not limited to those on the statistical evaluation of AI Summer School. Both conference participants engaged in health care applications using data science methods, or those who do research in trustworthy AI and related fields may find this session useful.
We expect participants to gain insights into methodological challenges when working with health data and an overview of existing knowledge and approaches from statistics that may help address such challenges. Through the table discussions, participants will reflect on how the topics presented by the invited speakers relate to their own work. We will capture the learnings on posters with colored stick-it notes.
Introductory
The organizing team is multidisciplinary and multi-institutional, bringing forth experience from theoretical and applied machine learning and statistics.