TDA crash course: theory and practice for ML applications

Workshop / Overview

GitHub repo: https://github.com/lordgrilo/AML-days-TDA-tutorial

We will introduce elements of the theory of topological data analysis (TDA) and show the participants how to apply standard topological data pipelines, such as persistent homology and Mapper, to datasets of different origin (spatial data, word embeddings, brain networks). We will also show how TDA can be used for feature engineering or as a preprocessing step in standard machine learning pipelines, and how to assess the significance of the unearthed features. Finally, since TDA outputs can sometimes be of difficult interpretation, we will guide the participants through some common pitfalls in the interpretation and provide some examples of successful applications.

The workshop will consist of: an introductory section on TDA methods; hands-on exercises focused on implementing the required pipelines and performing the analysis; a session on validation methods and null models for TDA; a session on strengths and weaknesses of topological methods.

We will use Jupyter notebooks in Python for analysis and visualization.

Workshop / Outcome

Participants will be able to set up a TDA pipeline
Participants will be able to interpret the relevance of the results based on both theory and practice
Participants will have developed ready-to-use code, which they will be able to adapt to similar applications

Workshop / Difficulty

Intermediate level

Workshop / Prerequisites

basic Python programming skills (laptop with Python 2.7/3 installed)
basic Jupyter notebook skills
basic git/data management skills
notions of linear algebra and elementary topology are helpful but not needed
GitHub repo: https://github.com/lordgrilo/AML-days-TDA-tutorial. Please install the required packages

Track / Co-organizers

Giovanni Petri

Research Leader, ISI Foundation

AMLD EPFL 2019 / Workshops

View workshops

TensorFlow Basics 2019 – Saturday

With Bartek Wołowiec, Megan Ruthven, Ruslan Habalov & Andreas Steiner

09:00-16:30 January 26

Document Digitization Challenge

With Mihai Gurban & Raquel Terrés Cristofani

09:00-16:30 January 262A

TensorFlow Basics 2019 – Sunday

With Bartek Wołowiec, Megan Ruthven, Ruslan Habalov & Andreas Steiner

09:00-16:30 January 274ABC

AMLD / Global partners

AMLD EPFL 2019 TDA crash course: theory and practice for ML applications