Dimension Reduction Methods for Data Visualization

March 19, 2024 - 03:00 - March 19, 2024 - 04:00

Antonio Auffinger, Northwestern

Fine Hall 224

The purpose of dimension reduction methods for data visualization is to project high dimensional data to 2 or 3 dimensions so that humans can understand some of its structure. In this talk, we will give a an overview of some of the most popular and powerful methods in this active area. We will then the focus on two algorithms: Stochastic Neighbor Embedding (SNE) and Uniform Manifold Approximation and Projection (UMAP). Here, we will present new rigorous results that establish an equilibrium distribution for these methods when the number of data points diverge in the presence of pure noise or with a planted signal. Based on joint work with Daniel Fletcher.