Dimension Reduction Methods for Data Visualization

-
Antonio Auffinger, Northwestern
Fine Hall 224

The purpose of dimension reduction methods for data visualization is to project high dimensional data to 2 or 3 dimensions so that humans can understand some of its structure. In this talk, we will give a an overview of some of the most popular and powerful methods in this active area. We will then the focus on two algorithms: Stochastic Neighbor Embedding (SNE) and Uniform Manifold Approximation and Projection (UMAP). Here, we will present new rigorous results that establish an equilibrium distribution for these methods when the number of data points diverge in the presence of pure noise or with a planted signal. Based on joint work with Daniel Fletcher.