A worldwide web of images

-
Blaise Aguera y Arcas, Microsoft Live Labs
Fine Hall 214

In this talk we'll explore the emerging potential of computer vision to transform the way we think about the interconnectedness of digital imagery and the Web, and how these relate to our physical environment. We'll begin with an introduction to the foundations of "3D computer vision," a bag of tricks which has been developing steadily for three decades, combining classical photogrammetry with machine vision. We'll then dive specifically into Photosynth, based on a combination of the Photo Tourism project (a collaboration between Microsoft Research and the University of Washington) and Seadragon, a multiresolution networked platform allowing one to play with arbitrarily many arbitrary large visual objects using only constant-time and constant-bandwidth operations. The aim of Photosynth is to allow meaningful 3D navigation within real-world environments reconstructed entirely from the photos. Interesting social dimensions are added to this application when one considers that the source photos can be mined from the existing Web, aggregated from user communities, and actively contributed to and interconnected. We'll end with some preliminary findings about the latent graph structure of Internet photography, and a glimpse of where we're heading next.