Sara Beery, MIT: Urban Forest Mapping
At RISE Learning Machines Seminar on Dec 1, we have the pleasure to listen to Sara Beery from MIT give her talk: "Auto Arborist: Towards Mapping Urban Forests Across North America".
– Generalization to novel domains is a fundamental challenge for computer vision. We propose urban forest monitoring for capturing real world complexity.
Abstract
Generalization to novel domains is a fundamental challenge for computer vision. Near-perfect accuracy on benchmarks is common, but these models do not work as expected when deployed outside of the training distribution. To build computer vision systems that truly solve real-world problems at global scale, we need benchmarks that fully capture real-world complexity, including geographic domain shift, long-tailed distributions, and data noise. We propose urban forest monitoring as an ideal testbed for studying and improving upon these computer vision challenges, while working towards filling a crucial environmental and societal need. Urban forests provide significant benefits to urban societies.
However, planning and maintaining these forests is expensive. One particularly costly aspect of urban forest management is monitoring the existing trees in a city: e.g., tracking tree locations, species, and health. Monitoring efforts are currently based on tree censuses built by human experts, costing cities millions of dollars per census and thus collected infrequently. Previous investigations into automating urban forest monitoring focused on small datasets from single cities, covering only common categories .
To address these shortcomings, we introduce a new large-scale dataset that joins public tree censuses from 23 cities with a large collection of street level and aerial imagery. Our Auto Arborist dataset contains over 2.5M trees and >300 genera and is >2 orders of magnitude larger than the closest dataset in the literature. We introduce baseline results on our dataset across modalities as well as metrics for the detailed analysis of generalization with respect to geographic distribution shifts, vital for such a system to be deployed at-scale.
About the speaker
Sara is an incoming assistant professor at MIT EECS’ Faculty of AI and Decision Making and CSAIL, and a Visiting Researcher at Google working on Auto Arborist. She’s always loved the natural world, and has seen a growing need for technology-based approaches to conservation and sustainability challenges. Her research focuses on building computer vision methods that enable global-scale environmental and biodiversity monitoring across data modalities, tackling real-world challenges including strong spatiotemporal correlations that lead to domain shift, imperfect data quality, fine-grained categories, and long-tailed distributions. Sara received my PhD in Computing and Mathematical Sciences (CMS) at Caltech, advised by Pietro Perona, where she recieved the Amori Doctoral Prize for my dissertation.