- About
- Events
- Calendar
- Graduation Information
- Cornell Learning Machines Seminar
- Student Colloquium
- BOOM
- Fall 2024 Colloquium
- Conway-Walker Lecture Series
- Salton 2024 Lecture Series
- Seminars / Lectures
- Big Red Hacks
- Cornell University - High School Programming Contests 2024
- Game Design Initiative
- CSMore: The Rising Sophomore Summer Program in Computer Science
- Explore CS Research
- ACSU Research Night
- Cornell Junior Theorists' Workshop 2024
- People
- Courses
- Research
- Undergraduate
- M Eng
- MS
- PhD
- Admissions
- Current Students
- Computer Science Graduate Office Hours
- Advising Guide for Research Students
- Business Card Policy
- Cornell Tech
- Curricular Practical Training
- A & B Exam Scheduling Guidelines
- Fellowship Opportunities
- Field of Computer Science Ph.D. Student Handbook
- Graduate TA Handbook
- Field A Exam Summary Form
- Graduate School Forms
- Instructor / TA Application
- Ph.D. Requirements
- Ph.D. Student Financial Support
- Special Committee Selection
- Travel Funding Opportunities
- Travel Reimbursement Guide
- The Outside Minor Requirement
- Diversity and Inclusion
- Graduation Information
- CS Graduate Minor
- Outreach Opportunities
- Parental Accommodation Policy
- Special Masters
- Student Spotlights
- Contact PhD Office
A Near-Linear Time Algorithm for the Chamfer Distance (via zoom)
Abstract: I will describe a fast algorithm to approximate the Chamfer distance, a commonly used proxy for the more computationally demanding Earth-Mover (Optimal Transport) Distance. For any two point sets A and B of size n, the Chamfer distance from A to B is defined as CH(A, B) = sum over x in A of d(x, B), where d(x, B) is the minimum distance from x to any point in B. In other words, it is the sum of nearest neighbor distances from A to B. The Chamfer distance is a popular measure of dissimilarity between point clouds, used in many machine learning, computer vision, and graphics applications, and admits a straightforward O(n^2)-time brute force algorithm. However, the quadratic dependence on n in the running time makes the naive approach intractable for large datasets. I will present a (1+epsilon)-approximate algorithm for estimating the Chamfer distance with a near-linear running time. Our experiments demonstrate that it is both accurate and fast on large high-dimensional datasets.
This is joint work with Ainesh Bakshi, Piotr Indyk, Rajesh Jayaram, and Erik Waingarten, and will appear at NeurIPS 23.
Bio: Sandeep is a final year PhD student at MIT, advised by Piotr Indyk. His interests are broadly in efficient algorithm design. Recently, he has been working in the intersection of machine learning and algorithms by designing provable algorithms in various ML settings, such as efficient algorithms for processing large datasets, as well as using ML to inspire and motivate algorithm design.