Are you an EPFL student looking for a semester project?
Work with us on data science and visualisation projects, and deploy your project as an app on top of Graph Search.
This lecture covers techniques for efficiently retrieving similar items using similarity search queries. It discusses naive near neighbors, approximate near neighbors, and hashing items with concepts like MinHash and LSH on Spark. The presentation emphasizes trading accuracy for performance and the use of load balancing in near neighbors retrieval.