FANOK: Knockoffs in Linear Time

We describe a series of algorithms that efficiently implement Gaussian model-X knockoffs to control the false discovery rate on large-scale feature selection problems. Identifying the knockoff distribution requires solving a large-scale semidefinite program for which we derive several efficient methods. One handles generic covariance matrices and has a complexity scaling as O(p(3)), where p is the ambient dimension, while another assumes a rank-k factor model on the covariance matrix to reduce this complexity bound to O(pk(2)). We review an efficient procedure to estimate factor models and show that under a factor model assumption, we can sample knockoff covariates with complexity linear in the dimension. We test our methods on problems with p as large as 500 000.

Chat with Graph Search

Ask any question about EPFL courses, lectures, exercises, research, news, etc. or try the example questions below.

DISCLAIMER: The Graph Chatbot is not programmed to provide explicit or categorical answers to your questions. Rather, it transforms your questions into API requests that are distributed across the various IT services officially administered by EPFL. Its purpose is solely to collect and recommend relevant references to content that you can explore to help you answer your questions.

FANOK: Knockoffs in Linear Time

Graph Chatbot

Chat with Graph Search

Inhalation of Microplastics—A Toxicological Complexity

A Comparative Analysis of Tools & Task Types for Measuring Computational Problem-Solving

The Complexity of Checking Non-Emptiness in Symbolic Tree Automata

Inhalation of Microplastics—A Toxicological Complexity

The Complexity of Checking Non-Emptiness in Symbolic Tree Automata

A Comparative Analysis of Tools & Task Types for Measuring Computational Problem-Solving