Patrick Thiran, Farnood Salehi, Laura Elisa Celis
Coordinate descent methods usually minimize a cost function by updating one randomly chosen decision variable (one coordinate) at a time. Ideally, we would update the decision variable that yields the largest decrease in the cost function. However, f ...
NEURAL INFORMATION PROCESSING SYSTEMS (NIPS) 2018
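
The abstract contrasts two ways of choosing which coordinate to update: at random, or greedily by largest decrease in cost. The following is a minimal sketch of those two rules only, not the method proposed in the paper. It assumes a quadratic cost f(x) = 0.5 * x^T A x - b^T x with A symmetric positive definite; the names `coordinate_descent` and `cost` are hypothetical and chosen for illustration.

```python
import numpy as np


def cost(A, b, x):
    """Assumed quadratic cost f(x) = 0.5 * x^T A x - b^T x."""
    return 0.5 * x @ A @ x - b @ x


def coordinate_descent(A, b, rule="random", steps=200, seed=0):
    """Illustrative coordinate descent on the assumed quadratic cost.

    rule="random": pick the coordinate uniformly at random.
    rule="greedy": pick the coordinate whose exact update gives the
                   largest decrease in cost.
    """
    rng = np.random.default_rng(seed)
    n = len(b)
    x = np.zeros(n)
    diag = np.diag(A)  # second derivative of f along each coordinate
    for _ in range(steps):
        # For simplicity the full gradient is recomputed at every step,
        # even under the random rule, which only needs one entry of it.
        grad = A @ x - b
        if rule == "random":
            i = int(rng.integers(n))
        else:
            # Exact minimization along coordinate i lowers the cost by
            # grad[i]**2 / (2 * A[i, i]); choose the largest such decrease.
            i = int(np.argmax(grad**2 / (2.0 * diag)))
        # Exact minimizer of f along coordinate i, all others held fixed.
        x[i] -= grad[i] / diag[i]
    return x


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    M = rng.standard_normal((5, 5))
    A = M.T @ M + 5 * np.eye(5)  # symmetric positive definite by construction
    b = rng.standard_normal(5)
    for rule in ("random", "greedy"):
        x = coordinate_descent(A, b, rule=rule, steps=20)
        print(rule, cost(A, b, x))
```

In this sketch the greedy rule inspects every coordinate before each update, so a single greedy step costs about as much as updating all coordinates once, while the random rule could be implemented by computing only one gradient entry per step.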