Conference Papers

Proving linear mode connectivity of neural networks via optimal transport
Gradient descent is optimal under lower restricted secant inequality and upper error bound
On Fundamental Proof Structures in First-Order Optimization
Super-acceleration with cyclical step-sizes
A Study of Condition Numbers for First-Order Optimization
Gradient-based sample selection for online continual learning
Robust Detection of Covariate-Treatment Interactions in Clinical Trials