Optimization for large-scale doubly-penalized ANOVA modeling and understanding accelerated gradient methods