An Empirical Study of Nonconvex Optimization in Deep Learning
DOI: https://doi.org/10.55632/pwvas.v98i1.1341

Abstract
This study examines how common training choices affect convergence and generalization in nonconvex deep learning. Models were trained on CIFAR-10 using ResNet-18, VGG-11, and a three-layer MLP while varying the optimizer, learning-rate schedule, batch size, and weight decay. Each configuration was repeated across three random seeds and evaluated on training loss, test accuracy, generalization gap, and an estimate of the Hessian trace as a measure of sharpness. The results show that adaptive optimizers reduced loss faster early in training, but SGD with momentum remained competitive in final test accuracy when properly tuned. Cosine annealing and warmup-plus-cosine schedules generally produced smaller generalization gaps than constant and step-decay schedules. Increasing batch size was associated with sharper minima and worse generalization, while AdamW provided more reliable regularization than standard Adam under matched weight-decay settings. Overall, the study shows that optimizer and training choices strongly influence both convergence behavior and final model quality, and that careful tuning can matter as much as the choice of optimizer itself.
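The abstract does not state how the Hessian trace was estimated; a common choice for this kind of sharpness measure is Hutchinson's stochastic trace estimator, which averages v^T H v over random Rademacher vectors v. The sketch below is an illustrative assumption, not the paper's implementation, shown on a small explicit matrix rather than a neural network's Hessian:

```python
import numpy as np

def hutchinson_trace(H, n_samples=1000, seed=None):
    """Estimate tr(H) for a symmetric matrix H as the sample mean of
    v^T H v, where each v has i.i.d. Rademacher (+/-1) entries.
    E[v^T H v] = tr(H), so the estimate converges as n_samples grows."""
    rng = np.random.default_rng(seed)
    d = H.shape[0]
    total = 0.0
    for _ in range(n_samples):
        v = rng.choice([-1.0, 1.0], size=d)  # Rademacher probe vector
        total += v @ H @ v                   # quadratic form, one sample
    return total / n_samples

# Small symmetric test matrix with known trace 2 + 3 = 5.
H = np.array([[2.0, 1.0],
              [1.0, 3.0]])
print(hutchinson_trace(H, n_samples=5000, seed=0))  # close to 5.0
```

In deep-learning practice the quadratic form v^T H v is computed with Hessian-vector products (e.g. via double backpropagation) rather than by materializing H, since the full Hessian is far too large to store.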
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
Proceedings of the West Virginia Academy of Science applies the Creative Commons Attribution-NonCommercial (CC BY-NC) license to works we publish. By virtue of their appearance in this open access journal, articles are free to use, with proper attribution, in educational and other non-commercial settings.
