Efficient Training of Input Convex Neural Networks Using Variable Projection 公开
Chen, Yixiao (Spring 2023)
Abstract
Normalizing flows are deep generative models that construct a diffeomorphic mapping between a simple reference distribution and a complex probability distribution. Previous studies have utilized the gradient of input convex neural networks (ICNNs) to represent the diffeomorphism, which guarantees invertibility for any network weights. Moreover, the resulting mapping constitutes a unique optimal transformation that minimizes transportation costs, as dictated by optimal transport theory.Training ICNNs presents a challenging non-convex problem, except for weights of the last layer. To address this issue, this thesis proposes a training approach for ICNNs using variable projection (VarPro). The proposed method takes advantage of the affine mapping in the last layer of ICNNs, which preserves convexity of the network. Empirical results based on a two-dimensional synthetic dataset demonstrate that VarPro achieves a lower test loss and requires fewer gradient evaluations compared to the mini-batch gradient descent method Adam.
Table of Contents
1 Introduction...................... 1
1.1 Contribution&Outline...................... 3
2 Background......................5
2.1 ConvexPotentialFlows...................... 6
2.2 VariableProjection........................ 10
3 Efficient Method in Training ICNNs......................12
3.1 SampleAverageApproximation ................. 13
3.2 TrainingICNNswithVariableProjection . . . . . . . . . . . . 14
4 Numerical Experiments......................19
4.1 ExperimentwithMoonsDataset................. 20
5 Discussion......................23
6 Conclusion......................25
About this Honors Thesis
School | |
---|---|
Department | |
Degree | |
Submission | |
Language |
|
Research Field | |
关键词 | |
Committee Chair / Thesis Advisor | |
Committee Members |
Primary PDF
Thumbnail | Title | Date Uploaded | Actions |
---|---|---|---|
Efficient Training of Input Convex Neural Networks Using Variable Projection () | 2023-04-10 15:27:11 -0400 |
|
Supplemental Files
Thumbnail | Title | Date Uploaded | Actions |
---|