Efficient Training of Input Convex Neural Networks Using Variable Projection 公开

Chen, Yixiao (Spring 2023)

Permanent URL: https://etd.library.emory.edu/concern/etds/b5644t07f?locale=zh
Published

Abstract

Normalizing flows are deep generative models that construct a diffeomorphic mapping between a simple reference distribution and a complex probability distribution. Previous studies have utilized the gradient of input convex neural networks (ICNNs) to represent the diffeomorphism, which guarantees invertibility for any network weights. Moreover, the resulting mapping constitutes a unique optimal transformation that minimizes transportation costs, as dictated by optimal transport theory.Training ICNNs presents a challenging non-convex problem, except for weights of the last layer. To address this issue, this thesis proposes a training approach for ICNNs using variable projection (VarPro). The proposed method takes advantage of the affine mapping in the last layer of ICNNs, which preserves convexity of the network. Empirical results based on a two-dimensional synthetic dataset demonstrate that VarPro achieves a lower test loss and requires fewer gradient evaluations compared to the mini-batch gradient descent method Adam.

Table of Contents

1 Introduction...................... 1

1.1 Contribution&Outline...................... 3

2 Background......................5

2.1 ConvexPotentialFlows...................... 6

2.2 VariableProjection........................ 10

3 Efficient Method in Training ICNNs......................12

3.1 SampleAverageApproximation ................. 13

3.2 TrainingICNNswithVariableProjection . . . . . . . . . . . . 14

4 Numerical Experiments......................19

4.1 ExperimentwithMoonsDataset................. 20

5 Discussion......................23

6 Conclusion......................25 

About this Honors Thesis

Rights statement
  • Permission granted by the author to include this thesis or dissertation in this repository. All rights reserved by the author. Please contact the author for information regarding the reproduction and use of this thesis or dissertation.
School
Department
Degree
Submission
Language
  • English
Research Field
关键词
Committee Chair / Thesis Advisor
Committee Members
最新修改

Primary PDF

Supplemental Files