Investigating the Role of Transfer Learning in Enhancing CNN-Based Subgrid-Scale Models for Geophysical Turbulence

Date
2024-12-05
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract

Transfer learning (TL) is a powerful tool for enhancing the performance of neural networks (NNs) in applications such as weather and climate prediction and turbulence modeling. TL enables models to generalize to out-of-distribution data with minimal data input. In this study, we employed a 9-layer convolutional NN to predict subgrid forcing in quasi-geostrophic systems and examined which metrics best describe its performance and generalizability. Fourier analysis of the NNs' kernels reveals that they learn low-pass, band-pass, and high-pass filters, regardless of their training dataset's isotropic or anisotropic nature. By analyzing the activation spectra, we also identified the reasons behind NNs' failure to generalize and how TL can overcome these limitations. The main reason is that learned weights and biases on one dataset underestimate the out-of-distribution sample spectra as they pass through NN, leading to an underestimation of output spectra. By only re-training one layer with new data from the target system, this underestimation is fixed and results in NN producing predictions matching target system dataset spectra. These findings are broadly applicable to data-driven parameterization of high-dimensional dynamical systems.

Description
Degree
Master of Science
Type
Thesis
Keywords
Transfer Learning, Data-driven parameterization, turbulence modeling, neural networks, explainability
Citation
Has part(s)
Forms part of
Published Version
Rights
Link to license
Citable link to this page