Repository logo
English
  • English
  • Català
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Italiano
  • Latviešu
  • Magyar
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Suomi
  • Svenska
  • Türkçe
  • Tiếng Việt
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Yкраї́нська
  • Log In
    or
    New user? Click here to register.Have you forgotten your password?
Repository logo
  • Communities & Collections
  • All of R-3
English
  • English
  • Català
  • Čeština
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • Italiano
  • Latviešu
  • Magyar
  • Nederlands
  • Polski
  • Português
  • Português do Brasil
  • Suomi
  • Svenska
  • Türkçe
  • Tiếng Việt
  • Қазақ
  • বাংলা
  • हिंदी
  • Ελληνικά
  • Yкраї́нська
  • Log In
    or
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Browse by Author

Browsing by Author "Darman, Moein"

Now showing 1 - 1 of 1
Results Per Page
Sort Options
  • Loading...
    Thumbnail Image
    Item
    Investigating the Role of Transfer Learning in Enhancing CNN-Based Subgrid-Scale Models for Geophysical Turbulence
    (2024-12-05) Darman, Moein; Hassanzadeh, Pedram
    Transfer learning (TL) is a powerful tool for enhancing the performance of neural networks (NNs) in applications such as weather and climate prediction and turbulence modeling. TL enables models to generalize to out-of-distribution data with minimal data input. In this study, we employed a 9-layer convolutional NN to predict subgrid forcing in quasi-geostrophic systems and examined which metrics best describe its performance and generalizability. Fourier analysis of the NNs' kernels reveals that they learn low-pass, band-pass, and high-pass filters, regardless of their training dataset's isotropic or anisotropic nature. By analyzing the activation spectra, we also identified the reasons behind NNs' failure to generalize and how TL can overcome these limitations. The main reason is that learned weights and biases on one dataset underestimate the out-of-distribution sample spectra as they pass through NN, leading to an underestimation of output spectra. By only re-training one layer with new data from the target system, this underestimation is fixed and results in NN producing predictions matching target system dataset spectra. These findings are broadly applicable to data-driven parameterization of high-dimensional dynamical systems.
  • About R-3
  • Report a Digital Accessibility Issue
  • Request Accessible Formats
  • Fondren Library
  • Contact Us
  • FAQ
  • Privacy Notice
  • R-3 Policies

Physical Address:

6100 Main Street, Houston, Texas 77005

Mailing Address:

MS-44, P.O.BOX 1892, Houston, Texas 77251-1892