Out-of-distributional risk bounds for neural operators with applications to the Helmholtz equation

Despite their remarkable success in approximating a wide range of operators defined by PDEs, existing neural operators (NOs) do not necessarily perform well for all physics problems. We focus here on high-frequency waves to highlight possible shortcomings. To resolve these, we propose a subfamily of...

Full description

Saved in:
Bibliographic Details
Main Authors: Benitez, J. Antonio Lara, Furuya, Takashi, Faucher, Florian, Kratsios, Anastasis, Tricoche, Xavier, de Hoop, Maarten V
Format: Journal Article
Language:English
Published: 26-01-2023
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Despite their remarkable success in approximating a wide range of operators defined by PDEs, existing neural operators (NOs) do not necessarily perform well for all physics problems. We focus here on high-frequency waves to highlight possible shortcomings. To resolve these, we propose a subfamily of NOs enabling an enhanced empirical approximation of the nonlinear operator mapping wave speed to solution, or boundary values for the Helmholtz equation on a bounded domain. The latter operator is commonly referred to as the ''forward'' operator in the study of inverse problems. Our methodology draws inspiration from transformers and techniques such as stochastic depth. Our experiments reveal certain surprises in the generalization and the relevance of introducing stochastic depth. Our NOs show superior performance as compared with standard NOs, not only for testing within the training distribution but also for out-of-distribution scenarios. To delve into this observation, we offer an in-depth analysis of the Rademacher complexity associated with our modified models and prove an upper bound tied to their stochastic depth that existing NOs do not satisfy. Furthermore, we obtain a novel out-of-distribution risk bound tailored to Gaussian measures on Banach spaces, again relating stochastic depth with the bound. We conclude by proposing a hypernetwork version of the subfamily of NOs as a surrogate model for the mentioned forward operator.
AbstractList Despite their remarkable success in approximating a wide range of operators defined by PDEs, existing neural operators (NOs) do not necessarily perform well for all physics problems. We focus here on high-frequency waves to highlight possible shortcomings. To resolve these, we propose a subfamily of NOs enabling an enhanced empirical approximation of the nonlinear operator mapping wave speed to solution, or boundary values for the Helmholtz equation on a bounded domain. The latter operator is commonly referred to as the ''forward'' operator in the study of inverse problems. Our methodology draws inspiration from transformers and techniques such as stochastic depth. Our experiments reveal certain surprises in the generalization and the relevance of introducing stochastic depth. Our NOs show superior performance as compared with standard NOs, not only for testing within the training distribution but also for out-of-distribution scenarios. To delve into this observation, we offer an in-depth analysis of the Rademacher complexity associated with our modified models and prove an upper bound tied to their stochastic depth that existing NOs do not satisfy. Furthermore, we obtain a novel out-of-distribution risk bound tailored to Gaussian measures on Banach spaces, again relating stochastic depth with the bound. We conclude by proposing a hypernetwork version of the subfamily of NOs as a surrogate model for the mentioned forward operator.
Author Tricoche, Xavier
Benitez, J. Antonio Lara
de Hoop, Maarten V
Kratsios, Anastasis
Furuya, Takashi
Faucher, Florian
Author_xml – sequence: 1
  givenname: J. Antonio Lara
  surname: Benitez
  fullname: Benitez, J. Antonio Lara
– sequence: 2
  givenname: Takashi
  surname: Furuya
  fullname: Furuya, Takashi
– sequence: 3
  givenname: Florian
  surname: Faucher
  fullname: Faucher, Florian
– sequence: 4
  givenname: Anastasis
  surname: Kratsios
  fullname: Kratsios, Anastasis
– sequence: 5
  givenname: Xavier
  surname: Tricoche
  fullname: Tricoche, Xavier
– sequence: 6
  givenname: Maarten V
  surname: de Hoop
  fullname: de Hoop, Maarten V
BackLink https://doi.org/10.48550/arXiv.2301.11509$$DView paper in arXiv
BookMark eNotj8tOwzAURL2ABRQ-gBX-gYTr2o6TJaqAIlXqpvvoOrYVizQOtsPr66Epq5FmdEY61-RiDKMl5I5BKWop4QHjl_8o1xxYyZiE5oro_ZyL4ArjU45ez9mHEQcafXqjOsyjSdSFSEc7x786TDZiDjHRT597itM0-A5PTKI50NxburXDsQ9D_qH2fV6mG3LpcEj29j9X5PD8dNhsi93-5XXzuCuwUk2hobOgOqY5V6IxVqFgam0cVFxhrbATQqND0Ugutak1WKiVqAQaBOlA8RW5P98uku0U_RHjd3uSbRdZ_gt3SFNj
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
AKZ
EPD
GOX
DOI 10.48550/arxiv.2301.11509
DatabaseName arXiv Computer Science
arXiv Mathematics
arXiv Statistics
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2301_11509
GroupedDBID AKY
AKZ
EPD
GOX
ID FETCH-LOGICAL-a679-b0ce07c1b33749de7a4172df0637a87ac44bafa49535bd8b0e087464ada05f073
IEDL.DBID GOX
IngestDate Mon Jan 08 05:46:35 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a679-b0ce07c1b33749de7a4172df0637a87ac44bafa49535bd8b0e087464ada05f073
OpenAccessLink https://arxiv.org/abs/2301.11509
ParticipantIDs arxiv_primary_2301_11509
PublicationCentury 2000
PublicationDate 2023-01-26
PublicationDateYYYYMMDD 2023-01-26
PublicationDate_xml – month: 01
  year: 2023
  text: 2023-01-26
  day: 26
PublicationDecade 2020
PublicationYear 2023
Score 1.8716741
SecondaryResourceType preprint
Snippet Despite their remarkable success in approximating a wide range of operators defined by PDEs, existing neural operators (NOs) do not necessarily perform well...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Learning
Computer Science - Numerical Analysis
Mathematics - Numerical Analysis
Statistics - Machine Learning
Title Out-of-distributional risk bounds for neural operators with applications to the Helmholtz equation
URI https://arxiv.org/abs/2301.11509
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV07T8NADLZoJxYEAlSeuoH14JpccsmIoKUTHejQrbqHT2KgKU2KEL8eOwmChdVn3WDr7E-27zPATWqyLAkYZWa0lrr0uSy9QhnH3vo0oreR6x2zF_O8LB4nTJMjfv7C2O3n60fHD-zqO8LH41vGLOUABknCI1tP82XXnGypuHr9Xz3CmK3oT5KYHsJBj-7EfeeOI9jD9TG4-a6RVZSBOWr79VKkxEPdwvFWo1oQchTMLEniaoNt57sWXCIVfxvMoqkE4TVBqeKNglbzJfC9Y-o-gcV0sniYyX61gbS5KaVTHpXxY5emRpcBjdUEJEIkvGBsYazX2tloefYzc6FwClVhdK5tsCqL9CpPYbiu1jgCkSfGZTFFnzKXnvIleromUAzDooi5P4NRa5DVpmOvWLGtVq2tzv8_uoB93qvOtYYkv4Rhs93hFQzqsLtuXfANsRSHsA
link.rule.ids 228,230,782,887
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Out-of-distributional+risk+bounds+for+neural+operators+with+applications+to+the+Helmholtz+equation&rft.au=Benitez%2C+J.+Antonio+Lara&rft.au=Furuya%2C+Takashi&rft.au=Faucher%2C+Florian&rft.au=Kratsios%2C+Anastasis&rft.date=2023-01-26&rft_id=info:doi/10.48550%2Farxiv.2301.11509&rft.externalDocID=2301_11509