Search Results
-
Convergence rates of training deep neural networks via alternating minimization methods
Training deep neural networks (DNNs) is an important and challenging optimization problem in machine learning due to its non-convexity and...
-
Learning Velocity Model for Complex Media with Deep Convolutional Neural Networks
Abstract: The paper considers the problem of velocity model acquisition for complex media based on boundary measurements. The acoustic model is used...
-
Deep limits of residual neural networks
Neural networks have been very successful in many applications; we often, however, lack a theoretical understanding of what the neural networks are...
-
Specialized Pre-Training of Neural Networks on Synthetic Data for Improving Paraphrase Generation
Paraphrase generation is a fundamental problem in natural language processing. Due to the significant success of transfer learning, the “pre-training...
-
On the approximation of rough functions with deep neural networks
The essentially non-oscillatory (ENO) procedure and its variant, the ENO-SR procedure, are very efficient algorithms for interpolating...
-
Learning spiking neuronal networks with artificial neural networks: neural oscillations
First-principles-based modelings have been extremely successful in providing crucial insights and predictions for complex biological functions and...
-
Stochastic perturbation of subgradient algorithm for nonconvex deep neural networks
Choosing a learning rate is a necessary part of any subgradient method optimization. With deeper models such as convolutional neural networks of...
-
Limitations of neural network training due to numerical instability of backpropagation
We study the training of deep neural networks by gradient descent where floating-point arithmetic is used to compute the gradients. In this framework...
-
Approximation of functions from Korobov spaces by deep convolutional neural networks
The efficiency of deep convolutional neural networks (DCNNs) has been demonstrated empirically in many practical applications. In this paper, we...
-
Loss Function Dynamics and Landscape for Deep Neural Networks Trained with Quadratic Loss
Abstract: Knowledge of the loss landscape geometry makes it possible to successfully explain the behavior of neural networks, the dynamics of their...
-
Discovery of Governing Equations with Recursive Deep Neural Networks
Model discovery based on existing data has been one of the major focuses of mathematical modelers for decades. Despite tremendous achievements in...
-
Improved Architectures and Training Algorithms for Deep Operator Networks
Operator learning techniques have recently emerged as a powerful tool for learning maps between infinite-dimensional Banach spaces. Trained under...
-
Solving Parametric Partial Differential Equations with Deep Rectified Quadratic Unit Neural Networks
Implementing deep neural networks for learning the solution maps of parametric partial differential equations (PDEs) turns out to be more efficient...
-
Improved Deep Neural Networks with Domain Decomposition in Solving Partial Differential Equations
An improved neural networks method based on domain decomposition is proposed to solve partial differential equations, which is an extension of the...
-
On the regularized risk of distributionally robust learning over deep neural networks
In this paper, we explore the relation between distributionally robust learning and different forms of regularization to enforce robustness of deep...
-
Using Deep Neural Networks for Detecting Spurious Oscillations in Discontinuous Galerkin Solutions of Convection-Dominated Convection–Diffusion Equations
Standard discontinuous Galerkin finite element solutions to convection-dominated convection–diffusion equations usually possess sharp layers but also...
-
Mesh-Informed Neural Networks for Operator Learning in Finite Element Spaces
Thanks to their universal approximation properties and new efficient training strategies, Deep Neural Networks are becoming a valuable tool for the...
-
Multioutput FOSLS Deep Neural Network for Solving Allen–Cahn Equation
Abstract: This paper utilizes feed-forward neural networks to approximate solutions and their scaled gradients of the Allen–Cahn equation. A...
-
How does momentum benefit deep neural networks architecture design? A few case studies
We present and review an algorithmic and theoretical framework for improving neural network architecture design via momentum. As case studies, we...