Abstract

We present a data-driven workflow to biological tissue modeling, which aims to predict the displacement field based on digital image correlation (DIC) measurements under unseen loading scenarios, without postulating a specific constitutive model form nor possessing knowledge of the material microstructure. To this end, a material database is constructed from the DIC displacement tracking measurements of multiple biaxial stretching protocols on a porcine tricuspid valve anterior leaflet, with which we build a neural operator learning model. The material response is modeled as a solution operator from the loading to the resultant displacement field, with the material microstructure properties learned implicitly from the data and naturally embedded in the network parameters. Using various combinations of loading protocols, we compare the predictivity of this framework with finite element analysis based on three conventional constitutive models. From in-distribution tests, the predictivity of our approach presents good generalizability to different loading conditions and outperforms the conventional constitutive modeling at approximately one order of magnitude. When tested on out-of-distribution loading ratios, the neural operator learning approach becomes less effective. To improve the generalizability of our framework, we propose a physics-guided neural operator learning model via imposing partial physics knowledge. This method is shown to improve the model's extrapolative performance in the small-deformation regime. Our results demonstrate that with sufficient data coverage and/or guidance from partial physics constraints, the data-driven approach can be a more effective method for modeling biological materials than the traditional constitutive modeling.

1 Introduction

For many decades, constitutive models based on continuum mechanics have been commonly employed for modeling the mechanical responses of soft biological tissues. In Ref. [1], the seminal phenomenological constitutive models were developed and later employed for the modeling of soft tissues, including the iris [2], cardiac heart valves [35], arterial vessels [6], and the skin [7]. In the constitutive modeling approaches, a strain energy density function is predefined with a specific functional form. Then, the material parameters are calibrated through an inverse method or analytical stress–strain fitting. The descriptive power of these models is often restricted to certain deformation modes/strain ranges, which might lead to limited predictivity and generalizability [810].

To circumvent such a limitation, data-driven computing has been considered in recent years as an alternative for modeling the mechanical response of biological tissues [9,1113]. Unlike the traditional material identification techniques in constitutive modeling, data-driven approaches directly integrate material identification with the modeling procedures, and hence do not require a predefined constitutive model form. In Ref. [11], a fully convolutional neural network was trained based on synthetic datasets, to estimate a displacement field of material points in the simulated liver organ. In Ref. [14], Miñano et al. construct the constitutive law for soft tissue damage by solving the system of linear equations consisting of coefficients of shape functions, rather than nonlinear fitting to a predefined model. In Ref. [9], a local convexity data-driven computational framework was developed that couples manifold learning with nonlinear elasticity, for modeling a representative porcine mitral (heart) valve posterior leaflet's stress–strain data. This framework was further extended to an auto-embedding data-driven approach [12] to infer the underlying low-dimensional embedding representation of the material database. In Ref. [13], a neural network was developed to learn the mechanical behavior of porcine and murine skin from biaxial testing data by inferring the relationship between the isochoric strain invariants and the value of strain energy, as well as the strain energy derivatives. Despite these advances, data-driven methods on soft tissue modeling are mostly focusing on the identification of stress–strain and/or energy–strain relationships for a homogenized material model, and are thus not capable to capture the effects of material spatial heterogeneity. For example, the lack of considering the soft tissue heterogeneity could induce large errors in the predictions of tissue displacements and stresses [15].

Alternatively, there has been significant progress in the development of deep neural networks (NNs) for heterogeneous material modeling [1627]. Among these works, we focus on the neural operator learning approach [2227], which learns the maps between the inputs of a dynamical system and its state, so that the network serves as a surrogate for a solution operator. Compared with the classical NNs, the most notable advantage of neural operators is their generalizability to different input instances, rendering a computing advantage on prediction efficiency—once the neural operator is trained, solving for a new instance of the input parameter only requires a forward pass of the network. In Refs. [2830], neural operators have been successfully applied to modeling the unknown physics law of homogeneous materials. In Refs. [2527,31], neural operators were used as a solution surrogate for Darcy's flow in a heterogeneous porous medium with a known microstructure field. In our previous work [22], an implicit neural operator architecture, namely, the implicit Fourier neural operator (IFNO), was proposed to model heterogeneous material responses without using any predefined constitutive models or microstructure measurements. In particular, we have investigated the applicability of learning a material model for a latex material directly from digital image correlation (DIC) measurements and show that the learned solution operators substantially outperform the conventional constitutive models such as the generalized Mooney–Rivlin model.

To the best of our knowledge, the neural operator learning approaches have not been applied to soft tissue biomechanics. Moreover, the effectiveness of neural operator learning methods in extrapolation to small and large deformation regimes has yet to be systematically examined. To achieve these goals, in this work we propose to advance the current data-driven methods of soft tissue modeling by extending the neural operator learning approach. In particular, we employ the IFNO to learn the material model from DIC measurements on a representative tricuspid valve anterior leaflet (TVAL) specimen from a porcine heart and assess its predictability on unseen and out-of-distribution loading scenarios. To further improve the generalizability of the proposed framework, we also infuse partial physics knowledge via a soft penalty constraint to obtain a novel physics-guided neural operator learning framework. This method is shown to improve the extrapolative performance of our model in the small deformation regime.

The remainder of this paper is organized as follows. In Sec. 2, we introduce our data-driven computing paradigm based on the neural operator learning method, which integrates material identification, modeling procedures, and material response prediction into one unified learning framework. In particular, a stable deep layer architecture, i.e., the IFNO, is introduced in Sec. 2.2 and incorporated with partial physics knowledge in Sec. 2.3. In Sec. 3, we introduce our experimental setting on a representative TVAL specimen. Four study scenarios, considering different sets of experimental data for model training and predictions, are used to examine the in-distribution and out-of-distribution predictivity of the proposed IFNO method. The effectiveness of the IFNO approach is also compared with finite element simulation results based on three constitutive models. Then, we illustrate the prediction results of the IFNOs and physics-guided IFNOs, and compared their results with the modeling results based on fitted constitutive models in Sec. 4. Finally, we provide a summary of our achieved goals and concluding remarks in Sec. 5.

2 An Integrated Learning Framework

In this section, we first formulate the proposed workflow of data-driven material modeling using the operator learning framework and then introduce the deep neural operator model—the IFNO [22]. Next, we propose to further infuse partial physics knowledge via a soft penalty constraint to guide the training and prediction of the neural operators.

2.1 Neural Operator Learning Methods.

The main objective of this work is to model the mechanical response of a representative soft biological tissue directly from DIC-tracked displacement measurements, without any predefined constitutive model nor knowledge of the tissue microstructure. As depicted in Fig. 1 and Table 1, let us consider a soft biological tissue specimen that is mounted to a biaxial testing system and deforms under external loading. Denoting the region of interest on this specimen as a two-dimensional domain Ω, our aim is to identify the best surrogate operator, that can accurately predict the displacement field u(x),xΩ, given new and unseen loading scenarios. In this work, we model the tissue mechanical response as a quasi-static and hyperelastic problem for simplicity, so the resultant displacement field can be fully determined by a displacement-type loading applied on the domain boundary Ω. Thus, given the Dirichlet-type boundary condition, uD(x) for xΩ, our ultimate goal is to predict the corresponding displacement field u(x),xΩ.

Fig. 1
Problem setup for the proposed data-driven computations: (a) an image of the speckle-patterned porcine TVAL specimen subject to biaxial stretching (the DIC tracking grid is shown above the tissue picture), (b) schematic of a specimen subject to Dirichlet-type boundary conditions, as the corresponding numerical setting of (a), and (c) first Piola–Kirchhoff stresses P11 versus P22 of seven biaxial tension and constrained uniaxial tension testing protocols, with further details provided in Table 1
Fig. 1
Problem setup for the proposed data-driven computations: (a) an image of the speckle-patterned porcine TVAL specimen subject to biaxial stretching (the DIC tracking grid is shown above the tissue picture), (b) schematic of a specimen subject to Dirichlet-type boundary conditions, as the corresponding numerical setting of (a), and (c) first Piola–Kirchhoff stresses P11 versus P22 of seven biaxial tension and constrained uniaxial tension testing protocols, with further details provided in Table 1
Close modal
Table 1

Protocols of the mechanical testing on a representative TVAL specimen

Set IDExperiment protocolmax(λ1)max(λ2)max(P11)max(P22)# of samples
1Biaxial tensions P11:P22=1:11.461.68184.1 kPa165.1 kPa3921
2Biaxial tensions P11:P22=1:0.661.481.63187.1 kPa127.8 kPa3797
3Biaxial tensions P11:P22=1:0.331.521.52186.9 kPa74.1 kPa3539
4Biaxial tensions P11:P22=0.66:11.421.72145.9 kPa188.2 kPa4013
5Biaxial tensions P11:P22=0.33:11.321.7977.9 kPa189.8 kPa4175
6Constrained uniaxial in x, P11:P22=0.05:11.561.0197.9 kPa10.6 kPa3539
7Constrained uniaxial in y, P11:P22=1:0.11.01.8917.2 kPa176.1 kPa3539
Set IDExperiment protocolmax(λ1)max(λ2)max(P11)max(P22)# of samples
1Biaxial tensions P11:P22=1:11.461.68184.1 kPa165.1 kPa3921
2Biaxial tensions P11:P22=1:0.661.481.63187.1 kPa127.8 kPa3797
3Biaxial tensions P11:P22=1:0.331.521.52186.9 kPa74.1 kPa3539
4Biaxial tensions P11:P22=0.66:11.421.72145.9 kPa188.2 kPa4013
5Biaxial tensions P11:P22=0.33:11.321.7977.9 kPa189.8 kPa4175
6Constrained uniaxial in x, P11:P22=0.05:11.561.0197.9 kPa10.6 kPa3539
7Constrained uniaxial in y, P11:P22=1:0.11.01.8917.2 kPa176.1 kPa3539

The resultant displacement fields, based on digital image correlation, were used in the data-driven computations (P11 and P22 denote the first Piola–Kirchhoff stresses in the x- and y-directions, respectively, and λ1, λ2 are the stretches ratios in these two directions).

Mathematically, let K be the unknown differential operator associated with the momentum balance equation which depends on the unknown tissue microstructure and mechanical properties. For a given boundary condition uD(x), the momentum balance equation and boundary conditions are
(2.1)
Hence, our goal is to provide a surrogate solution operator for Eq. (2.1) as a mapping between any arbitrary uD and the corresponding material response u. To this end, we propose to embrace the descriptive power of NNs, and develop a data-driven neural operator with its input being uD(x) and its output being the displacement field u(x), for any xΩ. Given a collection of observed function pairs {(uD)j(x),uj(x)}j=1N from DIC measurements, where the input {(uD)j},j=1,,N is a sequence of boundary displacement loading and G[(uD)j](x)=uj(x) is the corresponding (potentially noisy) displacement field. With neural operator learning, we aim to build an approximation of G by constructing a nonlinear parametric map G[·;θ] in the form of a NN, for some finite dimensional parameter space Θ. Here, θΘ is the set of network architecture parameters to be inferred by solving the minimization problem
(2.2)

In this context, we have formulated the soft tissue response modeling problem as learning the solution operator G of an unknown partial differential equation (PDE) system from the DIC data.

Thus, our goal is to provide a neural operator, i.e., an approximated solution operator G[·;θ]:uDu, that delivers solutions of Eq. (2.1) for any input uD. Compared with the classical PDE solvers and the NN approaches, this is a more challenging task for several reasons. First, in contrast to the classical NN approaches where the solution operator is parameterized between finite dimensional Euclidean spaces [3236], the neural operators are built as mappings between infinite dimensional spaces [25,27,37]. Second, for every new instance of material microstructure and/or loading scenario f, the neural operators require only a forward pass of the network, which implies that the optimization problem (2.2)only needs to be solved once and the resulting NN can be utilized to solve for new and unseen loading scenarios. This property is in contrast to the classical numerical PDE methods [3840] and some machine learning approaches [4145], where the optimization problem needs to be solved for every new instance of the input parameter of a known governing law. Finally, of fundamental importance is the fact that the neural operators can find solution maps regardless of the presence of an underlying PDE and only require the observed data pairs {((uD)j,uj)}j=1N. Therefore, the neural operator learning approach is particularly promising when the mechanical responses are provided by experimental measurements, such as the displacement tracking data from DIC considered in this paper.

2.2 Implicit Fourier Neural Operators.

To provide an efficient, deep, and stable integral neural operator for the solution operator learning problem discovered above, we employ the IFNOs [22]. IFNOs stem from the idea of modeling the solution operator as a fixed point equation that naturally mimics the solution procedure for the displacement/damage fields in materials modeling. The increment between neural network hidden layers is modeled as an integral operator, which is directly parameterized in the Fourier space to facilitate the fast Fourier transformation and accelerated learning techniques for deep networks. As shown in Ref. [22], by learning the material responses directly from data, the material microstructure and properties are learned implicitly and embedded naturally in the network parameters, enabling the prediction of the material displacement for unseen loading conditions.

Figure 2 depicts the NN architecture employed in this work. Two IFNOs are built to predict ux(x) and uy(x), the x and y components of the displacement field, respectively. For each IFNO, the input is a vector function f(x) on Ω that contains information from x and uD(x). Here, we notice that the displacement boundary loading uD(x) is only defined on Ω. To make it well-defined on the whole domain, we employ the zero-padding strategy proposed in Ref. [31], namely, defining f(x):=[x,ũD(x)] where
(2.3)
Fig. 2
The architecture of the proposed physics-guided IFNO (PG-IFNO), which consists of two subnetworks for the prediction of two displacement field components. Each subnetwork starts from the input [x,ũD(x)], then (1) lifts to a high dimensional feature space by the lifting layer P and obtains the first hidden layer representation h(x,0); (2) applies L iterative layers; (3) projects the last hidden layer representation h(x,LΔt) back to the target dimension through a shallow network Q. The optimal network parameter θ* is obtained by minimizing the hybrid loss function defined as the weighted sum of the data-driven loss, Ldata, and the physics constraint loss, Lphysics.
Fig. 2
The architecture of the proposed physics-guided IFNO (PG-IFNO), which consists of two subnetworks for the prediction of two displacement field components. Each subnetwork starts from the input [x,ũD(x)], then (1) lifts to a high dimensional feature space by the lifting layer P and obtains the first hidden layer representation h(x,0); (2) applies L iterative layers; (3) projects the last hidden layer representation h(x,LΔt) back to the target dimension through a shallow network Q. The optimal network parameter θ* is obtained by minimizing the hybrid loss function defined as the weighted sum of the data-driven loss, Ldata, and the physics constraint loss, Lphysics.
Close modal
Then, we lift the input f(·) to a representation (feature) h(·,0) of dimension d, that corresponds to the first network layer. For the consistency of notation, we label the first argument of h as the space (the set of nodes) and the second argument as the time (the set of layers) and define the first network layer as

where Pd×4 and pd are trainable parameters.

Second, we denote the lth network representation by h(x,lΔt), and formulate the NN architecture in an iterative manner: h(·,0)h(·,Δt)h(·,2Δt)h(·,T), where h(·,jΔt),j=0,,L:=T/Δt, is a sequence of functions representing the features at each hidden layer, taking values in d. Here, l =0 (or equivalently, t =0) denotes the first hidden layer, whereas t=LΔt=T corresponds to the last hidden layer. In particular, the layer update rule in the IFNOs writes

Here, F and F1 denote the Fourier transform and its inverse, respectively. In practice, F and F1 are computed using the fast Fourier transformation and its inverse to each component of h separately, with the highest modes truncated and keeping only the first k modes. Also, cd defines a constant bias, Wd×d is the weight matrix, and F[κ(·;v)]:=Rd×d×k is a circulant matrix that depends on the convolution kernel κ. We further define σ as the activation function, which is chosen to be the popular rectified linear unit (ReLU) function [46]. Here we note that the definition of t stems from the relationship established between the network update and a time stepping scheme, which enables the employment of the accelerated training strategy for the NN in the deep layer limit.

Third, the output ux(x) or uy(x) is obtained through a projection layer. Taking the IFNO for the prediction of ux(x), for example, we project the last hidden layer representation h(·,T) as

Here, Q1dQ×d,Q21×dQ,q1dQ, and q2 are the trainable parameters.

Denoting the parameters and the corresponding operators associated with ux and uy with the subscripts x and y, respectively, the vanilla version of our neural operator learning architecture without physics constraints (which will be denoted as IFNO in the following context, with a slight abuse of notation) is written as:
Note that the trainable parameters are collected in θ:={Px,px,(Q1)x,(Q2)x,(q1)x,(q2)x,cx,Wx,Rx,Py,py,(Q1)y,(Q2)y,(q1)y,(q2)y,cy,Wy,Ry}, obtained in the vanilla IFNO by minimizing the data-driven loss only
(2.4)

Further, as the layer becomes deep (Δt0), the iterative architecture of the IFNOs can be seen as an analog of discretized ordinary differential equations (ODEs). This allows us to exploit the shallow-to-deep learning technique [22,37,47,48]. Specifically, using the optimal network parameters θ* obtained by training an IFNO of depth L, we initialize the (deeper) L̃-layer network. As such, the optimal parameters learned on shallow networks are considered as (quasi-optimal) initial guesses for the deeper networks—accelerating the training for deeper NNs.

2.3 Physics-Guided Neural Operators.

So far, the neural operator model introduced above fully relies on the data, and hence its predictions may not be consistent with the underlying physical principles. For instance, with the quasi-static and hyperelastic assumption of our model, the specimen has no permanent deformation. In other words, if there is no loading applied to the tissue (i.e., the specimen is at rest), we should observe a zero displacement field in the specimen. However, this is generally not guaranteed in a fully data-driven neural operator model.

In this work, we aim to further leverage the neural operator learning architecture by imposing the underlying physical laws via soft penalty constraints during model training. In particular, considering a specimen at rest, the no-permanent-deformation assumption implies that zero loading should lead to zero displacement, i.e., G[0]=0. To enable the neural operator predictions to be consistent with this physical constraint, we propose a physics-guided neural operator model that minimizes the residual of the above physical law together with the fitting loss from the data. This is achieved by solving the following minimization problem with a hybrid loss function:
(2.5)
where the data-driven loss Ldata is defined in Eq. (2.4), and the physics constraint loss Lphysics is defined as
(2.6)

Here, γ>0 is a penalty parameter to enforce the zero deformation state for material subject to zero loading. Thus, the physics-guided neural operator is anticipated to improve the prediction performance in the small deformation regime. In the following, we will denote this model as the physics-guided IFNO, or the PG-IFNO, in short.

3 Application to Tissue Biomechanics of the Heart Valve Leaflet

We now consider the problem of learning the material response of a TVAL specimen from displacement measurements based on DIC tracking. In this problem, the constitutive equations and material microstructure are both unknown, and the dataset has unavoidable measurement noise. To demonstrate the efficacy of the proposed IFNOs in conjunction with the physics-based enrichment, we further compared our method against three conventional approaches that use constitutive modeling with parameter fittings. The code and dataset have been publicly released at the following link.2

3.1 Tissue Preparation and Mechanical Testing.

In this section, we first introduce the experimental specimen and data acquisition procedure. In brief, we followed our previously established biaxial testing procedure, including acquisition of a healthy porcine heart and retrieval of the TVAL [49,50]. We then sectioned the leaflet tissue into a square specimen and measured the thickness using an optical measuring system (Keyence, Itasca, IL). Afterward, we applied a speckling pattern to the tissue surface using an airbrush and black paint [5153]. The painted specimen was then mounted to a biaxial testing device (Bio-Tester, CellScale, Waterloo, ON, Canada) with an effective testing area of 9 × 9 mm for the following tissue characterizations (Fig. 1(a)).

First, we performed a preconditioning protocol in which the specimen was subjected to ten cycles of biaxial loading and unloading that targeted a first Piola–Kirchhoff stress of 150 kPa to emulate the valve's in vivo functioning conditions [54]. Then, we performed seven protocols of displacement-controlled testing to target various biaxial stresses: P11:P22= 1:1, 1:0.66, 1:0.33, 0.66:1, 0.33:1, 0.05:1, 1:0.1, with the last two protocols for constrained uniaxial stretching in x and y (Fig. 1(c) and Table 1). Here, P11 and P22 denote the first Piola–Kirchhoff stresses in the x- and y-directions, respectively. Each stress ratio was performed for three loading/unloading cycles. Throughout the test, images of the specimen were captured by a CCD camera, and the load cell readings and actuator displacements were recorded at 5 Hz. Due to the use of displacement-controlled testing, we observed mild deviations from the target stresses (see Table 1).

After testing, the acquired images were analyzed using the DIC module of the Bio-Tester's software. A 5.5×5.5 mm domain in the central region of the TVAL specimen was selected since the speckling pattern was more uniform and could yield more reliable node tracking (see Figs. 1(a) and 1(b)). The pixel coordinate locations of the DIC-tracked grid were then exported for use in the subsequent study scenarios. Based on the tracked coordinates, we constructed two numerical testing datasets: (i) an original dataset obtained directly from the experimental measurements, and (ii) a smoothed dataset where moving least-squares (MLS) smoothing was performed for the nodal displacements.

To generate the displacement fields uoriginal(x) for the original samples, we subtracted each material point location with its initial location on the first sample of each protocol, and the boundary displacement was obtained by enforcing uoriginal(x) on the boundary nodes. Next, to construct the smoothed samples for the ith material point, xi=(xi,yi), we employed a two-dimensional MLS shape function Ψi to reconstruct the smoothed displacement field: usmooth(xi,yi)=k=1NPΨk(xi,yi)uoriginal(xk,yk), based on the unsmoothed displacement vector of the NP numbers of points in the neighborhood of xi. For further details regarding the MLS shape function and the smoothing procedure, we refer interested readers to Refs. [22], [55], and [56]. Both the smoothed and the original datasets have 26,523 total time instants (samples), denoted as Dsmooth={(uD)jsmooth,ujsmooth}j=126,523 and Doriginal={(uD)joriginal,ujoriginal}j=126,523, respectively. Finally, to create a structured grid for the proposed IFNOs, we further applied a cubic spline interpolation to the displacement field on a structured 21 × 21 node grid.

3.2 Baseline: Constitutive Modeling.

As the baseline method for comparisons with the proposed neural operator learning methods, we considered a constitutive modeling approach using parameter fitting to the experimental stress–stretch data. In particular, for comparisons with the IFNO approach, we employed three models for the planar stress–strain behavior of the tissue: (i) a Fung-type model, (ii) an invariant-based model, and (iii) a structure-informed model. The Fung-type model was considered with the strain energy density function given by

where c, a1, a2, and a3 are the model parameters to be determined, and E11, E22 are the principle Green–Lagrange strains in the x- and y-directions, respectively.

The invariant-based model was chosen as the Lee–Sacks form [8,57,58], with the strain energy density function defined as

Herein, ci (i=0,1,2,3) and w are the model parameters to be determined, w[0,1] denotes the material anisotropy, and I1=tr(C) and I4=m·Cm are the invariant and pseudo-invariant of the right Cauchy–Green deformation C=FTF, respectively. In this study, we consider the direction of the collagen fibers in the reference configuration to be in the circumferential direction (i.e., m=[1,0,0]T).

Finally, for the structure-informed model, we use a simplified ensemble fiber stress–strain relationship, along with a neo-Hookean nonfibrous ground matrix [9,10,59,60], with the strain energy density function
where μm is the neo-Hookean stiffness and p is the penalty term to enforce tissue incompressibility J=det(F)=1 that can be analytically determined by further applying the plane-stress condition [59], Ef=NT(θ)EN(θ) is the fiber strain along N(θ)=[cos(θ),sin(θ),0]T,E=(CI)/2 is the Green–Lagrange strain, and I is the identity tensor. For the fiber stress–strain behavior, we used an exponential model with a terminal stiffness for numerical stability

where c0 and c1 are the material parameters and Eub is threshold fiber strain for the transition to a linear fiber tangent modulus. Finally, we used a Gaussian distribution function, with zero mean and fiber dispersion of σ, for Γθ. Thus, for the structure-informed model there are four parameters to be determined from optimization: μm, c0, c1, and σ, whereas Eub is precalculated as the strain corresponding to a predetermined stress threshold value of 10 MPa for a given pair of c0 and c1.

In this work, constitutive model parameters were obtained by nonlinear least-squares fitting to the biaxial stress–stretch data for the training samples. In brief, the first Piola–Kirchhoff stresses in the x- and y-directions were determined using the specimen thickness Lz, the undeformed edge lengths Lx and Ly, and the measured forces Fx and Fy: P11=Fx/LyLz and P22=Fy/LxLz. The two stretches were calculated as the ratio of the deformed to the undeformed edge lengths. To obtain the optimal parameters for the different model, differential evolution optimization was employed that minimizes the residual mean squared errors in the stress between the experimental data and model prediction [61]. Finally, using the determined model parameters, finite element simulation was performed in Abaqus [62] with the DIC-tracked nodal displacements prescribed as boundary displacement conditions. The relative errors of displacement fields were then evaluated by comparing the finite element solution and the DIC-based measurements. In the following, we will refer to these baseline approaches as the “Fung model” method, the “invariant-based” method, and the “structure-informed” method.

3.3 Numerical Study Scenarios.

Based on the seven mechanical testing protocols listed in Table 1, four study scenarios are considered to evaluate the interpolative and extrapolative performances of the proposed neural operator learning methods. In each study scenario, a subset of the samples was selected to form the training set and to obtain the optimal neural operator by solving (2.2) and (2.6). Then, the displacement field predictions were made for the remaining samples and the results were compared with the ground-truth displacement fields from the DIC measurements, to evaluate the predictivity and generalizability of our proposed methods. Due to the relatively large number of samples, in constitutive modeling approach, it is generally intractable to perform finite element analysis for all 26,523 samples. To reduce the computational cost, although we train both models on samples from all cycles, we only evaluate the training and testing errors for samples in the first loading/unloading cycle of each protocol for the three constitutive modeling approaches. Then, we considered the averaged relative error of displacement, ||upred,juj||L2(Ω)/||uj||L2(Ω), as the error metric, so as to provide a fair comparison between the constitutive modeling and our neural operator learning approaches. Here, uj denotes the jth sample from the DIC measurement, and upred,j is the prediction from either the neural operator or the corresponding baseline constitutive model for this sample.

3.3.1 Study 1.

We mixed all samples from all seven protocols, randomly selected 83% of samples for training, and used the remaining for validation (10% of samples) and testing (7% of samples). In this scenario, we ensured that the boundary conditions of the samples in the testing set are inside the training region. Therefore, with this study, we aimed to investigate the in-distribution predictivity of the proposed method. With this study, we not only investigate the performance of the proposed approach in in-distribution learning tasks but also study the required amount of training data by demonstrating the errors when using different numbers of (randomly) selected training samples.

3.3.2 Study 2.

For this study, we employed protocols 1, 2, and 4 for training and protocols 3, 5, 6, and 7 for testing. In this study and the two studies below, the samples from the second loading/unloading cycle of testing protocols are employed as the validation dataset for the purpose of hyperparameter tuning, while the first cycle is reserved as the test dataset. We notice that the testing protocols are not covered in any of training sets, and they have smaller maximum tensions compared with the training sets. Hence, with this study, we aimed to investigate the performance of the proposed IFNO methods for predicting the out-of-distribution material responses in the small deformation regime.

3.3.3 Study 3.

We used protocols 1, 6, and 7 for training and protocols 2–5 for testing. The protocols considered in testing were not covered in any of the training protocols, although the deformation range of the testing protocols may fall inside the range of the training ones. Hence, we attempted to illustrate the out-of-distribution prediction on the intermediate deformation regime.

3.3.4 Study 4.

We used protocols 2–7 for training, and protocol 1 for prediction. We notice that the equibiaxial tension protocol (P11:P22=1:1) is not covered in any of other sets, and protocol 1 exhibits the largest maximum tensions among all the sets. Hence, with this study, we aimed to investigate the out-of-distribution predictivity in the large deformation regime of the proposed method.

4 Results and Discussions

In this section, we illustrate the performance of the proposed neural operator learning approaches. All our numerical experiments were performed on a machine with a 2.8 GHz 8-core CPU and a single Nvidia (Santa Clara, CA) RTX 3060 graphics processing unit, using a Pytorch implementation modified from the package provided in Ref. [27]. The optimization was performed with the Adam optimizer. For all IFNOs, we set the dimension of h as d =16 and the number of truncated Fourier modes as k=8×8, with L =12 hidden layers. The network was trained with the shallow-to-deep training procedure: we initialized the L layer network parameters from the (L/2) layer IFNOs model. For each depth L, we trained the network for 1000 epochs with a learning rate of 3×103, then decreased the learning rate with a ratio of 0.5 every 100 epochs. For all PG-IFNOs we took the penalty parameter γ=1.0, although we noted that this parameter can be further hand-tuned or optimized to potentially achieve a better performance.

4.1 Study 1: In-Distribution Prediction.

To verify the model's predictivity for in-distribution learning tasks, in this study we randomly selected 83% of the samples of all protocols to form the training set and then built the vanilla IFNO model and the three baseline constitutive models based on this common training set. Figure 3 (top) shows the relative displacement errors when using different amounts of training samples, and the samplewise errors for each model are provided in Fig. 3 (bottom). When comparing the results between the original dataset and the smoothed dataset, one can observe that their samplewise errors present a similar trend, while the smoothing procedure slightly improves the prediction accuracy for both the IFNO and three baseline models. Probably unsurprisingly, from the left of Fig. 3 (top), one can see that the accuracy of the IFNO improves when using more training samples. In particular, the test error decreases with a convergence rate of around O(N0.34), when the training dataset size, N, increases. With only 45 samples, the IFNO achieves a comparable accuracy as the three constitutive models trained on all 22,000 samples. When using all 22,000 measurements in both models, the IFNO outperforms the conventional constitutive modeling approaches by around one order of magnitude, on both the original and smoothed datasets. To provide further insights into this comparison, in Fig. 4 we visualized both the x- and y-displacement solutions and the prediction errors obtained with the IFNO and the structure-informed model (the best baseline model) on two test samples, which correspond to the large deformation (sample #2) and small deformation (sample #1) representatives, respectively. The structure-informed model, which considered the homogenized stress–strain at one material point (i.e., the center of the specimen) due to limited information about the spatial variation in the stress measurement, failed to capture material heterogeneity and hence exhibited large prediction errors in the interior region of the TVAL specimen domain. This observation confirms the importance of capturing the material heterogeneity and verifies the capability of the IFNOs in heterogeneous material modeling.

Fig. 3
Error comparisons of different models—study 1: in-distribution prediction with training set on 83% of randomly selected samples. Top: relative errors for the displacement field prediction on the training and test datasets. Left: errors when using different amounts of (original) training samples in the IFNO model. Right: results of the IFNO using 45 training samples and all training samples, and their comparison with three baseline models using all training samples. We highlight the model with the best prediction accuracy in bold. Bottom: samplewise error comparison on all biaxial testing protocol sets. Results from the original dataset are on left and results from the smoothed dataset are on right.
Fig. 3
Error comparisons of different models—study 1: in-distribution prediction with training set on 83% of randomly selected samples. Top: relative errors for the displacement field prediction on the training and test datasets. Left: errors when using different amounts of (original) training samples in the IFNO model. Right: results of the IFNO using 45 training samples and all training samples, and their comparison with three baseline models using all training samples. We highlight the model with the best prediction accuracy in bold. Bottom: samplewise error comparison on all biaxial testing protocol sets. Results from the original dataset are on left and results from the smoothed dataset are on right.
Close modal
Fig. 4
Visualization of the most accurate baseline constitutive model fitting and IFNO performances on two representative test samples in (a) the original dataset and (b) the smoothed dataset—study 1 (corresponding to the two representative test samples highlighted in Fig. 3—bottom)
Fig. 4
Visualization of the most accurate baseline constitutive model fitting and IFNO performances on two representative test samples in (a) the original dataset and (b) the smoothed dataset—study 1 (corresponding to the two representative test samples highlighted in Fig. 3—bottom)
Close modal

4.2 Study 2: Out-of-Distribution Prediction on the Small Deformation Regime.

In this study, three protocols with the largest tensions (i.e., sets 1, 2, and 4) were used for training, and the other four protocols were used for prediction validation (sets 3, 5, 6, and 7 as listed in Table 1). Since the prediction sets are with a different biaxial loading ratio that is unseen from the training samples, this is an extrapolative learning task in the small deformation region. Figure 5 (left) provides the relative displacement errors from all models. One can see that compared with the interpolative prediction task in study 1, the extrapolative predictions are less effective for the neural operator. It was, in particular, noted that for the vanilla IFNO model while the training error was at a relatively low error (i.e., the model still possessed good expressivity in sets 1, 2, and 4), the testing error deteriorates by ten times and reached a similar level to the Fung-type model but slightly higher than invariant-based and structure-informed models. Perhaps unsurprisingly, this observation again verifies the sensitivity of machine learning models in extrapolative tasks (see, e.g., Ref. [9]). As shown in Fig. 5 (right), we demonstrate the samplewise errors from the original dataset for each model, and we noticed that the results on the smoothed dataset exhibit a similar trend. One can observe that for the three baseline models, the level of prediction errors is relatively similar for all four testing sets, while large errors are observed in sets 6 and 7 (the sets with the smallest maximum tensions) for the vanilla IFNO model. Those sets are the furthest away from the training set and hence their sample distributions are substantially different from those in the training sets.

Fig. 5
Error comparisons of different models—study 2: out-of-distribution prediction on the small deformation regime. Left: relative errors for the displacement field prediction on the training and test datasets. We highlight the model with the best prediction accuracy in bold. Right: samplewise error comparison on all test sets from the original (unsmoothed) dataset.
Fig. 5
Error comparisons of different models—study 2: out-of-distribution prediction on the small deformation regime. Left: relative errors for the displacement field prediction on the training and test datasets. We highlight the model with the best prediction accuracy in bold. Right: samplewise error comparison on all test sets from the original (unsmoothed) dataset.
Close modal

In this study, we also investigated the performance of the proposed PG-IFNOs. By infusing the no-permanent-deformation constraint, an improvement of the testing error was observed on both original and smoothed dataset. From the samplewise errors, we can tell that the invariant-based and structure informed models outperform PG-IFNO mostly on sets 6 and 7—which are the protocols on the small deformation regime. As we mentioned before, those sets are differ greatly from the training set, it's expected that neural operator would be less accurate. On the set where deformation is closer to the training set, i.e., set 3 and set 5, the PG-IFNO model shows a similar or even better performance than the baseline models. This fact is verified by the solutions and prediction errors on a representative sample in set 5, as depicted in Fig. 6. Compared with structure-informed method, the PG-IFNO model has a lower solution error and captures the material heterogeneity. Hence, these results suggest that sufficient coverage of sample distribution in the training protocol is critical for neural operator learning methods. Even though the constitutive modeling approach can have the lower solution error if the predefined model form exhibits good generalizability like the invariant-based or structured-informed model, the neural operator approach is superior at capturing the heterogeneous features. On these challenging extrapolative learning tasks, incorporating proper physics constraints seems to make the neural operator learning more versatile.

Fig. 6
Visualization of IFNO and PG-IFNO performances on a test sample in (a) the original dataset and (b) the smoothed dataset—study 2 (corresponds to the representative test sample defined in Fig. 5—right)
Fig. 6
Visualization of IFNO and PG-IFNO performances on a test sample in (a) the original dataset and (b) the smoothed dataset—study 2 (corresponds to the representative test sample defined in Fig. 5—right)
Close modal

4.3 Study 3: Out-of-Distribution Prediction on the Intermediate Deformation Regime.

In this study, protocol sets 1, 6, and 7 were used in model training, while the rest of sets (protocols 2–5) were for prediction validation. As such, the prediction sets are still with unseen tension ratios from the training sets, but the deformation range of the testing protocols is within the range of the training ones. In Fig. 7 (left), the relative displacement errors are provided. We can see that the testing errors from the IFNOs are still much larger than their respective training errors, due to the out-of-distribution learning nature of this study. However, the prediction error from the IFNOs outperforms all three constitutive models by at least 1.15% and 0.67% on the original and smoothed dataset, respectively. Therefore, as long as the dataset has sufficient coverage of the deformation, neural operator learning methods can be a more effective approach than the traditional constitutive models, even in the more challenging out-of-distribution tasks. To provide further insights into this comparison, Fig. 7 (right) depicts the samplewise errors on the original (unsmoothed) dataset. One can see that the prediction errors are almost uniform among all protocol sets. In this study, the PG-IFNO does not substantially improve the prediction accuracy compared with the original IFNO, possibly because the physics constraint stems from the zero loading case and is more helpful in guiding the prediction in small deformation regimes.

Fig. 7
Error comparisons of different models—study 3: out-of-distribution prediction on the intermediate deformation regime. Left: relative errors for the displacement field prediction on the training and test datasets. We highlight the model with the best prediction accuracy in bold. Right: samplewise error comparison on all test sets from the original (unsmoothed) dataset.
Fig. 7
Error comparisons of different models—study 3: out-of-distribution prediction on the intermediate deformation regime. Left: relative errors for the displacement field prediction on the training and test datasets. We highlight the model with the best prediction accuracy in bold. Right: samplewise error comparison on all test sets from the original (unsmoothed) dataset.
Close modal

4.4 Study 4: Out of Distribution Prediction on the Large Deformation Regime.

In the last comparative study scenario, the experimental protocol for prediction was selected to be the one with the largest maximum tension, with the aim to evaluate the models' extrapolative prediction abilities on the large deformation regime. From Fig. 8, one can see that while the IFNO becomes less effective on both datasets, the Fung-type model exhibits a better fit to both the original and smoothed testing datasets. On the other hand, the physics constraint does not help much in this study. These results suggest that to ensure reliable predictions from neural operator learning methods, with an insufficient coverage of deformation range in the training protocols, a judiciously designed physics constraint for the data range becomes important.

Fig. 8
Error comparisons of different models—study 4: out-of-distribution prediction on the large deformation regime. Left: relative errors for displacement field prediction on the training and test datasets. We highlight the model with the best prediction accuracy in bold. Right: samplewise error comparison on all test sets from the original (unsmoothed) dataset.
Fig. 8
Error comparisons of different models—study 4: out-of-distribution prediction on the large deformation regime. Left: relative errors for displacement field prediction on the training and test datasets. We highlight the model with the best prediction accuracy in bold. Right: samplewise error comparison on all test sets from the original (unsmoothed) dataset.
Close modal

5 Conclusion

In this work, we have applied the neural operator learning method to modeling the mechanical responses of a biological tissue specimen under different loading conditions. Specifically, a data-driven computing workflow has been proposed, which learns the material model directly from the DIC displacement tracking measurements and integrates material identification, modeling procedures, and material response prediction into one unified learning framework. With the proposed neural operator learning, the mechanical response of this tissue specimen can be modeled as a data-driven solution operator from the boundary loading to the resultant displacement field, and the learned model will be applicable to unseen loading conditions. To verify its efficacy on real-world soft tissue response learning tasks which feature spatial heterogeneity, measurement noise, anisotropic and nonlinear behaviors, we have used the proposed workflow to model a porcine heart TVAL specimen with the DIC measurement data collected from biaxial and constrained uniaxial tension tests. In the in-distribution validation case, our proposed neural operator learning method has been shown to significantly outperform the conventional constitutive modeling approach, with its predictions on out-of-distribution learning tasks being less effective. To improve the model generalizability on out-of-distribution tasks, we have further leveraged the neural operator learning method toward physically consistent predictions for tissue at rest and proposed a physics-guided neural operator learning approach. Numerical studies have demonstrated substantial improvements in terms of enhanced generalization performance in the small deformation regime. On the other hand, once the network is fully trained, the prediction of displacement field under a new and unseen loading only requires a forward pass in the neural operator model. When tested on a single CPU core, on average it only takes 0.012 s for the neural operator to predict one sample of the displacement field, while the same task takes 0.537 s in the baseline finite element solver. Hence, these results suggest that with sufficient coverage of training sample distribution and/or properly designed physics constraints, the neural operator learning approach could offer an alternative approach that outperforms the conventional phenomenological model in complex and heterogeneous material modeling tasks on both accuracy and efficiency.

Despite the encouraging results presented herein, numerous questions and potentials require further investigation. For example, although the proposed no-permanent-deformation constraint seems effective in the small deformation regime, it has little impact on improving the prediction accuracy of the large deformation regimes. As a natural extension, we will consider the enforcement of other physics constraints, which would potentially further enhance the performance of the method's extrapolative predictivity. Another natural future extension to be considered is the generalizability to other specimens with a different computational domain and/or microstructure. We point out that in this work, the neural operator model is trained by assuming that the tissue has unchanged microstructure, geometry, and biomechanical properties, leading to the constantly learned network parameter θ. To achieve the generalizability across different specimens, one possible approach is to consider varying θ across different geometries and microstructures and adapt the model using transfer-learning techniques such as the metalearning methods proposed in Ref. [63]. Similarly, translating the currently trained model to whole organ simulations would be another interesting generalization problem. On the other hand, another important next step is to consider other boundary loading scenarios in our learning framework, such as the traction loading problems as demonstrated in the synthetic dataset example in [22]. Moreover, in this study, we have considered the three homogeneous phenomenological models: Fung-type, invariant-based, and structure-informed models as the baseline constitutive modeling, because of their popularity and proven efficacy in soft tissue modeling [1,64,65]. However, we would also like to point out that by considering other constitutive models, such as the Holzapfel model [66], and/or incorporating the material microstructure into the conventional model, one might further improve the accuracy of the conventional constitutive model. Finally, another question arises from the possibility of achieving improved performance by optimizing the penalty parameter γ in the physics-guided hybrid loss function (2.5). It has been shown that an optimized penalty parameter could further enhance the accuracy and trainability of the constrained neural networks (see, e.g., Refs. [6769]). Hence, the performance of PG-IFNO might get further enhanced by designing effective algorithms which select appropriate weights in the hybrid loss function.

Acknowledgment

H. You and Y. Yu would like to acknowledge support from the National Science Foundation under award DMS 1753031 and the AFOSR grant FA9550-22-1-0197. Portions of this research were conducted on Lehigh University's Research Computing infrastructure partially supported by NSF Award 2019035. C. Ross, C.-H. Lee, and M.-C. Hsu would like to acknowledge support from the Presbyterian Health Foundation Team Science Grant. C. Ross was in part supported by the National Science Foundation Graduate Research Fellowship Program (GRF2020307284). M.-C. Hsu was in part supported by the National Institutes of Health under award number R01HL142504.

Funding Data

  • Air Force Office of Scientific Research (Grant No. FA9550-22-1-0197; Funder ID: 10.13039/100000181).

  • National Institutes of Health (Grant No. R01HL142504; Funder ID: 10.13039/100000002).

  • National Science Foundation (Grant Nos. DMS 1753031, 2019035, GRF2020307284; Funder ID: 10.13039/100000001).

  • Presbyterian Health Foundation (Team Science Grant; Funder ID: 10.13039/100001298).

Footnotes

References

1.
Fung
,
Y.
,
Fronek
,
K.
, and
Patitucci
,
P.
,
1979
, “
Pseudoelasticity of Arteries and the Choice of Its Mathematical Expression
,”
Am. J. Physiol. Heart Circ. Physiol.
,
237
(
5
), pp.
H620
H631
.10.1152/ajpheart.1979.237.5.H620
2.
Pant
,
A. D.
,
Dorairaj
,
S. K.
, and
Amini
,
R.
,
2018
, “
Appropriate Objective Functions for Quantifying Iris Mechanical Properties Using Inverse Finite Element Modeling
,”
ASME J. Biomech. Eng.
,
140
(
7
), p.
074502
.10.1115/1.4039679
3.
May-Newman
,
K.
, and
Yin
,
F.
,
1998
, “
A Constitutive Law for Mitral Valve Tissue
,”
ASME J. Biomech. Eng.
,
120
(
1
), pp.
38
47
.10.1115/1.2834305
4.
Prot
,
V.
,
Skallerud
,
B.
, and
Holzapfel
,
G.
,
2007
, “
Transversely Isotropic Membrane Shells With Application to Mitral Valve Mechanics. Constitutive Modelling and Finite Element Implementation
,”
Int. J. Numer. Methods Eng.
,
71
(
8
), pp.
987
1008
.10.1002/nme.1983
5.
Sacks
,
M. S.
,
Zhang
,
W.
, and
Wognum
,
S.
,
2016
, “
A Novel Fibre-Ensemble Level Constitutive Model for Exogenous Cross-Linked Collagenous Tissues
,”
Interface Focus
,
6
(
1
), p.
20150090
.10.1098/rsfs.2015.0090
6.
van den Broek
,
C. N.
,
Van der Horst
,
A.
,
Rutten
,
M.
, and
Van De Vosse
,
F. N.
,
2011
, “
A Generic Constitutive Model for the Passive Porcine Coronary Artery
,”
Biomech. Model. Mechanobiol.
,
10
(
2
), pp.
249
258
.10.1007/s10237-010-0231-9
7.
Bischoff
,
J. E.
,
Arruda
,
E. M.
, and
Grosh
,
K.
,
2000
, “
Finite Element Modeling of Human Skin Using an Isotropic, Nonlinear Elastic Constitutive Model
,”
J. Biomech.
,
33
(
6
), pp.
645
652
.10.1016/S0021-9290(00)00018-X
8.
Lee
,
C.-H.
,
Amini
,
R.
,
Gorman
,
R. C.
,
Gorman
,
J. H.
, III
, and
Sacks
,
M. S.
,
2014
, “
An Inverse Modeling Approach for Stress Estimation in Mitral Valve Anterior Leaflet Valvuloplasty for In-Vivo Valvular Biomaterial Assessment
,”
J. Biomech.
,
47
(
9
), pp.
2055
2063
.10.1016/j.jbiomech.2013.10.058
9.
He
,
Q.
,
Laurence
,
D. W.
,
Lee
,
C.-H.
, and
Chen
,
J.-S.
,
2021
, “
Manifold Learning Based Data-Driven Modeling for Soft Biological Tissues
,”
J. Biomech.
,
117
, p.
110124
.10.1016/j.jbiomech.2020.110124
10.
Lee
,
C.-H.
,
Zhang
,
W.
,
Feaver
,
K.
,
Gorman
,
R. C.
,
Gorman
,
J. H.
, and
Sacks
,
M. S.
,
2017
, “
On the In Vivo Function of the Mitral Heart Valve Leaflet: Insights Into Tissue–Interstitial Cell Biomechanical Coupling
,”
Biomech. Model. Mechanobiol.
,
16
(
5
), pp.
1613
1632
.10.1007/s10237-017-0908-4
11.
Pfeiffer
,
M.
,
Riediger
,
C.
,
Weitz
,
J.
, and
Speidel
,
S.
,
2019
, “
Learning Soft Tissue Behavior of Organs for Surgical Navigation With Convolutional Neural Networks
,”
Int. J. Comput. Assist. Radiol. Surg.
,
14
(
7
), pp.
1147
1155
.10.1007/s11548-019-01965-7
12.
He
,
X.
,
He
,
Q.
, and
Chen
,
J.-S.
,
2021
, “
Deep Autoencoders for Nonlinear Physics-Constrained Data-Driven Computational Framework With Application to Biological Tissue Modeling
,”
AAAI Spring Symposium
, MLPS, Palo Alto, CA, Mar. 22–24.10.1016/j.cma.2021.114034
13.
Tac
,
V.
,
Sree
,
V. D.
,
Rausch
,
M. K.
, and
Tepole
,
A. B.
,
2021
, “
Data-Driven Modeling of the Mechanical Behavior of Anisotropic Soft Biological Tissue
,”
Eng. With Comp.
, pp.
1
16
.10.1007/s00366-022-01733-3
14.
Miñano
,
M.
, and
Montáns
,
F. J.
,
2018
, “
WYPiWYG Damage Mechanics for Soft Materials: A Data-Driven Approach
,”
Arch. Comput. Methods Eng.
,
25
(
1
), pp.
165
193
.10.1007/s11831-017-9233-4
15.
Howell
,
B.
, and
McIntyre
,
C. C.
,
2017
, “
Role of Soft-Tissue Heterogeneity in Computational Models of Deep Brain Stimulation
,”
Brain Stimul.
,
10
(
1
), pp.
46
50
.10.1016/j.brs.2016.09.001
16.
Wang
,
K.
, and
Sun
,
W.
,
2018
, “
A Multiscale Multi-Permeability Poroplasticity Model Linked by Recursive Homogenizations and Deep Learning
,”
Comput. Methods Appl. Mech. Eng.
,
334
, pp.
337
380
.10.1016/j.cma.2018.01.036
17.
He
,
Q.
,
Barajas-Solano
,
D.
,
Tartakovsky
,
G.
, and
Tartakovsky
,
A. M.
,
2020
, “
Physics-Informed Neural Networks for Multiphysics Data Assimilation With Application to Subsurface Transport
,”
Adv. Water Resour.
,
141
, p.
103610
.10.1016/j.advwatres.2020.103610
18.
Tartakovsky
,
A. M.
,
Marrero
,
C. O.
,
Perdikaris
,
P.
,
Tartakovsky
,
G. D.
, and
Barajas-Solano
,
D.
,
2020
, “
Physics-Informed Deep Neural Networks for Learning Parameters and Constitutive Relationships in Subsurface Flow Problems
,”
Water Resour. Res.
,
56
(
5
) p.
e2019WR026731
.10.1029/2019WR026731
19.
Liu
,
Z.
,
Wu
,
C.
, and
Koishi
,
M.
,
2019
, “
A Deep Material Network for Multiscale Topology Learning and Accelerated Nonlinear Modeling of Heterogeneous Materials
,”
Comput. Methods Appl. Mech. Eng.
,
345
, pp.
1138
1168
.10.1016/j.cma.2018.09.020
20.
Yang
,
H.
,
Guo
,
X.
,
Tang
,
S.
, and
Liu
,
W. K.
,
2019
, “
Derivation of Heterogeneous Material Laws Via Data-Driven Principal Component Expansions
,”
Comput. Mech.
,
64
(
2
), pp.
365
379
.10.1007/s00466-019-01728-w
21.
Garbrecht
,
K.
,
Aguilo
,
M.
,
Sanderson
,
A.
,
Rollett
,
A.
,
Kirby
,
R. M.
, and
Hochhalter
,
J.
,
2021
, “
Interpretable Machine Learning for Texture-Dependent Constitutive Models With Automatic Code Generation for Topological Optimization
,”
Integr. Mater. Manuf. Innov.
,
10
(
3
), pp.
373
392
.10.1007/s40192-021-00231-6
22.
You
,
H.
,
Zhang
,
Q.
,
Ross
,
C. J.
,
Lee
,
C.-H.
, and
Yu
,
Y.
,
2022
, “
Learning Deep Implicit Fourier Neural Operators (IFNOs) With Applications to Heterogeneous Material Modeling
,”
Comput. Methods Appl. Mech. Eng.
,
398
, p.
115296
.10.1016/j.cma.2022.115296
23.
Lu
,
L.
,
Jin
,
P.
, and
Karniadakis
,
G. E.
,
2019
, “
Deeponet: Learning Nonlinear Operators for Identifying Differential Equations Based on the Universal Approximation Theorem of Operators
,” arXiv preprint
arXiv:1910.03193
.10.48550/arXiv.1910.03193
24.
Lu
,
L.
,
Jin
,
P.
,
Pang
,
G.
,
Zhang
,
Z.
, and
Karniadakis
,
G. E.
,
2021
, “
Learning Nonlinear Operators Via DeepONet Based on the Universal Approximation Theorem of Operators
,”
Nat. Mach. Intell.
,
3
(
3
), pp.
218
229
.10.1038/s42256-021-00302-5
25.
Li
,
Z.
,
Kovachki
,
N.
,
Azizzadenesheli
,
K.
,
Liu
,
B.
,
Bhattacharya
,
K.
,
Stuart
,
A.
, and
Anandkumar
,
A.
,
2020
, “
Neural Operator: Graph Kernel Network for Partial Differential Equations
,” arXiv preprint
arXiv:2003.03485
.https://www.researchgate.net/publication/340439031_Neural_Operator_Graph_Kernel_Network_for_Partial_Differential_Equations
26.
Li
,
Z.
,
Kovachki
,
N.
,
Azizzadenesheli
,
K.
,
Liu
,
B.
,
Stuart
,
A.
,
Bhattacharya
,
K.
, and
Anandkumar
,
A.
,
2020
, “
Multipole Graph Neural Operator for Parametric Partial Differential Equations
,”
Adv. Neural Inf. Process. Syst.
,
33
, pp.
6755
6766
.https://dl.acm.org/doi/abs/10.5555/3495724.3496291
27.
Li
,
Z.
,
Kovachki
,
N. B.
,
Azizzadenesheli
,
K.
,
Bhattacharya
,
K.
,
Stuart
,
A.
,
Anandkumar
,
A.
, et al.,
2020
, “
Fourier Neural Operator for Parametric Partial Differential Equations
,”
International Conference on Learning Representations
, Addis Ababa, Ethiopia, Apr. 26–30.10.48550/arXiv.2010.08895
28.
Yin
,
M.
,
Ban
,
E.
,
Rego
,
B. V.
,
Zhang
,
E.
,
Cavinato
,
C.
,
Humphrey
,
J. D.
, and
Em Karniadakis
,
G.
,
2022
, “
Simulating Progressive Intramural Damage Leading to Aortic Dissection Using DeepONet: An Operator–Regression Neural Network
,”
J. R. Soc. Interface
,
19
(
187
), p.
20210670
.10.1098/rsif.2021.0670
29.
Goswami
,
S.
,
Yin
,
M.
,
Yu
,
Y.
, and
Karniadakis
,
G. E.
,
2022
, “
A Physics-Informed Variational DeepONet for Predicting Crack Path in Quasi-Brittle Materials
,”
Comput. Methods Appl. Mech. Eng.
,
391
, p.
114587
.10.1016/j.cma.2022.114587
30.
Yin
,
M.
,
Zhang
,
E.
,
Yu
,
Y.
, and
Karniadakis
,
G. E.
,
2022
, “
Interfacing Finite Elements With Deep Neural Operators for Fast Multiscale Modeling of Mechanics Problems
,”
Comput. Methods Appl. Mech. Eng.
, epub, p.
115027
.10.1016/j.cma.2022.115027
31.
Lu
,
L.
,
Meng
,
X.
,
Cai
,
S.
,
Mao
,
Z.
,
Goswami
,
S.
,
Zhang
,
Z.
, and
Karniadakis
,
G. E.
,
2021
, “
A Comprehensive and Fair Comparison of Two Neural Operators (With Practical Extensions) Based on Fair Data
,”
Comp. Meth. App. Mech. Eng.
, 393, p.
114778
.10.1016/j.cma.2022.114778
32.
Guo
,
X.
,
Li
,
W.
, and
Iorio
,
F.
,
2016
, “
Convolutional Neural Networks for Steady Flow Approximation
,”
Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
, San Francisco, CA, Aug. 13–17, pp.
481
490
.10.1145/2939672.2939738
33.
Zhu
,
Y.
, and
Zabaras
,
N.
,
2018
, “
Bayesian Deep Convolutional Encoder–Decoder Networks for Surrogate Modeling and Uncertainty Quantification
,”
J. Comput. Phys.
,
366
, pp.
415
447
.10.1016/j.jcp.2018.04.018
34.
Adler
,
J.
, and
Öktem
,
O.
,
2017
, “
Solving Ill-Posed Inverse Problems Using Iterative Deep Neural Networks
,”
Inverse Probl.
,
33
(
12
), p.
124007
.10.1088/1361-6420/aa9581
35.
Bhatnagar
,
S.
,
Afshar
,
Y.
,
Pan
,
S.
,
Duraisamy
,
K.
, and
Kaushik
,
S.
,
2019
, “
Prediction of Aerodynamic Flow Fields Using Convolutional Neural Networks
,”
Comput. Mech.
,
64
(
2
), pp.
525
545
.10.1007/s00466-019-01740-0
36.
Khoo
,
Y.
,
Lu
,
J.
, and
Ying
,
L.
,
2021
, “
Solving Parametric PDE Problems With Artificial Neural Networks
,”
Eur. J. Appl. Math.
,
32
(
3
), pp.
421
435
.10.1017/S0956792520000182
37.
You
,
H.
,
Yu
,
Y.
,
D'Elia
,
M.
,
Gao
,
T.
, and
Silling
,
S.
,
2022
, “
Nonlocal Kernel Network (NKN): a Stable and Resolution-Independent Deep Neural Network
,”
J. Comp. Phy.
, 469, p.
111536
.10.1016/j.jcp.2022.111536
38.
LeVeque
,
R. J.
,
2007
,
Finite Difference Methods for Ordinary and Partial Differential Equations: Steady-State and Time-Dependent Problems
,
SIAM
, Philadelphia, PA.
39.
Zienkiewicz
,
O. C.
,
Taylor
,
R. L.
,
Nithiarasu
,
P.
, and
Zhu
,
J.
,
1977
,
The Finite Element Method
, Vol.
3
,
McGraw-Hill
,
London, UK
.
40.
Karniadakis
,
G.
, and
Sherwin
,
S.
,
2005
,
Spectral/HP Element Methods for Computational Fluid Dynamics
,
OUP Oxford, Oxford, UK
.
41.
Raissi
,
M.
,
Perdikaris
,
P.
, and
Karniadakis
,
G. E.
,
2019
, “
Physics-Informed Neural Networks: A Deep Learning Framework for Solving Forward and Inverse Problems Involving Nonlinear Partial Differential Equations
,”
J. Comput. Phys.
,
378
, pp.
686
707
.10.1016/j.jcp.2018.10.045
42.
Weinan
,
E.
, and
Yu
,
B.
,
2018
, “
The Deep Ritz Method: A Deep Learning-Based Numerical Algorithm for Solving Variational Problems
,”
Commun. Math. Stat.
,
6
(
1
), pp.
1
12
.10.1007/s40304-018-0127-z
43.
Bar
,
L.
, and
Sochen
,
N.
,
2019
, “
Unsupervised Deep Learning Algorithm for PDE-Based Forward and Inverse Problems
,” arXiv preprint
arXiv:1904.05417
.https://www.researchgate.net/publication/332368969_Unsupervised_Deep_Learning_Algorithm_for_PDE-based_Forward_and_Inverse_Problems
44.
Smith
,
J. D.
,
Azizzadenesheli
,
K.
, and
Ross
,
Z. E.
,
2021
, “
Eikonet: Solving the Eikonal Equation With Deep Neural Networks
,”
IEEE Trans. Geosci. Remote Sensing
,
59
(
12
), pp.
10685
10696
.10.1109/TGRS.2020.3039165
45.
Pan
,
S.
, and
Duraisamy
,
K.
,
2020
, “
Physics-Informed Probabilistic Learning of Linear Embeddings of Nonlinear Dynamics With Guaranteed Stability
,”
SIAM J. Appl. Dyn. Syst.
,
19
(
1
), pp.
480
509
.10.1137/19M1267246
46.
Agarap
,
A. F.
,
2018
, “
Deep Learning Using Rectified Linear Units (ReLU)
,” arXiv preprint
arXiv:1803.08375
.https://www.researchgate.net/publication/323956667_Deep_Learning_using_Rectified_Linear_Units_ReLU
47.
Haber
,
E.
,
Ruthotto
,
L.
,
Holtham
,
E.
, and
Jun
,
S.-H.
,
2018
, “
Learning Across Scales—Multiscale Methods for Convolution Neural Networks
,”
Proceedings of AAAI Conference Artificial Intelligence
,
32
, New Orleans, LA, Feb. 2–7.10.1609/aaai.v32i1.11680
48.
Modersitzki
,
J.
,
2009
,
FAIR: Flexible Algorithms for Image Registration
,
Siam
, Philadelphia, PA.
49.
Ross
,
C. J.
,
Laurence
,
D. W.
,
Richardson
,
J.
,
Babu
,
A. R.
,
Evans
,
L. E.
,
Beyer
,
E. G.
,
Childers
,
R. C.
,
Wu
,
Y.
,
Towner
,
R. A.
,
Fung
,
K.-M.
,
Mir
,
A.
,
Burkhart
,
H. M.
,
Holzapfel
,
G. A.
, and
Lee
,
C.-H.
,
2019
, “
An Investigation of the Glycosaminoglycan Contribution to Biaxial Mechanical Behaviours of Porcine Atrioventricular Heart Valve Leaflets
,”
J. R. Soc. Interface
,
16
(
156
), p.
20190069
.10.1098/rsif.2019.0069
50.
Laurence
,
D.
,
Ross
,
C.
,
Jett
,
S.
,
Johns
,
C.
,
Echols
,
A.
,
Baumwart
,
R.
,
Towner
,
R.
,
Liao
,
J.
,
Bajona
,
P.
,
Wu
,
Y.
, and
Lee
,
C.-H.
,
2019
, “
An Investigation of Regional Variations in the Biaxial Mechanical Properties and Stress Relaxation Behaviors of Porcine Atrioventricular Heart Valve Leaflets
,”
J. Biomech.
,
83
, pp.
16
27
.10.1016/j.jbiomech.2018.11.015
51.
Zhang
,
D. S.
, and
Arola
,
D. D.
,
2004
, “
Applications of Digital Image Correlation to Biological Tissues
,”
J. Biomed. Opt.
,
9
(
4
), pp.
691
699
.10.1117/1.1753270
52.
Lionello
,
G.
, and
Cristofolini
,
L.
,
2014
, “
A Practical Approach to Optimizing the Preparation of Speckle Patterns for Digital-Image Correlation
,”
Meas. Sci. Technol.
,
25
(
10
), p.
107001
.10.1088/0957-0233/25/10/107001
53.
Palanca
,
M.
,
Tozzi
,
G.
, and
Cristofolini
,
L.
,
2016
, “
The Use of Digital Image Correlation in the Biomechanical Area: A Review
,”
Int. Biomech.
,
3
(
1
), pp.
1
21
.10.1080/23335432.2015.1117395
54.
Jett
,
S.
,
Laurence
,
D.
,
Kunkel
,
R.
,
Babu
,
A. R.
,
Kramer
,
K.
,
Baumwart
,
R.
,
Towner
,
R.
,
Wu
,
Y.
, and
Lee
,
C.-H.
,
2018
, “
An Investigation of the Anisotropic Mechanical Properties and Anatomical Structure of Porcine Atrioventricular Heart Valves
,”
J. Mech. Behav. Biomed. Mater.
,
87
, pp.
155
171
.10.1016/j.jmbbm.2018.07.024
55.
Belytschko
,
T.
,
Krongauz
,
Y.
,
Organ
,
D.
,
Fleming
,
M.
, and
Krysl
,
P.
,
1996
, “
Meshless Methods: An Overview and Recent Developments
,”
Comput. Methods Appl. Mech. Eng.
,
139
(
1–4
), pp.
3
47
.10.1016/S0045-7825(96)01078-X
56.
Chen
,
J.-S.
,
Pan
,
C.
,
Wu
,
C.-T.
, and
Liu
,
W. K.
,
1996
, “
Reproducing Kernel Particle Methods for Large Deformation Analysis of Non-Linear Structures
,”
Comput. Methods Appl. Mech. Eng.
,
139
(
1–4
), pp.
195
227
.10.1016/S0045-7825(96)01083-3
57.
Johnson
,
E. L.
,
Laurence
,
D. W.
,
Xu
,
F.
,
Crisp
,
C. E.
,
Mir
,
A.
,
Burkhart
,
H. M.
,
Lee
,
C.-H.
, and
Hsu
,
M.-C.
,
2021
, “
Parameterization, Geometric Modeling, and Isogeometric Analysis of Tricuspid Valves
,”
Comput. Methods Appl. Mech. Eng.
,
384
, p.
113960
.10.1016/j.cma.2021.113960
58.
Kamensky
,
D.
,
Xu
,
F.
,
Lee
,
C.-H.
,
Yan
,
J.
,
Bazilevs
,
Y.
, and
Hsu
,
M.-C.
,
2018
, “
A Contact Formulation Based on a Volumetric Potential: Application to Isogeometric Simulations of Atrioventricular Valves
,”
Comput. Methods Appl. Mech. Eng.
,
330
, pp.
522
546
.10.1016/j.cma.2017.11.007
59.
Fan
,
R.
, and
Sacks
,
M. S.
,
2014
, “
Simulation of Planar Soft Tissues Using a Structural Constitutive Model: Finite Element Implementation and Validation
,”
J. Biomech.
,
47
(
9
), pp.
2043
2054
.10.1016/j.jbiomech.2014.03.014
60.
Lee
,
C.-H.
,
Rabbah
,
J.-P.
,
Yoganathan
,
A. P.
,
Gorman
,
R. C.
,
Gorman
,
J. H.
, and
Sacks
,
M. S.
,
2015
, “
On the Effects of Leaflet Microstructure and Constitutive Model on the Closing Behavior of the Mitral Valve
,”
Biomech. Model. Mechanobiol.
,
14
(
6
), pp.
1281
1302
.10.1007/s10237-015-0674-0
61.
Price
,
K.
,
Storn
,
R. M.
, and
Lampinen
,
J. A.
,
2006
,
Differential Evolution: A Practical Approach to Global Optimization
,
Springer Science & Business Media, Berlin
.
62.
Abaqus
,
G.
,
2011
,
Abaqus 6.11
,
Dassault Systemes Simulia Corporation
,
Providence, RI
.
63.
Zhang
,
L.
,
You
,
H.
, and
Yu
,
Y.
,
2022
, “
MetaNOR: A Meta-Learnt Nonlocal Operator Regression Approach for Metamaterial Modeling
,”
MRS Comm.
, epub.10.1557/s43579-022-00250-0
64.
Sun
,
W.
, and
Sacks
,
M. S.
,
2005
, “
Finite Element Implementation of a Generalized Fung-Elastic Constitutive Model for Planar Soft Tissues
,”
Biomech. Model. Mechanobiol.
,
4
(
2–3
), pp.
190
199
.10.1007/s10237-005-0075-x
65.
Famaey
,
N.
, and
Sloten
,
J. V.
,
2008
, “
Soft Tissue Modelling for Applications in Virtual Surgery and Surgical Robotics
,”
Comput. Methods Biomech. Biomed. Eng.
,
11
(
4
), pp.
351
366
.10.1080/10255840802020412
66.
Holzapfel
,
G. A.
,
2002
, “
Nonlinear Solid Mechanics: A Continuum Approach for Engineering Science
,”
Meccanica
,
37
(
4/5
), pp.
489
490
.10.1023/A:1020843529530
67.
Wang
,
S.
,
Wang
,
H.
, and
Perdikaris
,
P.
,
2021
, “
Learning the Solution Operator of Parametric Partial Differential Equations With Physics-Informed DeepONets
,”
Sci. Adv.
,
7
(
40
), p.
eabi8605
.10.1126/sciadv.abi8605
68.
Wang
,
S.
,
Yu
,
X.
, and
Perdikaris
,
P.
,
2022
, “
When and Why Pinns Fail to Train: A Neural Tangent Kernel Perspective
,”
J. Comput. Phys.
,
449
, p.
110768
.10.1016/j.jcp.2021.110768
69.
McClenny
,
L.
, and
Braga-Neto
,
U.
,
2020
, “
Self-Adaptive Physics-Informed Neural Networks Using a Soft Attention Mechanism
,” arXiv preprint
arXiv:2009.04544
.http://ceur-ws.org/Vol-2964/article_68.pdf