Introduction

Nepal is an earthquake-prone country and its capital is located in the Kathmandu Valley. It is therefore important to accurately assess the seismic hazards there, and ground motion simulations based on accurate velocity structure models in and around the valley are essential for such assessments.

The collision of the Indian plate with the Eurasian plate is causing the Himalayan orogeny, which has formed many valleys and basins, as shown in Fig. 1a. The largest among them is the Kathmandu Valley. This valley is filled with sediments from the Bagmati River (thick blue lines in all panels of Fig. 1) and its tributaries (medium blue lines in Fig. 1b, c). The Paleo-Kathmandu Lake was formed 1 million years ago by the damming of the Bagmati River, and dried 12,000 years ago (Sakai et al. 2016). This geological history has resulted in the velocity structure of the Kathmandu Valley being three-dimensionally complicated.

Fig. 1
figure 1

Topographic maps for Nepal and the Kathmandu Valley (a and b). c is a close-up map around the valley center and airport (dark green area). The large black box and red lines with triangles in a represent a wide area around the valley and MFT, respectively. The light green line labeled X and Y indicates where the cross sections in Fig. 6 are taken. The thick lines in blue indicate the Bagmati River in all maps, while thin lines in medium blue indicate other major rivers in a or tributaries of the river in b and c. The Kathmandu Valley and its internal sediments are surrounded by the basin boundary (brown) and sedimentary boundary (black) of Shrestha et al. (1998) in b. In b and c, the sites for microtremor observations, stations of strong motion seismometers, and locations of boring survey results are plotted with red, magenta, and orange circles, respectively. The violet dots are points of gravity observation, and a seismic reflection survey is conducted along the very thick lines in light green

The collision of the plates also causes the Indian lithosphere to underthrust beneath Nepal along the Main Himalayan Thrust, which reaches the ground surface at the Main Frontal Thrust (MFT; red lines with triangles in Fig. 1a). This underthrusting generates earthquakes and three-dimensionally complicates the velocity structure around the Kathmandu Valley. Therefore, in this study, we construct the three-dimensional velocity structure models in and around the Kathmandu Valley using diverse data sets and information.

Modeling in Kathmandu Valley (data)

In the Kathmandu Valley, Yokoi et al. (2021) performed broadband array observations and analyses of microtremors, and Takai et al. (2021) recorded ground motions from earthquakes with strong motion seismometers. The locations of microtremor observations and strong motion seismometers are plotted in Fig. 1b, c with red and magenta circles, respectively. The details of these observations will be reported in separate papers. The gravity observations were also performed as described by Pradhan et al. (2018). The violet dots in Fig. 1b, c indicate these observation sites including those of Moribayashi and Maruo (1980). Kawasaki et al. (2019) performed a seismic reflection survey for the first time in the Kathmandu Valley along a line extending adjacent to the runway of the Tribhuvan International Airport (TIA). The airport and survey line are shown in Fig. 1b, c with dark green areas and light green lines. The details of the gravity and seismic reflection surveys will also be reported in other separate papers.

We first evaluated the Rayleigh-wave phase velocities and horizontal-to-vertical spectral ratios of microtremors, which were provided by Yokoi et al. (2021), for S-wave velocity (VS) structures using the joint inversion method of Suzuki and Yamanaka (2010) and the assumption of Arai and Tokimatsu (2000). The assumption is that microtremors are the summation of surface waves from point sources distributed at all directions and distances. Microtremor data do not have resolution for deep parts of the VS structure. Therefore, we set the VS of the two layers in the basement as 2.2 and 2.6 km/s referring to crustal structures in the Eurasian plate, although we determined both the VS and the thicknesses of the five sedimentary layers above them. Additionally, we assumed fixed structures of the crust and upper mantle based on CRUST 1.0 (Laske et al. 2013). The results at the TIA site are shown in the center of Fig. 2, and all results are shown in Additional file 1: Fig. S1.

Fig. 2
figure 2

Elevation profile obtained from the seismic reflection survey by Kawasaki et al. (2019). Their interpretation for the basement surface is shown with the green line. The basement surface obtained from the gravity survey by Pradhan et al. (2018) is also shown with the black line. The embedded column represents the S-wave velocity structure obtained from our microtremor survey at the TIA site

This site is located close to the reflection survey line as shown in Fig. 1c, so that the elevation profile obtained from the survey by Kawasaki et al. (2019) is shown in the background of Fig. 2. Their interpretation of the basement surface is indicated by the green line. The black line indicates the basement surface along the reflection survey line obtained from the gravity survey performed by Pradhan et al. (2018). The shapes of the basement surface are similar to each other and they agree with the interface between the sediments (red) and basement (green) in the result of the microtremor analysis.

We then analyzed the observed ground motions from earthquakes for VS structures at seismometer stations. The observed north–south and east–west components were converted to radial and tangential components using the earthquake epicenter location determined by the United States Geological Survey or Horiuchi et al. (2021). A Fourier transform was performed on the coda parts of this radial component and the observed vertical component to obtain the R/V spectral ratio. For example, seven earthquakes with determined epicenters were observed at the SGL station (Fig. 1b, c), and their R/V spectral ratios were calculated as indicated by the gray lines in Additional file 1: Fig. S2c. The average of these R/V spectral ratios was considered as the observed R/V spectral ratio (black line). This station is located close to the TIA site, as shown in Fig. 1c. Therefore, starting with the velocity structure at TIA (Additional file 1: Fig. S2a), the velocity structure at SGL was improved through trial and error (Additional file 1: Fig. S2b) to match the peak of the theoretical R/V spectral ratio of the Rayleigh wave to that of the observed ratio (magenta and black lines in Additional file 1: Fig. S2c). In this process, because of limited performance of R/V spectral ratio data, trial and error was performed by changing only the thicknesses of the sedimentary layers. The layer thicknesses at SGL were determined to be 80% of those at TIA (two-headed lines in Additional file 1: Figs. S2a, b). All the results of this analysis are shown in Additional file 1: Fig. S3.

The depths of the layer interfaces were determined from the obtained VS structures shown in Additional file 1: Figs. S1 and S3. The interface at the top of the 2.2 km/s layer is the basement surface, and the interfaces shallower than this are called sedimentary interfaces. The basement surface was also obtained from the results of the seismic reflection survey (Fig. 2) and borehole survey results of JICA (2018). The depths of the basement surface from the microtremors, ground motions, reflection survey, and borehole surveys are highly accurate but limited in number, whereas those from gravity observations by Pradhan et al. (2018) are numerous, as shown in Fig. 1b, c. The latter are called gravimetric depths and are plotted along with the former, collectively called seismic depths, in Fig. 3. This figure shows that the gravimetric depths have fair accuracy, although there are some variations.

Fig. 3
figure 3

Plot of gravimetric depths against seismic depths for the basement surface. The blue, green, and red dots represent gravimetric depths at the microtremor sites and seismometer stations, those along the reflection profile, and those at the boring sites, respectively. They are located at the horizontal positions corresponding to the seismic depths from the microtremor, ground motion, or boring data and those from the result of the reflection survey

According to the similarity and accuracy shown in Figs. 2 and 3, we use both the seismic and gravimetric depths for the modeling of the three-dimensional velocity structure in the Kathmandu Valley. It is noted that the microtremors and ground motions also provided us with the seismic depths of the sedimentary interfaces. In addition, to further utilize the advantage of the large amount of gravity data, a grid was defined over the area including the Kathmandu Valley, and Bouguer anomalies at its nodes are obtained by interpolation and extrapolation of the observations using the minimum curvature algorithm (Smith and Wessel 1990). The gravimetric depths calculated from them (Pradhan et al. 2018) were used in this study.

We assumed seven layers for the sediments and basement based on the VS structures in Additional file 1: Figs. S1 and S3. We adopted the accurate VS structures obtained from the joint inversions of the microtremor data (Additional file 1: Fig. S1), but excluded Chobar, because its shallow VS structure is very different from those of other stations. VS of an assumed layer was determined by averaging the values of the corresponding layers in the VS structures, as shown in Additional file 1: Table S1. The P-wave velocity (VP), density (ρ), and S-wave and P-wave quality factors (QS and QP, respectively) for each layer were obtained using the empirical relationships by Kitsunezaki et al. (1990), Ludwig et al. (1970), and Kawabe and Kamae (2008).

Modeling in Kathmandu Valley (methods)

The shapes of the basement surface (basement/sediment interface) and the four sedimentary interfaces separating the five upper layers were then determined in the form of a depth distribution. This determination is a variant of the common problem of map** a regionalized variable from limited sample of data. We considered the depth distribution of an interface as a regionalized variable \(z\left({\varvec{x}}\right)\) at unsampled location \({\varvec{x}}\), and assumed that seismic depths \({z}_{i}\) were obtained at sampled locations \({{\varvec{x}}}_{i} (i=1, 2, \dots ,n)\). To recover \(z\left({\varvec{x}}\right)\) from \({z}_{i}\), we use the ‘kriging’ method developed by Matheron (1963), which was named after pioneering work of Daniel Krige in the early 1950s. \(z\left({\varvec{x}}\right)\) was statistically distributed in region A, which was the Kathmandu Valley, with a probability density function. We adopted the normal score transformation of Deutsch and Journel (1997) such that \(z\left({\varvec{x}}\right)\) was converted to \(y\left({\varvec{x}}\right)\), the probability density of which is normally distributed with zero mean and unit variance. We note that the expectation value of a function \(g\left({\varvec{x}}\right)\) including \(y\left({\varvec{x}}\right)\) is defined as

$$E\left\{g\left({\varvec{x}}\right)\right\}={\iint }_{{\varvec{x}}\in A}g\left({\varvec{x}}\right)f\left({\varvec{x}}\right)d{\varvec{x}}$$
(1)

where \(f\left({\varvec{x}}\right)\) is the normal distribution with zero mean and unit variance.

It is assumed that \({z}_{i}\) is equal to the true variable at \({{\varvec{x}}}_{i}\), which is \(z\left({{\varvec{x}}}_{i}\right)\) or \(y\left({{\varvec{x}}}_{{\varvec{i}}}\right)\) in the normal score space, and that \(y\left({\varvec{x}}\right)\) is represented by a linear combination of \(y\left({{\varvec{x}}}_{i}\right)\) as

$$y\left({\varvec{x}}\right) \sim \sum_{i=1}^{n}{w}_{i}y\left({{\varvec{x}}}_{i}\right).$$
(2)

Because the integral in (1) is a weighted sum of \(g\left({\varvec{x}}\right)\) in A, the variance (i.e., the weighted sum of squared residuals) for the observation equation (2) can be written as

$$\begin{array}{l}S=E\left\{{\left[y\left({\varvec{x}}\right)-\sum_{i=1}^{n}{w}_{i}y\left({{\varvec{x}}}_{i}\right)\right]}^{2}\right\}\\ =E\left\{{\left[y\left({\varvec{x}}\right)\right]}^{2}\right\}+E\left\{{\left[\sum_{i=1}^{n}{w}_{i}y\left({{\varvec{x}}}_{i}\right)\right]}^{2}\right\}-2E\left\{y\left({\varvec{x}}\right)\sum_{i=1}^{n}{w}_{i}y\left({{\varvec{x}}}_{i}\right)\right\}\\ =E\left\{{\left[y\left({\varvec{x}}\right)\right]}^{2}\right\}+\sum_{i=1}^{n}\sum_{j=1}^{n}{w}_{i}{w}_{j}E\left\{y\left({{\varvec{x}}}_{i}\right)y\left({{\varvec{x}}}_{j}\right)\right\}-2\sum_{i=1}^{n}{w}_{i}E\left\{y\left({\varvec{x}}\right)y\left({{\varvec{x}}}_{i}\right)\right\}\end{array}$$

The covariance and variogram of \(y\left({\varvec{x}}\right)\) are defined as

$$C\left({\varvec{h}}\right)=E\left\{y\left({\varvec{x}}\right)y\left({\varvec{x}}+{\varvec{h}}\right)\right\}, \,2\gamma \left({\varvec{h}}\right)=E\left\{{\left[y\left({\varvec{x}}\right)-y\left({\varvec{x}}+{\varvec{h}}\right)\right]}^{2}\right\}.$$
(3)

They are related to each other by

$$C\left({\varvec{h}}\right)=C\left(0\right)- \gamma \left({\varvec{h}}\right)$$

where \(\gamma \left({\varvec{h}}\right)\) is a half of the variogram and called a semi-variogram. S is then rewritten using \(C\) in (3) as

$$\begin{array}{c}S=C(0)+\sum_{i=1}^{n}\sum_{j=1}^{n}{w}_{i}{w}_{j}C\left({{\varvec{x}}}_{j}-{{\varvec{x}}}_{i}\right)-2\sum_{i=1}^{n}{w}_{i}C\left({{\varvec{x}}}_{i}-{\varvec{x}}\right).\end{array}$$

S is minimum for the solutions of the normal equations \(\partial S/\partial {w}_{i}=0\), \(i=1, 2, \ldots , n\).

This formulation is extended to ‘cokriging’ where data with different attributes were included (e.g., Doyen 1988). We included the gravimetric depths \({z^{\prime}}_{i{\prime}}=z^{\prime}\left({{{\varvec{x}}}{\boldsymbol{^{\prime}}}}_{{i}^{\prime}}\right), i^{\prime}=1, 2, \ldots , n^{\prime}\) in addition to the seismic depths \({z}_{i}=z\left({{\varvec{x}}}_{i}\right), i=1, 2, \ldots , n\). (2) then yielded the observation equation:

$$y\left({\varvec{x}}\right) \sim \sum_{i=1}^{n}{w}_{i}y\left({{\varvec{x}}}_{i}\right)+\sum_{i\mathrm{^{\prime}}=1}^{n\mathrm{^{\prime}}}{w\mathrm{^{\prime}}}_{i\mathrm{^{\prime}}}y\mathrm{^{\prime}}\left({{\varvec{x}}\mathbf{^{\prime}}}_{i\mathrm{^{\prime}}}\right),$$
(4)

where \(y^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{i^{\prime}}\right)\) is the normal score transform of \(z^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{i^{\prime}}\right)\) and \({w^{\prime}}_{i^{\prime}}\) is the linear combination coefficient for \(y^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{i^{\prime}}\right)\). The weighted sum of squared residuals for (4) is

$$\begin{array}{l}S=E\left\{{\left[y\left({\varvec{x}}\right)-\sum_{i=1}^{n}{w}_{i}y\left({{\varvec{x}}}_{i}\right)-\sum_{i\mathrm{^{\prime}}=1}^{n\mathrm{^{\prime}}}{w\mathrm{^{\prime}}}_{i\mathrm{^{\prime}}}y\mathrm{^{\prime}}\left({{\varvec{x}}\mathbf{^{\prime}}}_{i\mathrm{^{\prime}}}\right)\right]}^{2}\right\}\\ =E\left\{{\left[y\left({\varvec{x}}\right)\right]}^{2}\right\}+E\left\{{\left[\sum_{i=1}^{n}{w}_{i}y\left({{\varvec{x}}}_{i}\right)\right]}^{2}\right\}+E\left\{{\left[\sum_{i\mathrm{^{\prime}}=1}^{n\mathrm{^{\prime}}}{w\mathrm{^{\prime}}}_{i\mathrm{^{\prime}}}y\mathrm{^{\prime}}\left({{\varvec{x}}\mathbf{^{\prime}}}_{i\mathrm{^{\prime}}}\right)\right]}^{2}\right\}\\ -2E\left\{y\left({\varvec{x}}\right)\sum_{i=1}^{n}{w}_{i}y\left({{\varvec{x}}}_{i}\right)\right\}-2E\left\{y\left({\varvec{x}}\right)\sum_{{i}^{\mathrm{^{\prime}}}=1}^{{n}^{\mathrm{^{\prime}}}}{{w}^{\mathrm{^{\prime}}}}_{{i}^{\mathrm{^{\prime}}}}{y}^{\mathrm{^{\prime}}}\left({{\varvec{x}}\mathbf{^{\prime}}}_{{i}^{\mathrm{^{\prime}}}}\right)\right\}+2E\left\{\sum_{i=1}^{n}{w}_{i}y\left({{\varvec{x}}}_{i}\right)\sum_{i\mathrm{^{\prime}}=1}^{n\mathrm{^{\prime}}}{w\mathrm{^{\prime}}}_{i\mathrm{^{\prime}}}y\mathrm{^{\prime}}\left({{\varvec{x}}\mathbf{^{\prime}}}_{i\mathrm{^{\prime}}}\right)\right\}\\ =E\left\{{\left[y\left({\varvec{x}}\right)\right]}^{2}\right\}+\sum_{i=1}^{n}\sum_{j=1}^{n}{w}_{i}{w}_{j}E\left\{y\left({{\varvec{x}}}_{i}\right)y\left({{\varvec{x}}}_{j}\right)\right\}+\sum_{{i}^{\mathrm{^{\prime}}}=1}^{{n}^{\mathrm{^{\prime}}}}\sum_{{j}^{\mathrm{^{\prime}}}=1}^{{n}^{\mathrm{^{\prime}}}}{{w}^{\mathrm{^{\prime}}}}_{{i}^{\mathrm{^{\prime}}}}{{w}^{\mathrm{^{\prime}}}}_{{j}^{\mathrm{^{\prime}}}}E\left\{{y}^{\mathrm{^{\prime}}}\left({{\varvec{x}}\mathbf{^{\prime}}}_{{i}^{\mathrm{^{\prime}}}}\right){y}^{\mathrm{^{\prime}}}\left({{\varvec{x}}\mathbf{^{\prime}}}_{{j}^{\mathrm{^{\prime}}}}\right)\right\}\\ - 2\sum_{i=1}^{n}{w}_{i}E\left\{y\left({\varvec{x}}\right)y\left({{\varvec{x}}}_{i}\right)\right\}- 2\sum_{{i}^{\mathrm{^{\prime}}}=1}^{{n}^{\mathrm{^{\prime}}}}{{w}^{\mathrm{^{\prime}}}}_{{i}^{\mathrm{^{\prime}}}}E\left\{y\left({\varvec{x}}\right){y}^{\mathrm{^{\prime}}}\left({{\varvec{x}}\mathbf{^{\prime}}}_{{i}^{\mathrm{^{\prime}}}}\right)\right\}+ 2\sum_{i=1}^{n}\sum_{{i}^{\mathrm{^{\prime}}}=1}^{{n}^{\mathrm{^{\prime}}}}{w}_{i}{{w}^{\mathrm{^{\prime}}}}_{{i}^{\mathrm{^{\prime}}}}E\left\{y\left({{\varvec{x}}}_{i}\right){y}^{\mathrm{^{\prime}}}\left({{\varvec{x}}\mathbf{^{\prime}}}_{{i}^{\mathrm{^{\prime}}}}\right)\right\}.\end{array}$$
(5)

Using (2), \({C}^{\prime}\left({\varvec{h}}\right)=E\left\{y\left({\varvec{x}}\right){y}^{\prime}\left({\varvec{x}}+{\varvec{h}}\right)\right\},\) and \({C}^{{\prime}{\prime}}\left({\varvec{h}}\right)=E\left\{{y}^{\prime}\left({\varvec{x}}\right){y}^{\prime}\left({\varvec{x}}+{\varvec{h}}\right)\right\},\ \ 2{\gamma }^{{\prime}{\prime}}\left({\varvec{h}}\right)=E\left\{{\left[{y}^{\prime}\left({\varvec{x}}\right)-{y}^{\prime}\left({\varvec{x}}+{\varvec{h}}\right)\right]}^{2}\right\},\) which are related to each other by

$$C{^{\prime}}{^{\prime}}\left({\varvec{h}}\right)=C{^{\prime}}{^{\prime}}\left(0\right)- \gamma^{{\prime}{\prime}}\left({\varvec{h}}\right) ,$$
(6)

the variance S in (5) is rewritten as

$$\begin{array}{c}S=\,C\left(0\right)+\sum_{i=1}^{n}\sum_{j=1}^{n}{w}_{i}{w}_{j}C\left({{\varvec{x}}}_{j}-{{\varvec{x}}}_{i}\right)+\sum_{{i}^{\mathrm{^{\prime}}}=1}^{{n}^{\mathrm{^{\prime}}}}\sum_{{j}^{\mathrm{^{\prime}}}=1}^{{n}^{\mathrm{^{\prime}}}}{{w}^{\mathrm{^{\prime}}}}_{{i}^{{^{\prime}}}}{{w}^{{^{\prime}}}}_{{j}^{{^{\prime}}}}C^{\prime\prime}\left({{\varvec{x}}^{\prime}}_{j{^{\prime}}}-{{\varvec{x}}^{\prime}}_{i{^{\prime}}}\right) \\ -2\sum_{i=1}^{n}{w}_{i}C\left({\varvec{x}}_{i}-{\varvec{x}}\right)-2\sum_{i^{\prime}=1}^{n^{\prime}}{w^{\prime}}_{i^{\prime}}C^{\prime}\left({{\varvec{x}}^{\prime}}_{i^{\prime}}-{\varvec{x}}\right)+ 2\sum_{i=1}^{n}\sum_{{i}^{\prime}=1}^{{n}^{\prime}}{w}_{i}{{w}^{\prime}}_{i^{\prime}}C^{\prime}\left({{\varvec{x}}^{\prime}}_{i^{\prime}}-{{\varvec{x}}}_{i}\right) .\end{array}$$
(7)

This yields the partial derivatives

$$\begin{array}{l}{\partial S}/{\partial {w}_{i}}=2\sum_{j=1}^{n}{w}_{j}C\left({{\varvec{x}}}_{j}-{{\varvec{x}}}_{i}\right)-2C\left({{\varvec{x}}}_{i}-{\varvec{x}}\right)+2\sum_{i^{\prime}=1}^{n^{\prime}}{{w}^{\prime}}_{{i}^{\prime}}C^{\prime}\left({{\varvec{x}}^{\prime}}_{i^{\prime}}-{{\varvec{x}}}_{i}\right),\ \ i=1, 2, \ldots , n\\ {\partial S}/{\partial {w^{\prime}}_{i^{\prime}}}=2\sum_{j^{\prime}=1}^{n^{\prime}}{w^{\prime}}_{j^{\prime}}C{^{\prime}}{^{\prime}}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{j{\prime}}-{{\varvec{x}}\boldsymbol{^{\prime}}}_{i{\prime}}\right)-2C^{\prime}\left({{\varvec{x}}^{\prime}}_{{i}{\prime}}-{\varvec{x}}\right)+2\sum_{i=1}^{n}{w}_{i}C^{\prime}\left({{\varvec{x}}^{\prime}}_{{i}^{\prime}}-{{\varvec{x}}}_{i}\right),\ \ i^{\prime}=1, 2, \ldots , n^{\prime}\end{array}$$

and the normal equations

$$\left(\begin{array}{cccccc}C\left({{\varvec{x}}}_{1}-{{\varvec{x}}}_{1}\right)& \cdots & C\left({{\varvec{x}}}_{n}-{{\varvec{x}}}_{1}\right)& C^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{1}-{{\varvec{x}}}_{1}\right)& \cdots & C^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{n^{\prime}}-{{\varvec{x}}}_{1}\right)\\ \vdots & \ddots & \vdots & \vdots & \ddots & \vdots \\ C\left({{\varvec{x}}}_{1}-{{\varvec{x}}}_{n}\right)& \cdots & C\left({{\varvec{x}}}_{n}-{{\varvec{x}}}_{n}\right)& C^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{1}-{{\varvec{x}}}_{n}\right)& \cdots & C^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{n^{\prime}}-{{\varvec{x}}}_{n}\right)\\ C^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{1}-{{\varvec{x}}}_{1}\right)& \cdots & C^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{1}-{{\varvec{x}}}_{n}\right)& C{^{\prime}}{^{\prime}}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{1}-{{\varvec{x}}\boldsymbol{^{\prime}}}_{1}\right)& \cdots & C{^{\prime\prime}}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{n^{\prime}}-{{\varvec{x}}\boldsymbol{^{\prime}}}_{1}\right)\\ \vdots & \ddots & \vdots & \vdots & \ddots & \vdots \\ C^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{n^{\prime}}-{{\varvec{x}}}_{1}\right)& \cdots & C^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{n^{\prime}}-{{\varvec{x}}}_{n}\right)& C{^{\prime}}{^{\prime}}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{1}-{{\varvec{x}}\boldsymbol{^{\prime}}}_{n^{\prime}}\right)& \cdots & C{^{\prime}}{^{\prime}}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{n^{\prime}}-{{\varvec{x}}\boldsymbol{^{\prime}}}_{n^{\prime}}\right)\end{array}\right)\left(\begin{array}{c}{w}_{1}\\ \vdots \\ {w}_{n}\\ {w^{\prime}}_{1}\\ \vdots \\ {w^{\prime}}_{n^{\prime}}\end{array}\right)=\left(\begin{array}{c}C\left({{\varvec{x}}}_{1}-{\varvec{x}}\right)\\ \vdots \\ C\left({{\varvec{x}}}_{n}-{\varvec{x}}\right)\\ C\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{1}-{\varvec{x}}\right)\\ \vdots \\ C\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{n^{\prime}}-{\varvec{x}}\right)\end{array}\right) .$$
(8)

The number of covariances included in this system of linear equations is as many as \({\left(n+n\mathrm{^{\prime}}\right)}^{2}+n+n^{\prime}\), which requires much computation. However, because the volume support of the gravimetric depth is larger than that of the seismic depth, it can be assumed that the statistical dependence of seismic depth on gravimetric depth is limited to the co-located depths. This approximation is called the Markov Model 2, and results in linear relations

$$C\mathrm{^{\prime}}({\varvec{h}})={\rho }_{12}C{^{\prime}}{^{\prime}}({\varvec{h}})$$
(9)

and

$$C({\varvec{h}})={\rho }_{12}^{2}C{^{\prime}}{^{\prime}}({\varvec{h}})+\left(1-{\rho }_{12}^{2}\right){\rho }_{R}({\varvec{h}})$$
(10)

where \({\rho }_{12}\) is the correlation coefficient inferred from co-located data \(y\left({\varvec{x}}\right)\) and \(y^{\prime}\left({\varvec{x}}\right)\), and \({\rho }_{R}({\varvec{h}})\) is any permissible covariance (Journel 1999). If \(C{^{\prime}}{^{\prime}}\) is calculated from the gravimetric depths, all the covariances in (8) can be obtained using (9) and (10). Therefore, we can solve (8) and calculate the cokriging estimate and variance using the solution of (8), the observation equation (4), and the normal score back-transformation. It is additionally noted that

$$C\left(0\right)= {C}^{{\prime}{\prime}}\left(0\right)=1$$
(11)

because the probability density function for \(y\left({\varvec{x}}\right)\) and \(y^{\prime}\left({\varvec{x}}\right)\) is the normal distribution with zero mean and unit variance.

To obtain the depths at the nodes of a grid as the depth distribution of an interface, we adopted the procedure of the Sequential Gaussian Co-Simulation (Remy et al. 2009) as follows:

  1. 1.

    Each node \({\varvec{x}}\) of the grid on an interface is defined.

  2. 2.

    The initial data set is constructed from the normal score transforms of the seismic and gravimetric depths.

  3. 3.

    for each node

    1. 3.1

      Cokriging of the data set is performed to obtain cokriging estimate and variance.

    2. 3.2

      A value is derived from the normal distribution with the cokriging estimate and variance and added to the data set.

    end for

  4. 4.

    The simulated values are back-transformed.

Modeling in Kathmandu Valley (results)

The depth distributions of the five interfaces, which comprised the basement surface (basement/sediment interface) and the four sedimentary interfaces, were determined using the data and methods described in the previous two sections. In Step 1 of the above procedure, a grid with 200 m intervals was applied to the rectangular region including the Kathmandu Valley. This grid and its nodes (\({{\varvec{x}}}_{k}, k=1, 2, \ldots , K\)) were commonly used for all the five interfaces. In Step 2, we took the normal score transforms of the gravimetric depths for the basement surface (\({y}^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{i^{\prime}}\right), i^{\prime}=1,\cdots ,n^{\prime}\)) and those of the seismic depths for an interface (\(y\left({{\varvec{x}}}_{i}\right), i=1,\cdots ,n\)), and constructed the initial data set by combining \({y}^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{i{\prime}}\right)\) and \(y\left({{\varvec{x}}}_{i}\right)\). The correlation coefficient \({\rho }_{12}\) of the normal score transforms of the co-located seismic and gravimetric depths were calculated in advance.

In 3.1 of Step 3, we proceeded to a random node at \({{\varvec{x}}}_{k}\) in the grid. We first calculated the variogram \({\gamma^{{\prime}{\prime}}}\left({\varvec{h}}\right)\) of \({y}^{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{i{\prime}}\right)\) since we used the approximation of Markov Model 2. The calculated values are plotted as red crosses in Additional file 1: Fig. S4, and the black line in this figure indicates that a spherical model:

$${\upgamma }_{s}\left({\varvec{h}}\right)=\left\{\begin{array}{cl}{3}/{2}\cdot\left|{\varvec{h}}\right|/{a}-{1}/{2}\cdot{\left({\left|{\varvec{h}}\right|}/{a}\right)}^{3} & \mathrm{if} \left|{\varvec{h}}\right|\le a,\\ 1 & {\mathrm{otherwise}}\end{array}\right.$$
(12)

with a range \(a=8000\) m fit them. \(C{^{\prime}}{^{\prime}}\left({\varvec{h}}\right)\) s in (8) were calculated using (6), (11) and \(\gamma^{{\prime}{\prime}}\left({\varvec{h}}\right)={\upgamma }_{s}\left({\varvec{h}}\right)\) in (12) with the range. \(C^{\prime}\left({\varvec{h}}\right)\)s in (8) were calculated using \({\rho }_{12}\) calculated in Step 2 and (9). We assumed \({\rho }_{R}({\varvec{h}})\) in (10) to be \(1-{\upgamma }_{s}\left({\varvec{h}}\right)\), where \({\upgamma }_{s}\left({\varvec{h}}\right)\) is the spherical model in (12) with \(a=8000\) m. \(C\left({\varvec{h}}\right)\)s in (8) are calculated using this assumption and (10). Thus, the system of linear equations in (8) was set up and solved for \({w}_{1}, \ldots , {w}_{n}\) and \({w^{\prime}}_{1}, \ldots , {w^{\prime}}_{n^{\prime}}\). From these solutions and the observation equation (4), we obtained the cokriging estimate (estimate by the cokriging) for \(y\) at \({{\varvec{x}}}_{k}\):

$${y}_{*}\left({{\varvec{x}}}_{k}\right)=\sum_{i=1}^{n}{w}_{i}y\left({{\varvec{x}}}_{i}\right)+\sum_{i{\prime}=1}^{n{\prime}}{w{\prime}}_{i{\prime}}y{\prime}\left({{\varvec{x}}\boldsymbol{^{\prime}}}_{i{\prime}}\right).$$

Using the solutions together with (7), we obtained the cokriging variance (variance by the cokriging) \(S\left({{\varvec{x}}}_{k}\right)\) for \(y\) at \({{\varvec{x}}}_{k}\).

In 3.2 of Step 3, as reported in Pyrcz and Deutsch (2014), we drew a random residual \(R\left({{\varvec{x}}}_{k}\right)\) that followed a normal distribution with zero mean and variance of \(S\left({{\varvec{x}}}_{k}\right)\) using a pseudo-random number generator for a given “seed”. We added the cokriging estimate and the residual to get the simulated value \({y}_{s}\left({{\varvec{x}}}_{k}\right)={y}_{*}\left({{\varvec{x}}}_{k}\right)+R\left({{\varvec{x}}}_{k}\right)\). At the end of the for loop in Step 3, we add \({y}_{s}\left({{\varvec{x}}}_{k}\right)\) to the initial data set to ensure that the covariance with this value and all future predictions is correct.

We revisited the beginning of the for loop with a next random node at \({{\varvec{x}}}_{k^{\prime}}\) in the grid. The processes of 3.1 and 3.2 were repeated for this node and the data set after the previous iteration to obtain the simulated value \({y}_{s}\left({{\varvec{x}}}_{k^{\prime}}\right)\) and added this to the data set. We then visited all \(K\) nodes in a random order to generate the simulated values. In Step 4, all the simulated values in the normal score units were finally back-transformed to the distribution of depths of each interface, which was collectively called a realization. Steps 3 and 4 were repeated 50 times with different random number seeds, and the average of the 50 realizations was used as a model for the depth distribution of each interface. Figure 4 shows the models for the four sedimentary interfaces and the basement surface. To show the level of confidence for them, we drew the distribution of the standard deviation for the depth of the basement surface in Additional file 1: Fig. S5.

Fig. 4
figure 4

Modeled depth distributions of the four sedimentary interfaces (a to d) and the basement surface (e). The brown line in each distribution indicates the basin boundary of Shrestha et al. (1998)

Comparing Figs. 1b and 4, we found deep interfaces, which correspond to thick sediments, in the center of the valley and along the Bagmati River and its tributaries. In the southern half of the valley, the thickest sediments (deepest interfaces) are not distributed along the Bagmati River but extend south from where the river runs from east to west. The southern extension is located in an area where lacustrine sediments and delta deposits are distributed (Sakai et al. 2016). This suggests that sedimentation in the valley is predominantly related to the Paleo-Kathmandu Lake.

To observe the effect of the valley on seismic ground motion, we simulated ground velocities from an M 5.4 earthquake in the east at points A and B, which represent the edge and center of the valley (left panel of Fig. 5). We used a 3D finite difference method with the fourth-order difference scheme (Levander 1988), a staggered grid (Virieux 1986), and viscoelastic formulation of Robertsson et al. (1994). A bandpass filter from 0.1 to 0.5 Hz was applied to the simulation results. The ground velocities at point A had simple waveforms consisting mainly of body waves, whereas those at point B had complex waveforms with long duration and large amplitudes, including surface waves that developed while propagating through the valley (right panel of Fig. 5).

Fig. 5
figure 5

Right panel compares the simulated ground velocities from an M 5.4 earthquake in the east at points A and B, which represent the edge and center of the basin as shown in the topographic map on the left. The brown and black lines in the map indicate the basin and sedimentary boundaries of Shrestha et al. (1998), respectively

Modeling around Kathmandu Valley

Koketsu et al. (2016) constructed a regional-scale velocity structure model around the Kathmandu Valley to simulate the ground motions caused by the 2015 Gorkha earthquake. The target region is indicated by the large rectangle in Fig. 1a. This model was based on the geometry of the Indian and Eurasian plates by Sapkota et al. (2013) and the crustal structures in CRUST 1.0 developed by Laske et al. (2013). However, CRUST 1.0 provides no basement on the Nepalese side along the MFT; therefore a layer of VS 2.1 km/s was introduced as on the Indian side. The shape of the Kathmandu Valley was defined according to Yoshida and Igarashi (1984), and the valley was filled with uniform sediments of VS 0.6 km/s. The VS of the Indian lithosphere was assumed to be 3.2 km/s. Apart from these, the VS, VP, and ρ of each layer were obtained from CRUST 1.0. QS was obtained from VS using the empirical relationship of Brocher (2008), and QP was assumed to be twice QS.

The underthrusting Indian lithosphere was modeled to a depth of 8.4 km in this model (Fig. 6a), but extended beyond that depth to the northern edge of the modeled region, as shown in Fig. 6b. This extension was performed following well-known reviews on Himalayan tectonics (e.g., Avouac 2003; Bilham 2019). The uniform sediments in the Kathmandu Valley (Fig. 6a) were replaced with the velocity structure model obtained in the previous sections, as shown in Fig. 6b. If we use the old model in the Kathmandu Valley, for example, the calculated phase velocities and R/V spectral ratios at SDB (green lines in Additional file 1: Fig. S6) do not match the observed ones (gray circles), but those calculated with our new model (violet lines) well fit to the observed ones. Kobayashi et al. (2016) found that the source region of the Gorkha earthquake was located in the VS 3.5 km/s layer of their velocity structure model. Because large historical earthquakes occurred even on the shallow part of the underthrusting lithosphere, the VS 3.5 km/s layer on the Nepalese side was raised to below the MFT, as shown in Fig. 6b.

Fig. 6
figure 6

Panels a and b show the cross sections of the regional-scale velocity structure models by Koketsu et al. (2016) and this study, respectively, along line XY in Fig. 1a. The underthrusting Indian plate is modeled to a depth of 8.4 km in a, but extends beyond that depth to the northern edge of the modeled region in b. The sediments in the Kathmandu Valley are assumed to be homogeneous with VS of 0.6 km/s in a, while the velocity structure model obtained in the previous sections is applied in b. The VS 3.5 km/s layer on the Nepal side is raised to below the MFT (dotted line) based on Kobayashi et al. (2016)

Conclusions

We analyzed the microtremors observed by Yokoi et al. (2021) and the seismic ground motions observed by Takai et al. (2021) to obtain horizontally layered structures at their observation sites. Kawasaki et al. (2019) conducted a reflection survey to obtain a two-dimensionally layered structure along a survey line. From these results, we obtained seismic depths of four sedimentary interfaces and basement surface (basement/sediment interface). Pradhan et al. (2018) performed gravity observations to obtain the gravimetric depths of the basement surface. The accuracy of the seismic depths is higher than that of the gravimetric depths, while the number of the former is smaller than that of the latter. To generate the depth distributions of the interfaces from these depths with such different attributes, we adopted the Sequential Gaussian Co-Simulation (Remy et al. 2009) including the cokriging approach (e.g. Doyen 1988). We combined the results of this method and assumptions for deeper parts (Additional file 1: Table S1) into a three-dimensional velocity structure model of the Kathmandu Valley. The model suggests that the sedimentation in the valley is predominantly related to the Paleo-Kathmandu Lake. We then incorporate this model into a regional-scale model for ground motion simulation (Koketsu et al. 2016). We also revised the regional-scale model to solve the problems of the extension of the underthrusting lithosphere and VS in its adjacent area.