Space-time (tensor product) model in Ngme2 • ngme2

Introduction

In this vignette, we will show how to fit the separable space-time model (using tensor product structure in ngme2).

The separable space-time model is defined by the Kronecker product between the precision matrices of the spatial and temporal random effects. Additional information about separable space-time models can be found in Cameletti et al. (2013).

Model structure

For the usual model, we have the following structure: $\mathbf{K} \mathbf{X}(s) = \boldsymbol{\epsilon},$ where $\mathbf{K}$ is some operator matrix, $\boldsymbol{\epsilon}$ represents the noise (Gaussian or non-Gaussian).

In the tensor product model, the operator matrix $K$ can be constructed by other two operator matrices of two models:

$K = K_l \otimes K_r,$ where $K_l$ and $K_r$ are the operator matrices of the first and second models, respectively.

A toy example

To use the space-time model, we need to first define the mesh of the model, i.e., the discretization of the space-time domain (time $\times$ location). It can be done by providing the mesh for each. For example, we can provide the same mesh as in the regular spatial model, and also provide the mesh for the time index.

The R interface for tensor product model requires map as a list of 2 indices and 2 operators namely first (time) and second (space) to build the model.

The following is one simple example of how to build the space-time (2d location) model.

Here the mesh of the model will be ordered according to the order of the time index (year).

set.seed(16)
library(ngme2)

n <- 10
# generate time randomly of length n
time <- sample(2001:2004, n, replace = TRUE)
# generate 2d location randomly of length n
loc <- cbind(runif(n), runif(n)) * 10

# show the time and loc
data.frame(time, loc)
#>    time        X1       X2
#> 1  2001 3.1974498 9.676396
#> 2  2003 5.9111487 8.120988
#> 3  2003 1.5721967 5.445619
#> 4  2001 6.6138198 4.306273
#> 5  2003 5.2534078 2.278224
#> 6  2003 2.4020197 6.487465
#> 7  2004 8.4731194 9.513628
#> 8  2004 6.8851465 9.739844
#> 9  2002 7.1672573 7.647669
#> 10 2002 0.7615558 4.748097


# create the mesh for space (2d location)
mesh <- fmesher::fm_mesh_2d(
  loc.domain = cbind(c(0, 1, 1, 0, 0) * 10, c(0, 0, 1, 1, 0) * 5),
  max.edge = c(1, 10),
  cutoff = 0.1
)
plot(mesh)


# define the space-time model
m0 <- ngme2::f(
  map=list(time, loc), # from the data
  model="tp",
  first=list(model="ar1"), # ar1 model for time (mesh generated automatically)
  second = list(model="matern", mesh = mesh)
)

# show the model
m0
#> Model type: Tensor product
#>     first: AR(1)
#>         rho = 0
#>     second: Matern
#>         kappa = 1
#> Noise type: NORMAL
#> Noise parameters: 
#>     sigma = 1

A AR(1) x Matern 2d example

Now let’s turn to simulate and estimate this type of model.

##############################  simulation
mesh2d <- fmesher::fm_mesh_2d(
  loc.domain = cbind(c(0, 1, 1, 0, 0) * 10, c(0, 0, 1, 1, 0) * 5),
  max.edge = c(1, 10),
  cutoff = 0.1
)
mesh2d$n
#> [1] 256

# generate random loc for each year
n_obs <- c(102, 85, 120, 105, 109, 100) # observation for each year
year <- rep(2001:2006, times = n_obs)

# 2d coordinate
x <- runif(sum(n_obs)) * 10;
y <- runif(sum(n_obs)) * 5

# set the model for simulation
true_model <- ngme2::f(
  map = list(year, ~x+y),
  model = "tp",
  first = list(model="ar1", rho = 0.5),
  second = list(model="matern", mesh = mesh2d),
  noise = noise_nig(mu=-2, sigma=1, nu=2)
)

W <- simulate(true_model)[[1]]
Y_obs <- W + rnorm(length(W), sd = 0.5)
df <- data.frame(year, x, y, Y_obs)

Next we run the estimation:

##############################  estimation
ngme_fit <- ngme(
  Y_obs ~ 0 + f(
    map = list(year, ~x+y),
    model="tp",
    name="tp",
    first = list(model="ar1"),
    second = list(model="matern", mesh = mesh2d),
    noise = noise_nig()
    # control = control_f(numer_grad = T)
  ),
  data = df,
  family = "normal",
  control_opt = control_opt(
    iterations = 1000,
    n_parallel_chain = 4,
    optimizer = adamW()
    # rao_blackwellization = TRUE
  ),
  debug = FALSE
)
#> Starting estimation... 
#> 
#> Starting posterior sampling... 
#> Posterior sampling done! 
#> Average standard deviation of the posterior W:  0.614065 
#> Note:
#>       1. Use ngme_post_samples(..) to access the posterior samples.
#>       2. Use ngme_result(..) to access different latent models.

ngme_fit
#> *** Ngme object ***
#> 
#> Fixed effects: 
#>   None
#> 
#> Models: 
#> $tp
#>   Model type: Tensor product
#>       first: AR(1)
#>           rho = 0.393
#>       second: Matern
#>           kappa = 0.79
#>   Noise type: NIG
#>   Noise parameters: 
#>       mu = -1.54
#>       sigma = 0.406
#>       nu = 1.31
#> 
#> Measurement noise: 
#>   Noise type: NORMAL
#>   Noise parameters: 
#>       sigma = 0.484

To see the results of estimation, we can use traceplot function.

traceplot(ngme_fit, "tp", hline=c(rho=0.5, kappa=1, mu=-2, sigma=1, nu=2))

#> Last estimates:
#> $rho
#> [1] 0.368578
#> 
#> $kappa
#> [1] 0.795146
#> 
#> $mu
#> [1] -1.543945
#> 
#> $sigma
#> [1] 0.454738
#> 
#> $nu
#> [1] 1.456951
# compare noise density
plot(noise_nig(mu=-2, sigma=1, nu=2), ngme_result(ngme_fit, "tp")$noise)

# traceplot(ngme_fit)

Doing prediction

Next we will show how to do prediction at unknown year and location.

# new predict location
n_new <- 10
# generate time randomly of length n
time_new <- sample(2001:2004, n_new, replace = TRUE)
# generate 2d location randomly of length n
loc_new <- cbind(runif(n_new), runif(n_new)) * 10

# For the tp model (model name we give), we need two arguments to do the prediction.
pd <- predict(ngme_fit, map=list(tp=list(
  year=time_new,
  pos=loc_new
)))
head(pd)
#> $mean
#>  [1]  0.39575614 -0.89271305  0.01622907  0.28586548  0.06282132  0.33569592
#>  [7]  0.08182618  0.00000000  0.04033962 -0.91512267
#> 
#> $sd
#>  [1] 0.3415866 0.4713553 0.3653255 0.3937367 0.2121533 0.3390735 0.2363070
#>  [8] 0.0000000 0.1934200 0.3406784
#> 
#> $`0.05q`
#>  [1] -0.1815658 -1.7473698 -0.5662457 -0.4778345 -0.3264473 -0.2587674
#>  [7] -0.3689481  0.0000000 -0.2911713 -1.4961883
#> 
#> $`0.95q`
#>  [1]  0.8580689 -0.2619931  0.6130286  0.8182632  0.3640016  0.8520269
#>  [7]  0.4250076  0.0000000  0.3339397 -0.3983097
#> 
#> $median
#>  [1]  0.41545900 -0.81132308  0.03443021  0.35508068  0.09272731  0.36196914
#>  [7]  0.10888361  0.00000000  0.05804622 -0.88064731
#> 
#> $mode
#>  [1]  0.35000000 -0.45949608  0.18210392  0.43353745  0.15521835  0.37519348
#>  [7]  0.17509467  0.00000000  0.08801994 -0.85803729

References

Cameletti, M., Lindgren, F., Simpson, D., & Rue, H. (2013). Spatio-temporal modeling of particulate matter concentration through the SPDE approach. AStA Advances in Statistical Analysis, 97(2), 109-131. https://doi.org/10.1007/s10182-012-0196-3