Hidden Markov Precipitation Model

The groundwater market model described by (Cialenco & Ludkovski, 2025) is stochastically driven by groundwater recharge process $\{R_t\}_{t\in\mathbb{N}_0}$ for year $t$. In order for this model to be a useful tool for groundwater conservation and planning we need to be able to efficiently simulate this process whilst ensuring the result is

  • 1️⃣ Markovian

  • 2️⃣ Exogenous

  • 3️⃣ Low-cost

  • 4️⃣ Controllable

Assuming that precipitation is the primary driver of recharge we model annual precipitation using a Hidden Markov Model (HMM) before mapping to groundwater recharge using functional estimates. In this note we outline our procedure and provide a summary of our results.

Background

HMMs consist of a pair of stochastic processes $(S(t), P(t))$ known as the (unobserved) latent process and the (observed) emission process respectively.

Fig 1: HMM Connected Graph.

The latent state process $\{S(t)\}$ is a Markov chain (MC) taking values in finite state space $\mathcal{S}:=\{1, …, S\}$. The number of states is specified a priori and the interpretation of these states can only be inferred post-fitting.

The MC $\{S_t\}$ is described by its transition probability matrix:

\[\begin{aligned} \Gamma & := (\gamma_{i,j})_{i,j\in\mathcal{S}} \\ & = ((\mathbb{P}(S(t+1)=j|S(t)=i)))_{i,j\in\mathcal{S}}, \end{aligned}\]

with initial state distribution vector

\[\boldsymbol{\pi}:=(\pi_i)_{i\in\mathcal{S}} = (\mathbb{P}(S(1)=i))_{i\in\mathcal{S}}.\]

The precipitation process data $\boldsymbol{P}(t)\in\mathbb{R}_+^M$ are realizations of precipitation densities that are conditionally dependent only on the current state of the process $S(t)$

\[\mathbb{P}(\boldsymbol{P}(t)|\boldsymbol{P}(1:t-1),S(1:t)=s(1:t)) = \mathbb{P}(\boldsymbol{P}(t)|S(t)=s(t)).\]

The precipitation density vector is given by

\[\begin{aligned} \boldsymbol{\eta}(t|S(t)) & = (\eta^{(m)}(t|S(t)))_{m\in\mathcal{M}} \\ & = (\mathbb{P}(\boldsymbol{P}^{(m)}(t)|S(t)=j))_{m\in\mathcal{M}}. \end{aligned}\]

Model

For our implementation we focus on the Central Valley (CV) aquifer in California which is split into multiple water regions which we index by $m\in{\cal M}$. We assume that total annual groundwater recharge $R(t)$ is given by

\[\begin{aligned} R(t) & = \sum_{m\in\mathcal{M}} R^{(m)}(t) = \sum_{m\in\mathcal{M}} f_m(P^{(m)}(t)), \end{aligned}\]

where:

  1. $f_m$ are region specific (possibly non-linear) percolation functions; and

  2. $P^m(t)$ are the regional average precipitation time series.

The precipitation time series processes $P^{(m)}(t)$ are given by a common state HMM and the percolation functions are fitted via a local polynomial regression (LOESS).

Data

Our precipitation data is computed from the gridded 100-year historic daily weather data (Product_100yr) provided by the California Department of Water Resources (DWR) Open Data Portal.

Our percolation data is computed using data from California Central Valley Fine-Grid Groundwater-Surface Water Simulation Model (C2VSimFG) created and made available by the California Department of Water Resources (DWR).

Rainfall data

We consider three basins: Sacramento River (SR), San Joaquin river (SJ) and Tulare (TL).

Fig 2: California CV annual rainfall time series grouped by hydrological region.

Percolation data

Fig 3: California CV annual percolation time series grouped by hydrological region.

Results

Model Parameters

Through exploratory data analysis we determine that 3 latent states are optimal and that the log-normal density with regional parameters $\mu_m$ and $\sigma_m$ best fit the data characteristics. The fitted emission distribution parameters are summarized in Table 1 and Table 2 below.

Log-normal parameters

Region μ1 σ1 μ2 σ2 μ3 σ3
SR 7.162 0.117 6.855 0.148 6.519 0.217
SJ 6.930 0.102 6.636 0.081 6.216 0.181
TL 6.470 0.149 6.152 0.141 5.717 0.180


Log-normal mean/standard deviation

Region m1 s1 m2 s2 m3 s3
SR 1298.34 152.43 959.06 142.72 694.05 152.40
SJ 1027.83 105.11 764.54 62.03 508.97 92.88
TL 652.69 97.79 474.35 67.22 309.12 57.05


Retrospectively we interpret the states as follows:

  • State 3 low precipitation
  • State 2 medium precipitation
  • State 1 high precipitation

The fitted transition probabilities are

\[\Gamma := \begin{bmatrix} 0.110 & 0.131 & 0.759 \\ 0.207 & 0.359 & 0.435 \\ 0.196 & 0.282 & 0.522 \end{bmatrix}.\]

The corresponding stationary distribution of the chain is

\[\mu\cdot \Gamma = \mu \implies \mu = \begin{bmatrix}0.183 & 0.276 & 0.541\end{bmatrix}.\]
Fig 4: Latent state process transition probability diagram.

Producing a plot of the precipitation distributions in each of the 3 regimes we see that our model seems to avoid redundancy with clear distinction between latent state distributions.

Fig 5: Precipitation distribution in 3 California basins

Next, we produce plots for our fitted local polynomial regression connecting precipitation $P(t)$ to recharge $R(t)$.

Fig 6: Loess polynomial regression plot of percolation against precipitation.

From these results we observe that recharge is crudely linear in precipitation in all regions which indicates reasonable extrapolation. We also note that there is much higher recharge in San Joaquin and Tulare despite arid conditions due to river runoff, induced recharge and geological factors.

Simulations

We conclude by applying our entire simulation engine to produce 1000 year simulations of groundwater recharge.

Groundwater Simulation

Obtained by combining simulated precipitation with fitted recharge mapping.

Fig 7: 1000-year simulation of groundwater recharge.

Precipitation Densities

Fig 8: Comparison of kernel density estimates based on the fitted HMM and product_1000yr simulations for annual precipitation.

References

  1. SIFIN2025.jpg
    A groundwater market model
    Igor Cialenco and Mike Ludkovski
    SIAM Journal on Financial Mathematics, 2025



Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • Dynamic Phenomena in Multi-period Groundwater Markets
  • Chino Basin Water Rights - An Empirical Overview
  • Pro-rata Distribution for Groundwater Market
  • RL for groundwater markets
  • A Stochastic Game Framework for Groundwater Markets