\`x^2+y_1+z_12^34\`
Advanced Search
Article Contents
Article Contents

Optimal sensor placement under model uncertainty in the weak-constraint 4D-Var framework

  • *Corresponding author: Alen Alexanderian

    *Corresponding author: Alen Alexanderian 

The work of AA was supported in part by the US National Science Foundation (NSF) grant DMS-2111044. HD and AKS were supported by the Department of Energy, Office of Science, Advanced Scientific Computing Research (ASCR) Program through the award DE-SC0023188. AKS was also supported by the NSF, in part, through the award DMS-1845406. VR was supported by the U.S. Department of Energy, Office of Science, ASCR Program under contracts DE-AC02-06CH11357 and DE-SC0023188.

Abstract / Introduction Full Text(HTML) Figure(6) / Table(7) Related Papers Cited by
  • In data assimilation, the model may be subject to uncertainties and errors. The weak-constraint data assimilation framework enables incorporating model uncertainty in the dynamics of the governing equations. We propose a new framework for near-optimal sensor placement in the weak-constrained setting. This is achieved by first deriving a design criterion based on the expected information gain, which involves the Kullback-Leibler divergence from the forecast prior to the posterior distribution. An explicit formula for this criterion is provided, assuming that the model error and background are independent and Gaussian and the dynamics are linear. We discuss algorithmic approaches to efficiently evaluate this criterion through randomized approximations. To provide further insight and flexibility in computations, we also provide alternative expressions for the criteria. We provide an algorithm to find near-optimal experimental designs using column subset selection, including a randomized algorithm that avoids computing the adjoint of the forward operator. Through numerical experiments in one and two spatial dimensions, we show the effectiveness of our proposed methods.

    Mathematics Subject Classification: Primary: 65M32, 62K05; Secondary: 65F40, 65F60.

    Citation:

    \begin{equation} \\ \end{equation}
  • 加载中
  • Figure 1.  The sensor locations determined by RAF-OED for different values of $ k $

    Figure 2.  Comparison of different sensor placement methods. The histogram represents the criterion values for random designs

    Figure 3.  Figures illustrating the candidate sensor locations (left), initial condition $ c_0 $ (middle), and velocity field $ \boldsymbol v $ (right)

    Figure 4.  Snapshots of $ c_h $

    Figure 5.  Comparison between the RAF-OED (dashed black) and the random designs (histogram)

    Figure 6.  The sensor locations determined by RAF-OED for different $ k $ values

    Table 1.  A summary of matrices involved for all the different formulations for computing the WC4D-Var criterion $ \phi_{\mathrm{EIG}} $

    Formulation/Matrix $ {\bf{\Gamma }}_\mathit{\boldsymbol{R}}^{-1} $ $ {\bf{\Gamma }}_{{\rm{mod}}}^{-1} $ $ {\bf{\Gamma }}_{{\rm{mod}}}^{\frac 12} $ $ {\bf{L}}^{-1} / {\bf{L}}^{\top} $ Matrix Size
    Unpreconditioned eq. 25 $\checkmark$ $\checkmark$ $ N_d $
    Preconditioned (22) $\checkmark$ $\checkmark$ $\checkmark$ $ N_d $
    SP-I (26) $ N_d+N_m $
    SP-II (27) $\checkmark$ $ 2N_d $
     | Show Table
    DownLoad: CSV

    Table 2.  Computational complexity of XNysTrace and Stochastic Lanczos Quadrature algorithms. Here, $N$ denotes the number of random probes, and $T_\boldsymbol {f({\bf{E}})}$ represents the computational cost of a single application of $f({\bf{E}})$ via Lanczos, where $n_{\rm{iter}}$ is the number of Lanczos iterations. The cost $T_{{\bf{E}}}$ will depend of each formulation; see Table 1

    Algorithm Operations (flops) Reference
    SLQ $N\cdot T_\boldsymbol {f({\bf{E}})}=N(n_{\rm{iter}}T_{{\bf{E}}}+d\cdot n^2_{\rm{iter}} )$ [48, Section 3]
    XNysTrace+Lanczos $N\cdot T_\boldsymbol {f({\bf{E}})}+\mathcal{O}(N^2 d)$ [18, Section 2.2]
     | Show Table
    DownLoad: CSV

    Table 3.  Results for Log-Determinant Approximation using SLQ with $ N = 2^3 $

    Formulation Matrix size $ d $ Lanczos Iters Rel. Error
    Unpreconditioned (25) $ 4010 $ 533 $ 7.9 \times 10^{-4} $
    Preconditioned (22) $ 4010 $ 37 $ 1.5 \times 10^{-4} $
    SP-I (26) $ 8300 $ 350 $ 1.1 \times 10^{-4} $
    SP-II (27) $ 8020 $ 550 $ 9.0 \times 10^{-2} $
     | Show Table
    DownLoad: CSV

    Table 4.  Number of samples, average Lanczos iterations, average relative errors, and standard deviations for Hutchinson's and XNysTrace estimators over 100 trials

    Samples Lanczos SLQ XNysTrace+Lanczos
    $ (N) $ Iters. Mean Std. Dev. Mean Std. Dev.
    2 73.5 $ 2.99 \times 10^{-4} $ $ 2.49 \times 10^{-4} $ $ 4.17 \times 10^{-4} $ $ 2.99 \times 10^{-4} $
    4 148.2 $ 2.08 \times 10^{-4} $ $ 1.62 \times 10^{-4} $ $ 2.40 \times 10^{-4} $ $ 1.85 \times 10^{-4} $
    8 296.3 $ 1.48 \times 10^{-4} $ $ 1.28 \times 10^{-4} $ $ 1.80 \times 10^{-4} $ $ 1.48 \times 10^{-4} $
    16 592.1 $ 1.01 \times 10^{-4} $ $ 8.08 \times 10^{-5} $ $ 1.11 \times 10^{-4} $ $ 7.85 \times 10^{-5} $
    32 1186.9 $ 8.31 \times 10^{-5} $ $ 5.48 \times 10^{-5} $ $ 5.59 \times 10^{-5} $ $ 4.11 \times 10^{-5} $
    64 2369.5 $ 4.85 \times 10^{-5} $ $ 4.58 \times 10^{-5} $ $ 4.01 \times 10^{-6} $ $ 3.18 \times 10^{-6} $
    128 4741.9 $ 3.75 \times 10^{-5} $ $ 2.90 \times 10^{-5} $ $ 1.05 \times 10^{-7} $ $ 8.24 \times 10^{-8} $
     | Show Table
    DownLoad: CSV

    Table 5.  EIG Values from Different Methods

    Method $ \phi_{\mathrm{EIG}}({\bf{S}}) $ Method $ \phi_{\mathrm{EIG}}({\bf{S}}) $
    Best ($ {\bf{S}}_ \rm{max} $) 106.30 Best ($ {\bf{S}}_ \rm{max} $) unknown
    Greedy ($ {\bf{S}}={\bf{S}}_\text{G} $) 95.76 Greedy ($ {\bf{S}}={\bf{S}}_\text{G} $) 127.31
    Algorithm 1 97.71 Algorithm 1 125.89
    RAF-OED 97.71 RAF-OED 127.22
    k = 5 k = 10
     | Show Table
    DownLoad: CSV

    Table 6.  Comparison of SLQ and XNysTrace over 100 trials

    $ N $ SLQ XNysTrace
    Mean RSD SD Mean RSD SD
    2 $ 4.1612 \times 10^{5} $ $ 8.83 \times 10^{-2} $% 368 $ 3.9789 \times 10^{5} $ $ 2.96 \times 10^{-2} $% 118
    4 $ 4.1612 \times 10^{5} $ $ 6.75 \times 10^{-2} $% 281 $ 3.9790 \times 10^{5} $ $ 2.29 \times 10^{-2} $ % 91
    8 $ 4.1609 \times 10^{5} $ $ 5.15 \times 10^{-2} $% $ \mathit{\boldsymbol{214}} $ $ 3.9788 \times 10^{5} $ $ 1.56 \times 10^{-2} $% $ \mathit{\boldsymbol{62}} $
     | Show Table
    DownLoad: CSV

    Table 7.  SC-based EIG values evaluated on sensor sets selected for the WC formulation. $ \mathbf{S}^\text{RGKS}_\text{WC} $ denotes the WC-selection obtained using the RAF-OED algorithm

    $ \phi_{\mathrm{EIG}}^\text{SC}(\mathbf{S}^\text{RGKS}_\text{WC}) $
    $ \alpha = 5\% $ $ 179.65 $
    $ \alpha = 3\% $ $ 178.26 $
    $ \alpha = 2\% $ $ 175.39 $
    $ \alpha = 1\% $ $ 170.54 $
     | Show Table
    DownLoad: CSV
  • [1] A. Alexanderian, Expository Paper: A primer on homogenization of elliptic PDEs with stationary and ergodic random coefficient functions, The Rocky Mountain Journal of Mathematics, 45 (2015), 703-735.  doi: 10.1216/RMJ-2015-45-3-703.
    [2] A. AlexanderianP. J. Gloor and O. Ghattas, On Bayesian A- and D-Optimal Experimental Designs in Infinite Dimensions, Bayesian Analysis, 11 (2016), 671-695.  doi: 10.1214/15-BA969.
    [3] A. Alexanderian, R. Nicholson and N. Petra, Optimal design of large-scale nonlinear Bayesian inverse problems under model uncertainty, Inverse Problems, 40 (2024), 095001, 30 pp. doi: 10.1088/1361-6420/ad602e.
    [4] A. Alexanderian and A. K. Saibaba, Efficient D-optimal design of experiments for infinite-dimensional bayesian linear inverse problems, SIAM Journal on Scientific Computing, 40 (2018), A2956-A2985. doi: 10.1137/17M115712X.
    [5] S. Asprey and S. Macchietto, Designing robust optimal dynamic experiments, Journal of Process Control, 12 (2002), 545-556.  doi: 10.1016/S0959-1524(01)00020-8.
    [6] A. Attia, S. Leyffer and T. Munson, Robust A-optimal experimental design for bayesian inverse problems, preprint, arXiv: 2305.03855, 2023.
    [7] Y. BaekW. Aquino and S. Mukherjee, Generalized Bayes approach to inverse problems with model misspecification, Inverse Problems, 39 (2023), 105011.  doi: 10.1088/1361-6420/acf51c.
    [8] Z. BaiG. Fahey and G. Golub, Some large-scale matrix computation problems, Journal of Computational and Applied Mathematics, 74 (1996), 71-89.  doi: 10.1016/0377-0427(96)00018-0.
    [9] I. A. Baratta, J. P. Dean, J. S. Dokken, M. Habera, J. S. Hale, C. N. Richardson, M. E. Rognes, M. W. Scroggs, N. Sime and G. N. Wells, DOLFINx: The next generation FEniCS problem solving environment, Dec. 2023.
    [10] J. BehrmannW. GrathwohlR. T. Q. ChenD. Duvenaud and J.-H. Jacobsen, Invertible Residual Networks, Proceedings of Machine Learning Research, 97 (2019), 573-582. 
    [11] A. Bensoussan, J. Lions and G. Papanicolaou, Asymptotic Analysis for Periodic Structures, AMS Chelsea Publishing Series. North-Holland Publishing Company; Sole distributors for the U.S.A. and Canada, Elsevier North-Holland, 1978.
    [12] M. Benzi, G. H. Golub and J. Liesen, Numerical solution of saddle point problems, Acta Numerica, 14 (2005), 1–137. doi: 10.1017/S0962492904000212.
    [13] R. Bhatia, Matrix Analysis, Grad. Texts in Math., 169, Springer-Verlag, New York, 1997.
    [14] S. Boyd and  L. VandenbergheConvex Optimization, Cambridge University Press, 2004. 
    [15] A. Carrassi, M. Bocquet, L. Bertino and G. Evensen, Data assimilation in the geosciences: An overview of methods, issues, and perspectives, Wiley Interdisciplinary Reviews: Climate Change, 9 (2018), e535. doi: 10.1002/wcc.535.
    [16] H. DetteM. Konstantinou and A. Zhigljavsky, A new approach to optimal designs for correlated observations, The Annals of Statistics, 45 (2017), 1579-1608.  doi: 10.1214/16-AOS1500.
    [17] R. Dong, H. Leng, J. Song, C. Zhao, J. Li and Y. Lan, A system for hybrid 4DVar-EnKF data assimilation based on deep learning, In 2022 IEEE International Conference on Systems, Man and Cybernetics (SMC), (2022), 2358-2365.
    [18] E. N. Epperly, J. A. Tropp and R. J. Webber, XTrace: Making the most of every sample in stochastic trace estimation, SIAM Journal on Matrix Analysis and Applications, 45 (2024), 1–23. doi: 10.1137/23M1548323.
    [19] S. Eswar, V. Rao and A. K. Saibaba, Bayesian {D}-Optimal experimental designs via column subset selection, preprint, arXiv: 2402.16000, 2024.
    [20] G. Evensen, Sequential data assimilation with a nonlinear quasi‐geostrophic model using Monte Carlo methods to forecast error statistics, Journal of Geophysical Research, 99 (1994), 10143-10162.  doi: 10.1029/94JC00572.
    [21] M. Fisher, Y. Trémolet, H. Auvinen, D. Tan and P. Poli, Weak-constraint and long-window 4D-Var, ECMWF Technical Memoranda, 47 (2011).
    [22] M. A. Freitag, Numerical linear algebra in data assimilation, Gesellschaft für Angewandte Mathematik und Mechanik, 43 (2020), e202000014, 27 pp. doi: 10.1002/gamm.202000014.
    [23] G. N. GaticaN. Heuer and S. Meddahi, On the numerical analysis of nonlinear twofold saddle point problems, IMA Journal of Numerical Analysis, 23 (2003), 301-330. 
    [24] M. Ghil and P. Malanotte-Rizzoli, Data assimilation in meteorology and oceanography, Advances in Geophysics, 33 (1991), 141-266.  doi: 10.1016/S0065-2687(08)60442-2.
    [25] G. H. Golub and C. F. Van Loan, Matrix Computations, 4$^{th}$ edition, Johns Hopkins University Press, Philadelphia, PA, 2013.
    [26] S. GrattonS. GürolE. Simon and P. L. Toint, Guaranteeing the convergence of the saddle formulation for weakly constrained 4{D}-var data assimilation, Quarterly Journal of the Royal Meteorological Society, 144 (2018), 2592-2602.  doi: 10.1002/qj.3355.
    [27] M. Gu and S. C. Eisenstat, Efficient algorithms for computing a strong rank-revealing QR factorization, SIAM Journal on Scientific Computing, 17 (1996), 848-869.  doi: 10.1137/0917055.
    [28] N. HalkoP. G. Martinsson and J. A. Tropp, Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions, SIAM Review, 53 (2011), 217-288.  doi: 10.1137/090771806.
    [29] M. HamrudM. Bonavita and L. Isaksen, EnKF and hybrid gain ensemble data assimilation. Part I: EnKF implementation, Monthly Weather Review, 143 (2015), 4847-4864.  doi: 10.1175/MWR-D-14-00333.1.
    [30] C. R. HarrisK. J. MillmanS. J. van der WaltR. GommersP. VirtanenD. CournapeauE. WieserJ. TaylorS. BergN. J. SmithR. KernM. PicusS. HoyerM. H. van KerkwijkM. BrettA. HaldaneJ. F. del RíoM. WiebeP. PetersonP. Gérard-MarchantK. SheppardT. ReddyW. WeckesserH. AbbasiC. Gohlke and T. E. Oliphant, Array programming with NumPy, Nature, 585 (2020), 357-362.  doi: 10.1038/s41586-020-2649-2.
    [31] N. J. Higham and S. H. Cheng, Modifying the inertia of matrices arising in optimization, Linear Algebra and its Applications, 275/276 (1998), 261-279.  doi: 10.1016/S0024-3795(97)10015-5.
    [32] R. A. Horn and C. R. Johnson, Matrix Analysis, 2$^{nd}$ edition, Cambridge University Press, Cambridge, second edition, 2013.
    [33] X. Huan, J. Jagalur and Y. Marzouk, Optimal experimental design: Formulations and computations, Acta Numerica, 33 (2024), 715–840. doi: 10.1017/S0962492924000023.
    [34] X.-Y. Huang, Q. Xiao, D. M. Barker, X. Zhang, J. Michalakes, W. Huang, T. Henderson, J. Bray, Y. Chen, Z. Ma, J. Dudhia, Y. Guo, X. Zhang, D.-J. Won, H.-C. Lin and Y.-H. Kuo, Four-dimensional variational data assimilation for WRF: Formulation and preliminary results, Monthly Weather Review, 137 (2009), 299–314. doi: 10.1175/2008MWR2577.1.
    [35] M. Hutchinson, A stochastic estimator of the trace of the influence matrix for laplacian smoothing splines, Communications in Statistics - Simulation and Computation, 18 (1989), 1059-1076.  doi: 10.1080/03610918908812806.
    [36] J. Kaipio and E. Somersalo, Statistical and Computational Inverse Problems, Springer, Dordrecht, 2005.
    [37] J. Kiefer, Optimum experimental designs, J. Roy. Statist. Soc. Ser. B, 21 (1959), 272-319. 
    [38] P. LaloyauxM. BonavitaM. Chrust and S. Gürol, Exploring the potential and limitations of weak-constraint 4D-Var, Quarterly Journal of the Royal Meteorological Society, 146 (2020), 4067-4082.  doi: 10.1002/qj.3891.
    [39] S. Maio and A. Alexanderian, On submodularity of the expected information gain, preprint, arXiv: 2505.04145, 2025.
    [40] G. L. Nemhauser, L. A. Wolsey and M. L. Fisher, An analysis of approximations for maximizing submodular set functions-I, Math. Program., 14 (1978), 265–294. doi: 10.1007/BF01588971.
    [41] H. NgodockM. CarrierS. Smith and I. Souopgui, Weak and strong constraints variational data assimilation with the NCOM-4DVAR in the Agulhas region using the representer method, Monthly Weather Review, 145 (2017), 1755-1764.  doi: 10.1175/MWR-D-16-0264.1.
    [42] J. Sacks and D. Ylvisaker, Designs for regression problems with correlated errors, The Annals of Mathematical Statistics, 37 (1966), 66-89.  doi: 10.1214/aoms/1177699599.
    [43] A. Scarinci, M. Fehler and Y. Marzouk, Bayesian inference under model misspecification using transport-Lagrangian distances: An application to seismic inversion, preprint, arXiv: 2105.07027, 2021.
    [44] A. M. Stuart, Inverse problems: A Bayesian perspective, Acta Numerica, 19 (2010), 451–559. doi: 10.1017/S0962492910000061.
    [45] M. K. Tippett, J. L. Anderson, C. H. Bishop, T. M. Hamill and J. S. Whitaker, Ensemble square root filters, Statistical Methods for Climate Scientists, 2003.
    [46] Y. Trémolet, Accounting for an imperfect model in 4D-Var, Quarterly Journal of the Royal Meteorological Society, 132 (2006), 2483-2504.  doi: 10.1256/qj.05.224.
    [47] R. E. TsirpitziF. Miller and C. F. Burman, Robust optimal designs using a model misspecification term, Metrika, 86 (2023), 781-804.  doi: 10.1007/s00184-023-00893-6.
    [48] S. UbaruJ. Chen and Y. Saad, Fast estimation of $tr(f({A}))$ via stochastic Lanczos quadrature, SIAM Journal on Matrix Analysis and Applications, 38 (2017), 1075-1099.  doi: 10.1137/16M1104974.
    [49] R. Vershynin, High-Dimensional Probability: An Introduction with Applications in Data Science, volume 47. Cambridge university press, 2018.
    [50] U. Villa, N. Petra and O. Ghattas, HIPPYlib: An extensible software framework for large-scale inverse problems governed by PDEs: Part I: Deterministic inversion and linearized Bayesian inference, ACM Transactions on Mathematical Software, 47 (2021), 34 pp.
    [51] P. Virtanen, R. Gommers, T. E. Oliphant, M. Haberland, T. Reddy, D. Cournapeau, E. Burovski, P. Peterson, W. Weckesser, J. Bright, S. J. van der Walt, M. Brett, J. Wilson, K. J. Millman, N. Mayorov, A. R. J. Nelson, E. Jones, R. Kern, E. Larson, C. J. Carey, İ. Polat, Y. Feng, E. W. Moore, J. VanderPlas, D. Laxalde, J. Perktold, R. Cimrman, I. Henriksen, E. A. Quintero, C. R. Harris, A. M. Archibald, A. H. Ribeiro, F. Pedregosa, P. van Mulbregt and SciPy 1.0 Contributors, SciPy 1.0: Fundamental algorithms for scientific computing in python, Nature Methods, 17 (2020), 261-272. doi: 10.1038/s41592-020-0772-5.
    [52] H. White, Maximum likelihood estimation of misspecified models, Econometrica, 50 (1982), 1-25.  doi: 10.2307/1912526.
    [53] L. Xu, T. Rosmond, J. Goerss and B. Chua, Toward a weak constraint operational 4D-Var system: Application to the Burgers' equation, Meteorologische Zeitschrift, 16 (2007), 741–753. doi: 10.1127/0941-2948/2007/0246.
  • 加载中

Figures(6)

Tables(7)

SHARE

Article Metrics

HTML views(3908) PDF downloads(577) Cited by(0)

Access History

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return