• Previous Article
    Controllability problems for the heat equation on a half-axis with a bounded control in the Neumann boundary condition
  • MCRF Home
  • This Issue
  • Next Article
    Time-inconsistent stochastic optimal control problems: A backward stochastic partial differential equations approach
doi: 10.3934/mcrf.2020025

Linear-quadratic-Gaussian mean-field-game with partial observation and common noise

1. 

International Center for Decision and Risk Analysis Jindal School of Management, The University of Texas at Dallas and School of Data Sciences, City University of Hong Kong

2. 

Zhongtai Securities Institute for Financial Studies, Shandong University, Jinan, Shandong 250100, China

3. 

Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong

 

Received  January 2019 Revised  February 2020 Published  May 2020

This paper considers a class of linear-quadratic-Gaussian (LQG) mean-field games (MFGs) with partial observation structure for individual agents. Unlike other literature, there are some special features in our formulation. First, the individual state is driven by some common-noise due to the external factor and the state-average thus becomes a random process instead of a deterministic quantity. Second, the sensor function of individual observation depends on state-average thus the agents are coupled in triple manner: not only in their states and cost functionals, but also through their observation mechanism. The decentralized strategies for individual agents are derived by the Kalman filtering and separation principle. The consistency condition is obtained which is equivalent to the wellposedness of some forward-backward stochastic differential equation (FBSDE) driven by common noise. Finally, the related $ \epsilon $-Nash equilibrium property is verified.

Citation: Alain Bensoussan, Xinwei Feng, Jianhui Huang. Linear-quadratic-Gaussian mean-field-game with partial observation and common noise. Mathematical Control & Related Fields, doi: 10.3934/mcrf.2020025
References:
[1]

M. Bardi, Explicit solutions of some linear-quadratic mean field games, Netw. Heterog. Media, 7 (2012), 243-261.  doi: 10.3934/nhm.2012.7.243.  Google Scholar

[2] A. Bensoussan, Stochastic Control of Partially Observable Systems, Cambridge University Press, Cambridge, 1992.  doi: 10.1017/CBO9780511526503.  Google Scholar
[3]

A. Bensoussan, J. Frehse and P. Yam, Mean Field Games and Mean Field Type Control Theory, SpringerBriefs in Mathematics, Springer, New York, 2013. doi: 10.1007/978-1-4614-8508-7.  Google Scholar

[4]

A. BensoussanK. C. J. SungS. C. P. Yam and S. P. Yung, Linear-quadratic mean field games, J. Optim. Theory Appl., 169 (2016), 496-529.  doi: 10.1007/s10957-015-0819-4.  Google Scholar

[5]

R. Carmona and F. Delarue, Probabilistic analysis of mean-field games, SIAM J. Control Optim., 51 (2013), 2705-2734.  doi: 10.1137/120883499.  Google Scholar

[6]

R. Carmona and F. Delarue, Probabilistic theory of mean field games with applications. Ⅱ. Mean field games with common noise and master equations, in Probability Theory and Stochastic Modelling, 84, Springer, Cham, 2018.  Google Scholar

[7]

R. CarmonaJ.-P. Fouque and L.-H. Sun, Mean field games and systemic risk, Commun. Math. Sci., 13 (2015), 911-933.  doi: 10.4310/CMS.2015.v13.n4.a4.  Google Scholar

[8]

C. Dogbé, Modeling crowd dynamics by the mean-field limit approach, Math. Comput. Modelling, 52 (2010), 1506-1520.  doi: 10.1016/j.mcm.2010.06.012.  Google Scholar

[9]

G. M. Erickson, Differential game methods of advertising competition, European Journal Operational Research, 83 (1995), 431-438.   Google Scholar

[10]

W. Fleming and W. Rishel, Deterministic and Stochastic Control of Partially Observable Systems, Springer-Verlag, 1975. Google Scholar

[11]

O. Guéant, J.-M. Lasry and P.-L. Lions, Mean field games and applications, Paris-Princeton Lectures on Mathematical Finance 2010, Lecture Notes in Mathematics, 2003, Springer, Berlin, 2011,205–266. doi: 10.1007/978-3-642-14660-2_3.  Google Scholar

[12]

A. Haurie and P. Marcotte, On the relationship between Nash-Cournot and Wardrop equilibria, Networks, 15 (1985), 295-308.  doi: 10.1002/net.3230150303.  Google Scholar

[13]

G.-D. Hu, Symplectic Runge-Kutta methods for the linear quadratic regulator problem, Int. J. Math. Anal. (Ruse), 1 (2007), 293-304.   Google Scholar

[14]

J. HuangY. Hu and T. Nie, Linear-quadratic-Gaussian mixed mean-field games with heterogeneous input constraints, SIAM J. Control Optim., 56 (2018), 2835-2877.  doi: 10.1137/17M1151420.  Google Scholar

[15]

J. Huang and S. Wang, Dynamic optimization of large-population systems with partial information, J. Optim. Theory Appl., 168 (2016), 231-245.  doi: 10.1007/s10957-015-0740-x.  Google Scholar

[16]

M. Huang, Large-population LQG games involving a major player: The Nash certainty equivalence principle, SIAM J. Control Optim., 48 (2010), 3318-3353.  doi: 10.1137/080735370.  Google Scholar

[17]

M. HuangP. E. Caines and R. P. Malhamé, Uplink power adjustment in wireless communication systems: A stochastic control analysis, IEEE Trans. Automat. Control, 49 (2004), 1693-1708.  doi: 10.1109/TAC.2004.835388.  Google Scholar

[18]

M. Huang, P. E. Caines and R. P. Malhamé, Distributed multi-agent decision-making with partial observations: Asymptotic Nash equilibria, Proceedings of the 17th International Symposium on Mathematical Theory of Networks and Systems (MTNS), (2006), 2725–2730. Google Scholar

[19]

M. HuangP. E. Caines and R. P. Malhamé, Large-population cost-coupled LQG problems with non-uniform agents: Individual-mass behavior and decentralized $\epsilon$-Nash equilibria, IEEE Trans. Automat. Control, 52 (2007), 1560-1571.  doi: 10.1109/TAC.2007.904450.  Google Scholar

[20]

M. HuangR. P. Malhamé and P. E. Caines, Large population stochastic dynamic games: Closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle, Commun. Inf. Syst., 6 (2006), 221-251.  doi: 10.4310/CIS.2006.v6.n3.a5.  Google Scholar

[21]

G. Kallianpur, Stochastic filtering theory, in Applications of Mathematics, 13, Springer-Verlag, New York-Berlin, 1980.  Google Scholar

[22]

A. C. Kizilkale and R. P. Malhamé, Collective target tracking mean field control for Markovian jump-driven models of electric water heating loads, in Control of Complex Systems: Theory and Applications, 2016,559–584. Google Scholar

[23]

P. E. Kloeden and E. Platen, Numerical solution of sochastic differential equations, in Applications of Mathematics (New York), 23, Springer-Verlag, Berlin, 1992. doi: 10.1007/978-3-662-12616-5.  Google Scholar

[24]

V. E. Lambson, Self-enforcing collusion in large dynamic markets, J. Econom. Theory, 34 (1984), 282-291.  doi: 10.1016/0022-0531(84)90145-5.  Google Scholar

[25]

J.-M. Lasry and P.-L. Lions, Mean field games, Jpn. J. Math., 2 (2007), 229-260.  doi: 10.1007/s11537-007-0657-8.  Google Scholar

[26]

J. Ma and J. Yong, Forward-backward stochastic differential equations and their applications, in Lecture Notes in Mathematics, 1702, Springer-Verlag, Berlin, 1999.  Google Scholar

[27]

Z. MaD. Callaway and I. Hiskens, Decentralized charging control of large populations of plug-in electric vehicles, IEEE Transactions on Control Systems Technology, 21 (2013), 67-78.   Google Scholar

[28]

S. L. Nguyen and M. Huang, Linear-quadratic-Gaussian mixed games with continuum-parameterized minor players, SIAM J. Control Optim., 50 (2012), 2907-2937.  doi: 10.1137/110841217.  Google Scholar

[29]

M. NourianP. E. CainesR. P. Malhamé and M. Huang, Nash, social and centralized solutions to consensus problems via mean field control theory, IEEE Trans. Automat. Control, 58 (2013), 639-653.  doi: 10.1109/TAC.2012.2215399.  Google Scholar

[30]

B. Øksendal, Stochastic Differential Equations. An Introduction with Applications, Universitext, $6^th$ edition, Springer-Verlag, Berlin, 2003. doi: 10.1007/978-3-642-14394-6.  Google Scholar

[31]

N. Şen and P. E. Caines, Nonlinear filtering theory for McKean–Vlasov type stochastic differential equations, SIAM J. Control Optim., 54 (2016), 153-174.  doi: 10.1137/15M1013304.  Google Scholar

[32]

H. TembineQ. Zhu and T. Başar, Risk-sensitive mean-field games, IEEE Trans. Automat. Control, 59 (2014), 835-850.  doi: 10.1109/TAC.2013.2289711.  Google Scholar

[33]

G. Wang and Z. Wu, Kalman-Bucy filtering equations of forward and backward stochastic systems and applications to recursive optimal control problems, J. Math. Anal. Appl., 342 (2008), 1280-1296.  doi: 10.1016/j.jmaa.2007.12.072.  Google Scholar

[34]

G. WangZ. Wu and J. Xiong, A linear-quadratic optimal control problem of forward-backward stochastic differential equations with partial information, IEEE Trans. Automat. Control, 60 (2015), 2904-2916.  doi: 10.1109/TAC.2015.2411871.  Google Scholar

[35]

Y. WeintraubL. Benkard and B. Van Roy, Markov perfect industry dynamics with many firms, Econometrica, 76 (2008), 1375-1411.  doi: 10.3982/ECTA6158.  Google Scholar

[36]

W. M. Wonham., On the separation theorem of stochastic control., SIAM J. Control Optim., 6 (1968), 312–326. doi: 10.1137/0306023.  Google Scholar

[37]

H. YinP. G. MehtaS. P. Meyn and U. V. Shanbhag, Synchronization of coupled oscillators is a game, IEEE Trans. Automat. Control, 57 (2012), 920-935.  doi: 10.1109/TAC.2011.2168082.  Google Scholar

[38]

J. Yong and X. Y. Zhou, Stochastic controls. Hamiltonian systems and HJB equations, in Applications of Mathematics (New York), 43, Springer-Verlag, New York, 1999. doi: 10.1007/978-1-4612-1466-3.  Google Scholar

show all references

References:
[1]

M. Bardi, Explicit solutions of some linear-quadratic mean field games, Netw. Heterog. Media, 7 (2012), 243-261.  doi: 10.3934/nhm.2012.7.243.  Google Scholar

[2] A. Bensoussan, Stochastic Control of Partially Observable Systems, Cambridge University Press, Cambridge, 1992.  doi: 10.1017/CBO9780511526503.  Google Scholar
[3]

A. Bensoussan, J. Frehse and P. Yam, Mean Field Games and Mean Field Type Control Theory, SpringerBriefs in Mathematics, Springer, New York, 2013. doi: 10.1007/978-1-4614-8508-7.  Google Scholar

[4]

A. BensoussanK. C. J. SungS. C. P. Yam and S. P. Yung, Linear-quadratic mean field games, J. Optim. Theory Appl., 169 (2016), 496-529.  doi: 10.1007/s10957-015-0819-4.  Google Scholar

[5]

R. Carmona and F. Delarue, Probabilistic analysis of mean-field games, SIAM J. Control Optim., 51 (2013), 2705-2734.  doi: 10.1137/120883499.  Google Scholar

[6]

R. Carmona and F. Delarue, Probabilistic theory of mean field games with applications. Ⅱ. Mean field games with common noise and master equations, in Probability Theory and Stochastic Modelling, 84, Springer, Cham, 2018.  Google Scholar

[7]

R. CarmonaJ.-P. Fouque and L.-H. Sun, Mean field games and systemic risk, Commun. Math. Sci., 13 (2015), 911-933.  doi: 10.4310/CMS.2015.v13.n4.a4.  Google Scholar

[8]

C. Dogbé, Modeling crowd dynamics by the mean-field limit approach, Math. Comput. Modelling, 52 (2010), 1506-1520.  doi: 10.1016/j.mcm.2010.06.012.  Google Scholar

[9]

G. M. Erickson, Differential game methods of advertising competition, European Journal Operational Research, 83 (1995), 431-438.   Google Scholar

[10]

W. Fleming and W. Rishel, Deterministic and Stochastic Control of Partially Observable Systems, Springer-Verlag, 1975. Google Scholar

[11]

O. Guéant, J.-M. Lasry and P.-L. Lions, Mean field games and applications, Paris-Princeton Lectures on Mathematical Finance 2010, Lecture Notes in Mathematics, 2003, Springer, Berlin, 2011,205–266. doi: 10.1007/978-3-642-14660-2_3.  Google Scholar

[12]

A. Haurie and P. Marcotte, On the relationship between Nash-Cournot and Wardrop equilibria, Networks, 15 (1985), 295-308.  doi: 10.1002/net.3230150303.  Google Scholar

[13]

G.-D. Hu, Symplectic Runge-Kutta methods for the linear quadratic regulator problem, Int. J. Math. Anal. (Ruse), 1 (2007), 293-304.   Google Scholar

[14]

J. HuangY. Hu and T. Nie, Linear-quadratic-Gaussian mixed mean-field games with heterogeneous input constraints, SIAM J. Control Optim., 56 (2018), 2835-2877.  doi: 10.1137/17M1151420.  Google Scholar

[15]

J. Huang and S. Wang, Dynamic optimization of large-population systems with partial information, J. Optim. Theory Appl., 168 (2016), 231-245.  doi: 10.1007/s10957-015-0740-x.  Google Scholar

[16]

M. Huang, Large-population LQG games involving a major player: The Nash certainty equivalence principle, SIAM J. Control Optim., 48 (2010), 3318-3353.  doi: 10.1137/080735370.  Google Scholar

[17]

M. HuangP. E. Caines and R. P. Malhamé, Uplink power adjustment in wireless communication systems: A stochastic control analysis, IEEE Trans. Automat. Control, 49 (2004), 1693-1708.  doi: 10.1109/TAC.2004.835388.  Google Scholar

[18]

M. Huang, P. E. Caines and R. P. Malhamé, Distributed multi-agent decision-making with partial observations: Asymptotic Nash equilibria, Proceedings of the 17th International Symposium on Mathematical Theory of Networks and Systems (MTNS), (2006), 2725–2730. Google Scholar

[19]

M. HuangP. E. Caines and R. P. Malhamé, Large-population cost-coupled LQG problems with non-uniform agents: Individual-mass behavior and decentralized $\epsilon$-Nash equilibria, IEEE Trans. Automat. Control, 52 (2007), 1560-1571.  doi: 10.1109/TAC.2007.904450.  Google Scholar

[20]

M. HuangR. P. Malhamé and P. E. Caines, Large population stochastic dynamic games: Closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle, Commun. Inf. Syst., 6 (2006), 221-251.  doi: 10.4310/CIS.2006.v6.n3.a5.  Google Scholar

[21]

G. Kallianpur, Stochastic filtering theory, in Applications of Mathematics, 13, Springer-Verlag, New York-Berlin, 1980.  Google Scholar

[22]

A. C. Kizilkale and R. P. Malhamé, Collective target tracking mean field control for Markovian jump-driven models of electric water heating loads, in Control of Complex Systems: Theory and Applications, 2016,559–584. Google Scholar

[23]

P. E. Kloeden and E. Platen, Numerical solution of sochastic differential equations, in Applications of Mathematics (New York), 23, Springer-Verlag, Berlin, 1992. doi: 10.1007/978-3-662-12616-5.  Google Scholar

[24]

V. E. Lambson, Self-enforcing collusion in large dynamic markets, J. Econom. Theory, 34 (1984), 282-291.  doi: 10.1016/0022-0531(84)90145-5.  Google Scholar

[25]

J.-M. Lasry and P.-L. Lions, Mean field games, Jpn. J. Math., 2 (2007), 229-260.  doi: 10.1007/s11537-007-0657-8.  Google Scholar

[26]

J. Ma and J. Yong, Forward-backward stochastic differential equations and their applications, in Lecture Notes in Mathematics, 1702, Springer-Verlag, Berlin, 1999.  Google Scholar

[27]

Z. MaD. Callaway and I. Hiskens, Decentralized charging control of large populations of plug-in electric vehicles, IEEE Transactions on Control Systems Technology, 21 (2013), 67-78.   Google Scholar

[28]

S. L. Nguyen and M. Huang, Linear-quadratic-Gaussian mixed games with continuum-parameterized minor players, SIAM J. Control Optim., 50 (2012), 2907-2937.  doi: 10.1137/110841217.  Google Scholar

[29]

M. NourianP. E. CainesR. P. Malhamé and M. Huang, Nash, social and centralized solutions to consensus problems via mean field control theory, IEEE Trans. Automat. Control, 58 (2013), 639-653.  doi: 10.1109/TAC.2012.2215399.  Google Scholar

[30]

B. Øksendal, Stochastic Differential Equations. An Introduction with Applications, Universitext, $6^th$ edition, Springer-Verlag, Berlin, 2003. doi: 10.1007/978-3-642-14394-6.  Google Scholar

[31]

N. Şen and P. E. Caines, Nonlinear filtering theory for McKean–Vlasov type stochastic differential equations, SIAM J. Control Optim., 54 (2016), 153-174.  doi: 10.1137/15M1013304.  Google Scholar

[32]

H. TembineQ. Zhu and T. Başar, Risk-sensitive mean-field games, IEEE Trans. Automat. Control, 59 (2014), 835-850.  doi: 10.1109/TAC.2013.2289711.  Google Scholar

[33]

G. Wang and Z. Wu, Kalman-Bucy filtering equations of forward and backward stochastic systems and applications to recursive optimal control problems, J. Math. Anal. Appl., 342 (2008), 1280-1296.  doi: 10.1016/j.jmaa.2007.12.072.  Google Scholar

[34]

G. WangZ. Wu and J. Xiong, A linear-quadratic optimal control problem of forward-backward stochastic differential equations with partial information, IEEE Trans. Automat. Control, 60 (2015), 2904-2916.  doi: 10.1109/TAC.2015.2411871.  Google Scholar

[35]

Y. WeintraubL. Benkard and B. Van Roy, Markov perfect industry dynamics with many firms, Econometrica, 76 (2008), 1375-1411.  doi: 10.3982/ECTA6158.  Google Scholar

[36]

W. M. Wonham., On the separation theorem of stochastic control., SIAM J. Control Optim., 6 (1968), 312–326. doi: 10.1137/0306023.  Google Scholar

[37]

H. YinP. G. MehtaS. P. Meyn and U. V. Shanbhag, Synchronization of coupled oscillators is a game, IEEE Trans. Automat. Control, 57 (2012), 920-935.  doi: 10.1109/TAC.2011.2168082.  Google Scholar

[38]

J. Yong and X. Y. Zhou, Stochastic controls. Hamiltonian systems and HJB equations, in Applications of Mathematics (New York), 43, Springer-Verlag, New York, 1999. doi: 10.1007/978-1-4612-1466-3.  Google Scholar

Figure 1.  Trajectories of the type-1 agents' states when N = 500
Figure 2.  Trajectories of the type-2 agents' states when N = 500
Figure 3.  Trajectories of the type-1 agents state average and the mean field term
Figure 4.  Trajectories of the type-2 agents state average and the mean field term
[1]

Adel Chala, Dahbia Hafayed. On stochastic maximum principle for risk-sensitive of fully coupled forward-backward stochastic control of mean-field type with application. Evolution Equations & Control Theory, 2020, 9 (3) : 817-843. doi: 10.3934/eect.2020035

[2]

Haiyan Zhang. A necessary condition for mean-field type stochastic differential equations with correlated state and observation noises. Journal of Industrial & Management Optimization, 2016, 12 (4) : 1287-1301. doi: 10.3934/jimo.2016.12.1287

[3]

Jianhui Huang, Shujun Wang, Zhen Wu. Backward-forward linear-quadratic mean-field games with major and minor agents. Probability, Uncertainty and Quantitative Risk, 2016, 1 (0) : 8-. doi: 10.1186/s41546-016-0009-9

[4]

Xin Chen, Ana Bela Cruzeiro. Stochastic geodesics and forward-backward stochastic differential equations on Lie groups. Conference Publications, 2013, 2013 (special) : 115-121. doi: 10.3934/proc.2013.2013.115

[5]

Dariusz Borkowski. Forward and backward filtering based on backward stochastic differential equations. Inverse Problems & Imaging, 2016, 10 (2) : 305-325. doi: 10.3934/ipi.2016002

[6]

Juan Li, Wenqiang Li. Controlled reflected mean-field backward stochastic differential equations coupled with value function and related PDEs. Mathematical Control & Related Fields, 2015, 5 (3) : 501-516. doi: 10.3934/mcrf.2015.5.501

[7]

Diogo Gomes, Marc Sedjro. One-dimensional, forward-forward mean-field games with congestion. Discrete & Continuous Dynamical Systems - S, 2018, 11 (5) : 901-914. doi: 10.3934/dcdss.2018054

[8]

Yufeng Shi, Tianxiao Wang, Jiongmin Yong. Mean-field backward stochastic Volterra integral equations. Discrete & Continuous Dynamical Systems - B, 2013, 18 (7) : 1929-1967. doi: 10.3934/dcdsb.2013.18.1929

[9]

Jie Xiong, Shuaiqi Zhang, Yi Zhuang. A partially observed non-zero sum differential game of forward-backward stochastic differential equations and its application in finance. Mathematical Control & Related Fields, 2019, 9 (2) : 257-276. doi: 10.3934/mcrf.2019013

[10]

Yufeng Shi, Tianxiao Wang, Jiongmin Yong. Optimal control problems of forward-backward stochastic Volterra integral equations. Mathematical Control & Related Fields, 2015, 5 (3) : 613-649. doi: 10.3934/mcrf.2015.5.613

[11]

Kai Du, Jianhui Huang, Zhen Wu. Linear quadratic mean-field-game of backward stochastic differential systems. Mathematical Control & Related Fields, 2018, 8 (3&4) : 653-678. doi: 10.3934/mcrf.2018028

[12]

Tianxiao Wang. Characterizations of equilibrium controls in time inconsistent mean-field stochastic linear quadratic problems. I. Mathematical Control & Related Fields, 2019, 9 (2) : 385-409. doi: 10.3934/mcrf.2019018

[13]

Rui Mu, Zhen Wu. Nash equilibrium points of recursive nonzero-sum stochastic differential games with unbounded coefficients and related multiple\\ dimensional BSDEs. Mathematical Control & Related Fields, 2017, 7 (2) : 289-304. doi: 10.3934/mcrf.2017010

[14]

Jianhui Huang, Xun Li, Jiongmin Yong. A linear-quadratic optimal control problem for mean-field stochastic differential equations in infinite horizon. Mathematical Control & Related Fields, 2015, 5 (1) : 97-139. doi: 10.3934/mcrf.2015.5.97

[15]

Diogo A. Gomes, Gabriel E. Pires, Héctor Sánchez-Morgado. A-priori estimates for stationary mean-field games. Networks & Heterogeneous Media, 2012, 7 (2) : 303-314. doi: 10.3934/nhm.2012.7.303

[16]

Jiongmin Yong. Forward-backward evolution equations and applications. Mathematical Control & Related Fields, 2016, 6 (4) : 653-704. doi: 10.3934/mcrf.2016019

[17]

Fabio Paronetto. Elliptic approximation of forward-backward parabolic equations. Communications on Pure & Applied Analysis, 2020, 19 (2) : 1017-1036. doi: 10.3934/cpaa.2020047

[18]

Max-Olivier Hongler. Mean-field games and swarms dynamics in Gaussian and non-Gaussian environments. Journal of Dynamics & Games, 2020, 7 (1) : 1-20. doi: 10.3934/jdg.2020001

[19]

Salah Eddine Choutri, Boualem Djehiche, Hamidou Tembine. Optimal control and zero-sum games for Markov chains of mean-field type. Mathematical Control & Related Fields, 2019, 9 (3) : 571-605. doi: 10.3934/mcrf.2019026

[20]

Diogo A. Gomes, Hiroyoshi Mitake, Kengo Terai. The selection problem for some first-order stationary Mean-field games. Networks & Heterogeneous Media, 2020  doi: 10.3934/nhm.2020019

2019 Impact Factor: 0.857

Metrics

  • PDF downloads (66)
  • HTML views (161)
  • Cited by (0)

Other articles
by authors

[Back to Top]