|
1.INTRODUCTIONUnmanned aerial vehicles (UAVs) have been widely emerged and employed in many areas such as military, scientific and civilian scenarios in the past decade. Because of their excellent manoeuvrability, the aerial communication system based on UAVs has been regarded as a new promising paradigm, which can promote rapid and flexible deployment1, 2. Moreover, resource allocation is a crucial communication issue in UAV system. The discussion of resource allocation issue includes but is not limited to transmission power, service users and subchannels. It is important to solve these issues to improve the energy efficiency and coverage of UAV system. For these resource allocation problems, using a competitive way to deploy multiple UAVs has become an emerging research topic. Particularly, game theory is naturally a tool to model such an interactive mechanism between UAVs and enable us to study their complicated behaviors3, 4. In this way, such shared resources between UAVs are frequently considered as coupled constraints in non-cooperative games. As a reasonable solution to non-cooperative games, a generalized Nash equilibrium (GNE) refers to a strategy profile satisfying local and coupled constraints, in which no one can benefit from unilaterally changing its own strategy. Significant theoretic and algorithmic achievements of GNE seeking have been made5. Furthermore, a lot of research has been done to deploy multiple UAVs in a distributed manner to carry out tasks in various applications. This motivates the interest to seek GNE distributedly in UAV games, where the UAVs obtain a GNE by making decisions with local information and communicating with their neighbors through networks. Various distributed algorithms have been investigated, including asymmetric projection algorithms6, projected algorithms via non-smooth projected tracking dynamics7, and approximate projection-free algorithms8. On the other hand, most references on UAV allocation are based on a fundamental assumption that all UAVs exactly share the resource allocation constraints information. Whereas, this argument is not always held in practical situations due to the underlying uncertainty. The uncertainty is usually due to many factors such as heavy communication burden2 or environmental noise interferences9. Hence, the robustness of UAV allocation games needs to be considered. One way to deal with uncertainties is by robust optimization, which provides a paradigm that relies on worst-case analysis. Specifically, computing the worst case means evaluating the solution by realizing the uncertainty that is most unfavourable. By employing the idea of robust optimization in games, the concept of robust game was first proposed in Reference10. This idea is not only limited to theoretical research, but also applied to other practical scenarios11, 12. The motivation of this work is to model a robust game to investigate the resource allocation issues among UAVs, and seek a GNE under the worst case via a distributed environment. The main technical contributions are listed as follows. Firstly, we employ game theory to build a robust model with polyhedral uncertain parameters for describing the competition among UAVs. Secondly, we convert the original robust game into an extended certain game by employing the idea of robust optimization. Then we investigate a novel continuous-time dynamics for seeking GNE of this certain game in a distributed way by gradient descent and projected output feedback. Finally, we illustrate several examples to present the feasibility of the distributed dynamics. The remainder is organized as follows. Section 2 formulates a distributed robust game with parameter uncertainties in coupled constraints. Then Section 3 provides a distributed dynamics by considering a robust counterpart. Moreover, Section 4 presents numerical examples for illustration of the proposed dynamics in real UAV applications. Finally, we summary the results obtained in this paper in Section 5. 2.PROBLEM FORMULATIONWe first model a distributed robust game with resource allocation constraints to study the UAVs’ interactions in this section. We consider an N-UAV communication system where UAVs take the optimal resources allocation. This formulates a non-cooperative game. We denote 𝒢 = (I, Ɛ) as a connected network graph, where I ≜ {1, … N} is the UAV set and ɛ is the edge set. Also, we take A = [aij] ∈ ℝn×n as the connected matrix of 𝒢. If the pair (j, i) ∈ ɛ, then we have aij ˃ 0, which indicates that UAV j can exchange the information with i, as shown in Figure 1. For i ∈ I, UAV i chooses a decision variable xi from a compact and convex set Ci ⊆ ℝn. We denote , x ≜ col{x1, … xN} ∈ C as the strategy profile for all UAVs, and x−i ≜ col{x1, …, xi−1, xi+1, …, xN} as the strategy profile for all UAVs except i. The convex cost function for UAV i is Ji(xi, x−i): ℝnN → ℝ, which is continuously differentiable with respect to xi, Ji(x) is Lipschitz continuous in x. Since the shared resources such as spectrum and power resources are usually limited, there exists a coupled allocation constraint in the UAV communication network. Moreover, considering the inevitable uncertainties in practice, the allocation constraint has uncertain parameters, which come from polyhedral uncertain convex sets. These uncertainties are usually caused by many factors, such as heavy transmission burden in communication channels2 or noise interferences in complex environments10. We denote U ⊆ ℝNn as the set for this allocation constraint. All UAVs should satisfy where Ξi is a polyhedral uncertain set, defined as Ξi = {θi ∈ ℝn: Pi θi ≤ di}, Pi ∈ ℝqi×n, di ∈ ℝqi. For all θi ∈ Ξi, must be satisfied. Denote 𝒳 ≜ U∩C as the feasible set of this game. The feasible set of UAV i is In a nutshell, given x−i, UAV i aims to solve For a reasonable solution of equation (1), a generalized Nash equilibrium (GNE) can be regarded an action profile x* that satisfies in which no UAV can profit from unilaterally deviating from its own action5. On the other hand, each UAV may only access its local cost function Ji and action set Θi in the multi-UAV network, since these are private information. Also, UAV i can only know rather than . To fulfil cooperation, all UAVs exchange their local information through the network graph 𝒢. In this way, this paper studies the approach to find a GNE of equation (1) via a distributed environment. 3.DYNAMICS DESIGNBased on the formulated robust game, we then consider a distributed dynamics for seeking a GNE of equation (1) under the worst case, that is, the solution satisfies all possible constraints, Due to the parameter uncertainty, we first utilize the idea in robust optimization to handle the coupled constraint10. Recalling that Ξi is a polyhedral uncertain set for i ∈ I. The independent optimization problem separated from equation (4) is By introducing a dual variable , equation (2) becomes We adopt the similar analysis for other UAVs. Whereupon, the robust game of equation (1) is transformed into a certain game with resource allocation constraints where zi = col{xi, τi} ∈ ℝn+qi, , , , . We denote gi(zi,z−i) ≜ col {∇xiJi (·,x−i), 0qi} ∈ ℝn+qi as the pseudo-gradient. We decompose bi. We define the projection operator ΠK: ℝn → K on a closed and convex set K as For designing a distributed dynamic, we introduce auxiliary variables yi ∈ ℝn, vi ∈ ℝ, ωi ∈ ℝ for each UAV i ∈ I. Moreover, we employ ΠΩi(·),Πℝ+(·) as two projection operators on Ωi and ℝ+, respectively. Then we propose the following distributed dynamics for game of equation (3): with the initial condition zi (0) ∈ ℝn, yi (0) ∈ ℝn, ωi (0) ∈ ℝ, λi (0) ∈ ℝ, vi (0) ∈ ℝ. Besides, aij represent the entries of the connected matrix A. In distributed dynamics of equation (4), UAV i calculates yi and vi by using gradient descent. ΠΩi(yi) is the projection of yi onto the local constraints Ωi and Πℝ+(vi) is the projection of vi on the half space ℝ+. The design idea is similar to Reference12. The local variable λi(t) ∈ ℝ+ is calculated as a Lagrangian multiplier to handle the coupled constraints, while the local auxiliary variable ωi is calculated for the consensus of λi. The consensus among these λi guarantees that the decision variables x of all UAVs converge to the consensual GNE x*. The convergence proof can refer to References8, 12. On the other hand, the terms ΠΩi(·) and Πℝ+(·) are regarded as the projected output feedback, which allows that the players’ initial variables do not need to restrict by the local constraints13-15. Besides, dynamics of equation (4) avoid the nonsmoothness derived by the projection on tangent cones in Reference12. 4.NUMERICAL EXPERIMENTSWe show several experiments in this section for the accuracy of distributed dynamics of equation (4). Considering a UAV communication system with N = 5 UAVs over a ring graph, i.e., These UAVs are regarded as decision-making units to adopt optimal strategy to achieve effective coverage value. For i ∈ I ≜ {1, … 5}, the action set for UAV i is Ci = {xi ∈ ℝ2: u112 ≤ xi ≤ u212}, with u1 = –30 and u2 = 30. Given x−i, UAV i aims to address where ρi = (15 − i)12 ∈ ℝ2. All UAVs meet the allocation constraint with the parameter satisfying an octagonal set, defined as Meanwhile, we set tolerance as ttol = 10−4. Figure 2 provides the trajectories along dynamics (4) of each UAV’s decision variables xi. Also, Figure 3 shows the Lagrangian multipliers λi reach consensus. Together with Figure 2, we present the correctness and feasibility of distributed dynamics of equation (4). Next, we show the effectiveness of dynamics of equation (4) by comparisons. The number of UAVs is increased to N = 20. Figure 4 presents the performance of dynamics of equation (4) and the algorithms in References8, 16. The horizontal axis denotes the running time and the vertical axis denotes the optimal error ||x − x*|| under different algorithms. As shown in Figure 4, dynamics of equation (4) converge with a faster rate. 5.CONCLUSIONA distributed robust game model has been considered to model the interactions among multiple UAVs. These UAVs share an allocation constraint, where parameters endowed with the constraint have general uncertainties. By adopting the idea of robust optimization, the original game has been handled under the worst case, and transformed into an associated fixed game. Then a distributed dynamics has been proposed to seek GNE for the converted game by utilizing gradient descent and projected output feedback. Finally, numerical experiments have been illustrated to show the correctness and effectiveness of the dynamics. ACKNOWLEDGMENTSThis work was supported by Zhejiang Key Laboratory of General Aviation Operation Technology, (JDGA2020-4). REFERENCESSun, Y., Ng, D. W. K., Xu, D., Dai, L. and Schober, R.,
“Resource allocation for solar powered UAV communication systems,”
2018 IEEE 19th Inter. Workshop on Signal Processing Advances in Wireless Communications (SPAWC), 1
–5
(2018). Google Scholar
Cui, J., Liu, Y. and Nallanathan, A.,
“Multi-agent reinforcement learning-based resource allocation for UAV networks,”
IEEE Transactions on Wireless Communications, 19
(2), 729
–743
(2019). https://doi.org/10.1109/TWC.7693 Google Scholar
Messous, M. A., Senouci, S. M., Sedjelmaci, H. and Cherkaoui, S.,
“A game theory based efficient computation offloading in an UAV network,”
IEEE Transactions on Vehicular Technology, 68
(5), 4964
–4974
(2019). https://doi.org/10.1109/TVT.25 Google Scholar
Ruan, L., Wang, J., Chen, J., et al.,
“Energy-efficient multi-UAV coverage deployment in UAV networks: A game-theoretic framework,”
China Communications, 15
(10), 194
–209
(2018). https://doi.org/10.1109/CC.2018.8485481 Google Scholar
Facchinei, F. and Pang, J. S., Finite-Dimensional Variational Inequalities and Complementarity Problems, Springer, New York
(2003). Google Scholar
Koshal, J., Nedic, A. and Shanbhag, U. V.,
“Distributed algorithms for aggregative games on graphs,”
Operations Research, 64
(3), 680
–704
(2016). https://doi.org/10.1287/opre.2016.1501 Google Scholar
Liang, S., Yi, P. and Hong, Y.,
“Distributed Nash equilibrium seeking for aggregative games with coupled constraints,”
Automatica, 85 179
–185
(2017). https://doi.org/10.1016/j.automatica.2017.07.064 Google Scholar
Xu, G., Chen, G., Qi, H. and Hong, Y.,
“Efficient algorithm for approximating Nash equilibrium of distributed aggregative games,”
arXiv preprint arXiv:2108.12142,
(2021). Google Scholar
Bertuccelli, L., Choi, H. L., Cho, P. and How, J.,
“Real-time multi-UAV task assignment in dynamic and uncertain environments,”
AIAA Guidance, Navigation, and Control Con, 5776
(2009). Google Scholar
Aghassi, M. and Bertsimas, D.,
“Robust game theory,”
Mathematical Programming, 107
(1), 231
–273
(2006). https://doi.org/10.1007/s10107-005-0686-0 Google Scholar
Cheng, Z., Chen, G. and Hong, Y.,
“Single-leader-multiple-followers Stackelberg security game with hypergame framework,”
IEEE Transactions on Information Forensics and Security, 17 954
–969
(2022). https://doi.org/10.1109/TIFS.2022.3155294 Google Scholar
Chen, G., Ming, Y., Hong, Y. and Yi, P.,
“Distributed algorithm for ɛ-generalized Nash equilibria with uncertain coupled constraints,”
Automatica, 123 109313
(2021). https://doi.org/10.1016/j.automatica.2020.109313 Google Scholar
Chen, G., Yi, P., and Hong, Y.,
“2021 Distributed optimization with projection-free dynamics arXiv preprint arXiv: 2105.02450,”
Google Scholar
Chen, G., Zeng, X. and Hong, Y.,
“2019 Distributed optimisation design for solving the Stein equation with constraints IET Control Theory & Applications,”
13
(15), 2492
–2499 Google Scholar
Xu, G., Chen, G., and Qi, H.,
“Algorithm design and approximation analysis on distributed robust game,”
(2022). Google Scholar
Chen, G., Li, W., Xu, G. and Hong, Y.,
“Distributed mirror descent algorithm with Bregman damping for nonsmooth constrained optimization,”
arXiv preprint arXiv:2108.12136,
(2021). Google Scholar
|