Neural network-based velocity-controllable UAV flocking

T. He; L. Wang

doi:10.1017/aer.2022.61

Neural network-based velocity-controllable UAV flocking

Published online by Cambridge University Press: 23 June 2022

T. He

and

L. Wang

Show author details

T. He: Affiliation:
Research Institute of Electronic Science and Technology, University of Electronic Science and Technology of China, Chengdu China
L. Wang*: Affiliation:
Research Institute of Electronic Science and Technology, University of Electronic Science and Technology of China, Chengdu China
*: *Corresponding author. Email: [email protected]

Article contents

Abstract
Nomenclature
Introduction
Model
Problem formulation
UAV Flocking control based on optimisation
Simulation results
Conclusions
Footnotes
References

Rights & Permissions

Abstract

The unmanned aerial vehicle (UAV) flocking among obstacles was transferred to a velocity-controllable UAV flocking problem, which means that multi-UAV gradually form and maintain the $\alpha$ -lattice geometry as they track the desired flocking velocity, and can be applied to tasks such as obstacle avoidance and velocity tracking. Velocity-controllable UAV flocking problem is a multi-objective flocking controller parameters optimisation problem, for which we design flocking velocity and geometry objective function, and solve them using a multi-objective particle swarm optimisation algorithm (MOPSO). On this basis, to address the problem that MOPSO has random results and long computation time, we propose to use a neural network (NN) to approximate the mathematical relationship between the UAV flocking state and the flocking controller parameters. We simulate the flight process of 5 and 49 UAVs performing obstacle avoidance and velocity tracking tasks, respectively. The results show that the proposed UAV flocking controller has better convergence performance, obtains reproducible results, reduces computation time, and can be used for large-scale UAV flocking control.

Keywords

Unmanned aerial vehicle (UAV)Flocking Multi-objective particle swarm optimisation (MOPSO)Neural network (NN)Obstacle avoidance

Type: Research Article
Information: The Aeronautical Journal , Volume 127 , Issue 1308 , February 2023 , pp. 289 - 304

DOI: https://doi.org/10.1017/aer.2022.61 [Opens in a new window]
Copyright: © The Author(s), 2022. Published by Cambridge University Press on behalf of Royal Aeronautical Society

Nomenclature

N: number of UAVs
$P^i$: position vector of UAV i (m)
$P_x^i$: x component of position vector of UAV i (m)
$P_y^i$: y component of position vector of UAV i (m)
$P^{core}$: position vector of flocking core (m)
$P^{core}_x$: x component of position vector of flocking core (m)
$P^{core}_y$: y component of position vector of flocking core (m)
$V^i$: velocity vector of UAV i (m/s)
$V_x^i$: x component of velocity vector of UAV i (m/s)
$V_y^i$: y component of velocity vector of UAV i (m/s)
$V_{xy}^i$: horizontal airspeed of UAV i (m/s)
$V_{xy}^{\min}$: lower limits of horizontal airspeed (m/s)
$V_{xy}^{\max}$: upper limits of horizontal airspeed (m/s)
$V_{xy}^{l}$: allowable control error of horizontal airspeed (m/s)
$V^e$: desired velocity vector (m/s)
$V^e_{xy}$: desired horizontal airspeed (m/s)
$efv^i$: desired flocking velocity vector of UAV i (m/s)
$efv^i_x$: x component of desired flocking velocity vector of UAV i (m/s)
$efv^i_y$: y component of desired flocking velocity vector of UAV i (m/s)
$\phi^i$: yaw angle of UAV i (rad)
$\phi^{l}$: allowable control error of yaw angle (rad)
$\phi^e$: desired yaw angle (rad)
$\delta^i$: desired flocking yaw angle of UAV i (rad)
$\delta^{core}$: desired flocking yaw angle of flocking core (rad)
$C_{\delta}$: parameter that controls the geometry of flocking
$n^{\max}$: maximum lateral overload (g)
g: gravitational acceleration ( $\mathrm{m/s}^2)$
$u^i$: control input vector of UAV i
$u_x^i$: x component of control input vector of UAV i
$u_y^i$: y component of control input vector of UAV i
$u_f^i$: flocking geometry control component of UAV i
$u_{av}^i$: horizontal airspeed alignment control component of UAV i
$u_c^i$: collision avoidance control component of UAV i
$u_{vf}^i$: flocking velocity control component of UAV i
$C_f$: strength coefficient of the flocking geometry control component
$C_{av}$: strength coefficient of the horizontal airspeed alignment control component
$C_c$: strength coefficient of the collision avoidance control component
$C_{vf}$: strength coefficient of the flocking velocity control component
$W_j^i$: influence weight of UAV j to UAV i for flocking geometry control
$W_i^i$: influence weight of UAV i to UAV i for flocking velocity control
$\hat{W}^i$: optimal influence weight vector of UAV i
$d^{ij}$: horizontal distance between UAV i and j (m)
$D_c$: maximum horizontal communication distance (m)
$D_{l1}$: minimum distance between UAVs to avoid collision (m)
$D_{l2}$: minimum distance allowed between the UAV and obstacle (m)
$D_d$: desired horizontal distance between UAVs(m)
$Q^j$: position vector of obstacle j (m)
$Q_x^j$: x component of position vector of obstacle j (m)
$Q_y^j$: y component of position vector of obstacle j (m)
$R^j_o$: radius of the obstacle j (m)
$S_o$: the set of obstacles within maximum perceived distance ahead of the UAV flocking in the
: desired velocity direction
$R_{formation}$: radius of the UAV flocking in the desired flocking yaw angle direction (m)
$obj_1^i$: flocking velocity objective function of UAV i
$obj_2^i$: flocking geometry objective function of UAV i
$input^i$: input vector i of NN
$label^i$: label vector i of NN

1.0 Introduction

Flocking is a collective behaviour that describes the phenomenon of all individuals being close to each other and moving at a common velocity. This phenomenon is universal in nature, such as jackdaw flocks [Reference Ling, Mclvor, Westley, van der Vaart, Vaughan, Thornton and Ouellette1], locust groups [Reference Dkhili, Berger, Idrissi Hassani, Ghaout, Peters and Piou2], fish school [Reference Becco, Vandewalle, Delcourt and Poncin3], honeybee colonies [Reference Arvin, Samsudin, Ramli and Bekravi4], etc. Animals in flock are more likely to avoid predators, which also makes them more conducive to getting more food. Flocking also bring many benefits to artificial agents (e.g. unmanned vehicles, autonomous ships and robots), especially UAVs. A UAV with single payload and limited ability to perform tasks cannot meet the demand. UAV flocking, on the other hand, can enhance system performance and efficiently accomplish more complex and larger missions by cooperating with each other and complementing each other’s strengths. Hence, UAV flocking are widely used in both military and civilian fields [Reference McGuire, De Wagter, Tuyls, Kappen and de Croon5, Reference Schranz, Umlauft, Sende and Elmenreich6, Reference Poljak and Återbenc7, Reference Park and Choi8, Reference Pham, La, Feil-Seifer and Deans9, Reference Márquez-Vega, Aguilera-Ruiz and Torres-Treviño10], while attracting the interest of a growing number of researchers.

Obstacle avoidance is a key problem that needs to be solved for safe UAV flocking flight, and numerous studies have been conducted to address this problem and various solutions have been proposed. In Ref. [Reference Wu, Sun, Huang, Zhang and Li11], an improved artificial potential field method is recommended, which uses k-means algorithm to divide space potential field into the inter-UAV potential field and the global potential field according to the action range of potential field internal force, thus optimising the calculation of attractive field between UAVs. It also introduces the attraction disturbance component of the target point and proposes a backtracking-filling method to solve the local minimum problem in the artificial potential fields. Dai et al. [Reference Dai, Kim, Wee, Lee and Lee12], use the geometric obstacle avoidance control method (GOACM) to plan a safe path, and the adaptive tracking control algorithm guarantees that the trajectory and velocity tracking errors converge to zero. Olfati-Saber [Reference Olfati-Saber13] presents a flocking algorithm with obstacle avoidance capability by creating a virtual agent on the boundary of each nearby obstacle. In the work most closely related to this research, Qiu and Duan [Reference Qiu and Duan14] modeled obstacle and collision avoidance as hard constraint, hoping to realise that the UAV moves more distance in the desired velocity direction in the process of changing direction for obstacle avoidance, but the hard constraints did not work in the whole flight process, and the velocity control performance is poor. Therefore, we transform the UAV flocking problem among obstacles into a velocity controllable UAV flocking problem, which means that multi-UAV gradually form and maintain the $\alpha$ -lattice [Reference Olfati-Saber13] geometry as they track the desired flocking velocity (e.g. optimal velocity for obstacle avoidance in complex environments). It is a multi-objective optimisation problem, for which we design flocking velocity and geometry objective function, and solve them using a multi-objective particle swarm optimisation algorithm.

The study of flocking originated with the pioneering work of Reynolds [Reference Reynolds15] in 1987, who introduced three heuristic rules: aggregation, alignment and separation to realise the computer simulation of collective behaviour. Then, different flocking models [Reference Fine and Shell16, Reference Zhu, Xie, Han, Meng and Teo17] and optimisation algorithms [Reference Beaver and Malikopoulos18] have been developed successively. The literature on the design of UAV flocking controller can be divided into three categories: One is to manually complete the design of flocking controller according to the rules, and then optimise the relevant parameters of flocking controller to improve the security of the system and the performance of the algorithm [Reference Olfati-Saber13, Reference Qiu and Duan14, Reference Vásárhelyi, Virágh, Somorjai, Nepusz, Eiben and Vicsek19, Reference Zhao, Chu, Zhang, Sun and Guo20]. The second is to directly solve the flocking controller by using the optimal control theory [Reference Beaver and Malikopoulos21, Reference Beaver, Kroninger and Malikopoulos22], which requires the minimum energy consumption with specific safety and task constraints. Third, a learning-from-data approach is used to design the controller autonomously, avoiding the high a prior knowledge requirements of manual design and also addresses, to some extent, control errors caused by incomplete problem models. In the literature [Reference Dierks, Brenner and Jagannathan23, Reference Yu, Ji, Miao and Zhou24, Reference Liu, Huang, Zhang and Zhao25, Reference Yang, Bai, Li, Shi, Yang, Wu and Chen26], a neural network term is added to the flocking controller to actively compensate for approximation errors in the non-linear dynamics model as well as uncertainties and external disturbances. The literature [Reference Nguyen, Nguyen and Nahavandi27, Reference Xu and Chen28, Reference Hu, Gama and Chen29, Reference Schilling, Lecoeur and Schiano30], uses a neural network as a controller to achieve an end-to-end mapping of flocking states to control outputs. In this paper, the design approach for the flocking controller is a combination of the first and third approaches. As we all know, in order to optimise the performance of the hand-designed UAV flocking controller, it is usually necessary to optimise its parameters using intelligent optimisation algorithm. However, there is a randomness in the results calculated by the intelligent optimisation algorithm and the computational complexity is high. If the mathematical relationship between the UAV flocking state and the controller parameters can be found, then the performance of the controller will be improved. Therefore, in this paper, we use the powerful approximation capabilities of NN [Reference Hornik31] and take the UAV flocking state as the input of training NN and the optimal flocking controller parameters calculated by MOPSO as the label of training NN. The trained NN successfully approximates the mathematical relationship between the UAV flocking state and the controller parameters.

The rest of the paper is organised as follows: UAV model, flocking model, obstacle avoidance and velocity tracking model are established in Section 2. Problem formulation is introduced in Section 3. Section 4 describes the UAV flocking control based on optimisation. Section 5 shows the simulation results in detail. Finally, the conclusion is presented in Section 6.

2.0 Model

2.1 UAV model

In this paper, N UAVs in a two-dimensional space are taken into consideration. The dynamic model of each UAV is described as follows [Reference Qiu and Duan14]:

(1)

\begin{equation}\begin{cases}\dot{P}_x^i=V_{xy}^i\cos\phi^i,\\[5pt] \dot{P}_y^i=V_{xy}^i\sin\phi^i,\\[5pt] \dot{V}_{xy}^i=u_x^i\cos\phi^i+u_y^i\sin\phi^i,\\[5pt] \dot{\phi}^i=u_y^i\cos\phi^i-u_x^i\sin\phi^i,\end{cases}\end{equation}

where, for UAV i, $P^i=(P_x^i, P_y^i)$ is the position vector, $V^i=(V_x^i, V_y^i)$ is the velocity vector, $V_{xy}^i$ is the horizontal airspeed, $\phi^i$ represents yaw angle, and $u^i = (u_x^i, u_y^i)$ is the control input vector. In addition, the following constraints must be met:

(2)

\begin{equation}\begin{cases}V_{xy}^{\min} \leq V_{xy}^i \leq V_{xy}^{\max},\\[5pt] \left| \dot{\phi}^i \right| \leq \frac{n^{\max}g}{V_{xy}^i},\\[5pt] V_{xy}^i=V^e_{xy}, & \text{if $\left|V_{xy}^i-V^e_{xy}\right|<V_{xy}^{l},$}\\[5pt] \phi^i=\phi^e, & \text{if $\left|\phi^i-\phi^e\right|<\phi^{l},$}\end{cases}\end{equation}

where, $n^{\max}=10$ g is the maximum lateral overload, $g=10\mathrm{m/s}^2$ is the gravitational acceleration, $V_{xy}^{\min}=5\mathrm{m/s}$ , $V_{xy}^{\max}=15\mathrm{m/s}$ are the lower and upper limits of horizontal airspeed, respectively, $V_{xy}^{l}=0.25\mathrm{m/s}$ , $\phi^{l}=0.1$ rad are allowable control error of horizontal airspeed, yaw angle, respectively, and $\phi^e = a\tan \left( \frac{V_y^e}{V_x^e} \right)$ , $V^e = (V_x^e,V_y^e)=(10,0)\mathrm{m/s}$ , $V^e_{xy}$ are the desired yaw angle, desired velocity vector, desired horizontal airspeed, respectively.

2.2 Flocking model

To achieve velocity controllable UAV flocking, the following movement rules need to be guaranteed: (1) Each UAV stay close to its neighbours; (2) All UAVs fly at a common velocity; (3) UAVs avoid conflict with each other; (4) The flocking velocity of UAVs is controllable.

Therefore, control input of UAV i consists of four components: $u_f^i$ is flocking geometry control component, which regulates the distance between UAV i and its neighbour. $u_{av}^i$ is horizontal airspeed alignment control component, which regulates the velocity of UAV i to be consistent with its neighbour. $u_c^i$ is collision avoidance control component, which regulates the safe distance among UAVs. $u_{vf}^i$ is flocking velocity control component, which regulates the velocity of UAV i to be consistent with the desired flocking velocity. Details are as follows [Reference Qiu and Duan14]:

(3)

\begin{equation}u_k^i=u_f^i+u_{av}^i+u_c^i+u_{vf}^i-V_k^i, \quad k=x,y,\end{equation}

where, $k=x, y$ denotes the corresponding components in the x, y directions.

The flocking geometry control component $u_f^i$ is as the following equation:

(4)

\begin{equation}u_f^i=C_f \sum_{j \in \left\{ d^{ij} \leq D_c \right\}} W_j^i(P_k^j-P_k^i) ln\! \left( \frac{d^{ij}}{D_d} \right)\end{equation}

where, $C_f=0.1$ denotes strength coefficient of the flocking geometry control component, $W_j^i$ denotes influence weight of UAV j to UAV i for flocking geometry control within [0,1], $d^{ij}=\sqrt{(P_x^i-P_x^j)^2 + (P_y^i-P_y^j)^2}$ denotes the horizontal distance between UAV i and j, $D_c=20$ m denotes the maximum horizontal communication distance, $D_d=10$ m denotes the desired horizontal distance between UAVs.

The horizontal airspeed alignment control component $u_{av}^i$ is as the following equation:

(5)

\begin{equation}u_{av}^i=C_{av} \sum_{j \in \left\{ d_{ij} \leq D_c \right\}} W_j^i(V_k^j-V_k^i)\end{equation}

where, $C_{av}=0.1$ denote strength coefficient of the horizontal airspeed alignment control component.

The collision avoidance control component $u_c^i$ is as the following equation:

(6)

\begin{equation}u_c^i=C_c \sum_{j \in \left\{ d^{ij} \leq D_{l1} \right\}} \left( \frac{1}{d^{ij}}-\frac{1}{D_{l1}} \right)^2 \frac{P_k^i-P_k^j}{d^{ij}}\end{equation}

where, $C_c=10^5$ denote strength coefficient of the collision avoidance control component, $D_{l1}=2$ m denotes the minimum distance between UAVs to avoid collision.

The flocking velocity control component $u_{vf}^i$ is as the following equation:

(7)

\begin{equation}u_{vf}^i=C_{vf}W_i^iefv^i_k\end{equation}

where, $C_{vf}=1$ denotes strength coefficient of the flocking velocity control component, $W_i^i$ denotes influence weight of UAV i to UAV i for flocking velocity control within $[1,1.1]$ , $efv^i=(efv^i_x,efv^i_y)=(V^{e}_{xy} \cos\delta^i, V^{e}_{xy} \sin\delta^i)$ denotes the desired flocking velocity vector of UAV i, and $\delta^i$ denotes the desired flocking yaw angle of UAV i.

2.3 Obstacle avoidance and velocity tracking model

Because the magnitude of the desired velocity remains constant throughout the flight, obstacle avoidance and velocity tracking can be achieved by setting an appropriate desired flocking yaw angle $\delta^i$ which is calculated as follows:

(8)

\begin{equation}\delta^{i}=a\tan \left( \frac{P^{core}_y + C_{\delta}\sin(\delta^{core}) - P^i_y}{P^{core}_x + C_{\delta}\cos(\delta^{core}) - P^i_x} \right)\end{equation}

(9)

\begin{equation}P^{core}_k=\frac{1}{N}\sum_{i=1}^{N} P_k^i, \quad k=x,y,\end{equation}

where $P^{core}=(P^{core}_x,P^{core}_y)$ is position vector of flocking core calculated by Equation (9), $C_{\delta}$ is the parameter that controls the geometry of flocking, $\delta^{core}$ is the desired flocking yaw angle of flocking core, which is directly set as the reference desired flocking yaw angle in the velocity tracking process, but its equation is as follows in the obstacle avoidance process:

(10)

\begin{equation}\delta^{core}=\begin{cases}a\tan \left( \dfrac{P^A_y - P^{core}_y}{P^A_x - P^{core}_x} \right), & \text{if $S_o \ne \emptyset,$}\\[9pt] a\tan \left( \dfrac{V_y^e}{V_x^e} \right), & \text{else,}\end{cases}\end{equation}

where $S_o$ is the set of obstacles within maximum perceived distance ahead of the UAV flocking in the desired velocity direction, as shown in Fig. 1. A is the point closest to the flocking core among the two edge points projected in the vertical direction of $V^e$ by the envelope of the obstacle closest to the flocking core in $S_o$ , $D_{l2}=10$ m is the minimum distance allowed between the UAV and obstacle, $R_{formation}$ is radius of the UAV flocking in the desired flocking yaw angle direction. Note that although the UAV flocking bypass the current obstacle, it cannot bypass the next obstacle until it passes through the current obstacle. The maximum perceived distance of UAV is 105.

Figure 1. Obstacle avoidance model.

3.0 Problem formulation

According to the above model, the influence weight vector $W^i=(W^i_1, W^i_2, \dots, W^i_N)$ directly determines the next state of the UAV i. Therefore, the UAV flocking control problem is transformed into solving the optimal influence weight vector $\hat{W}^i$ of UAV i which satisfies the following objective functions:

(11)

\begin{equation}\begin{cases}\min \quad obj^i = [obj_1^i,obj_2^i],\\[5pt] s.t. \quad \text{Equations (1)--(8)},\end{cases}\end{equation}

Flocking velocity objective function $obj_1^i$ indicates how far the actual velocity deviates from the desired flocking velocity. The equation is as follows:

(12)

\begin{equation}obj_1^i=\begin{cases}-\dfrac{(V_x^i,V_y^i).(efv^i_x,efv^i_y)}{2\left\|efv^i\right\|}, &\text{if} \,\,\, {efv^i \neq V^e} \,\, \text{or} \,\, {S_o \neq \emptyset,}\\[9pt] \sum_{k=x,y} \left| efv^i_k-V_k^i \right|, & \text{else,}\end{cases}\end{equation}

Where, $efv^i \neq V^e$ indicates the velocity tracking process, and $S_o \neq \emptyset$ represents obstacle avoidance process. In the process, the velocity of UAV i is not required to be exactly the same as the desired flocking velocity, but is allowed to change slightly to adapt to the process as soon as possible.

Flocking geometry objective function $obj_2^i$ describes the degree of UAV flocking and the alignment performance with neighbours velocities. The equation is as follows:

(13)

\begin{equation}obj_2^i=\sum_{j \in \left\{ d^{ij} \leq D_c \right\}} \left[ \left| D_d-d^{ij} \right|+\sum_{k=x,y} \left| V_k^i-V_k^j \right| \right]\end{equation}

4.0 UAV Flocking control based on optimisation

4.1 UAV flocking obstacle avoidance and velocity tracking based on MOPSO

UAV flocking obstacle avoidance and velocity tracking based on MOPSO is a special case of UAV flocking control framework (see Algorithm 1). It uses MOPSO [Reference Coello Coello and Lechuga32] to calculate the optimal influence weight vector $\hat{W}^i$ , The specific steps of the MOPSO are as follows:

Input: Objective function $obj^i_1$ and $obj^i_2$ , maximum generation $M_G$ , number of particles $N_P$ , and the upper and lower bounds of $L_g^h$ and $S_g^h$ are the same as those of influence weight vector. Where, $L_g^h$ and $S_g^h$ are position vector and velocity vector of the generation g of particle h respectively, $h=1,2,\dots, N_P$ , $g=1,2,\dots, M_G$ , $L_g^h$ is a feasible influence weight vector.

Initialisation: Initialise the $L_g^h$ and $S_g^h$ of $N_P$ particles within their upper and lower bounds respectively, current iteration $g = 1$ .

Step 1: Calculate the objective function $obj^i_1$ and $obj^i_2$ of all the particles by Equations (1)–(8) and (12)–(13).

Step 2: Rank the particles positions in the set $L_g$ by Pareto sorting scheme, store the positions of particles in the Pareto frontier in the repository REP.

Step 3: $pbest^h = L_1^h$ .

Step 4: gbest is selected randomly from REP.

Step 5: Update the velocity of each particle by the following equation.

(14)

\begin{equation}S_{g+1}^{h}=aS_{g}^{h}+r_{1}^T(pbest^h-L_{g}^h)+r_{2}^T(gbest-L_{g}^h)\end{equation}

Where, $a = 0.4$ is the inertia weight, $r_{1}$ and $r_{2}$ are random number vectors uniformly distributed in the interval [0,1].

Step 6: Update the position of each particle by the following equation.

(15)

\begin{equation}L_{g+1}^{h}=L_{g}^h+V_{g+1}^{h}\end{equation}

Step 7: Initialises the state of the particle if its position exceeds the upper and lower bounds.

Step 8: Calculate the objective function $obj^i_1$ and $obj^i_2$ of all the particles by Equations (1)–(8) and (12)–(13).

Step 9: If $L_{g+1}^{h}$ is better than $pbest^h$ , then $pbest^h = L_{g+1}^{h}$ .

Step 10: Rank the particles positions in the set $L_{g+1}$ and REP by Pareto sorting scheme, store the positions of particles in the Pareto frontier in the repository REP.

Step 11: The current iteration $g=g+1$ . If $g \le M_G$ , then return to Step 4. Otherwise, output the optimal influence weight vector which is the position of particle in the REP whose objective function $obj^i_2$ has the smallest value.

4.2 UAV flocking obstacle avoidance and velocity tracking based on NN

UAV flocking obstacle avoidance and velocity tracking based on NN is a special case of UAV flocking control framework(see Algorithm 1). It uses NN to calculate the optimal influence weight vector $\hat{W}^i$ , and the NN is generated as follows.

Generate training samples: The UAV flocking velocity tracking algorithm based on MOPSO is repeated $N_{repeat}=100$ times to obtain the training sample set $\{input^i,label^i\}_{i=1}^{N_{repeat} \times \frac{T_{\max}}{ts}}$ . Where, $input^i$ is the input vector i and expressed by Equation (17), $label^i$ is the label vector i and expressed by Equation (18), $P_x^i$ , $P_y^i$ are random numbers within [0,30], [75,150] respectively, but horizontal distance between any two UAVs should be less than maximum horizontal communication distance of $D_c$ , number of UAVs $N=5$ , maximum simulation time $T_{\max}=50$ s, sampling time ts=0.5s, $C_{\delta} = \infty$ , $\delta^{core}$ is expressed by Equation (16).

(16)

\begin{equation}\delta^{core}=\begin{cases}\dfrac{\pi}{3} \sin\left( \dfrac{2 \pi t}{0.8T_{\max}} \right), & \text{if $t \le 0.8T_{\max},$}\\[12pt] a\tan \left( \dfrac{V^e_y}{V^e_x} \right), & \text{if $t > 0.8T_{\max},$}\end{cases}\end{equation}

(17)

\begin{equation}\begin{aligned}input^i &=\bigg( \underbrace{P_x^1,P_y^1,V^1_x,V^1_y,efv^1_x,efv^1_y}_{State \, of \, UAV \, 1},\\[5pt]&\mathrel{\phantom{=}} \phantom{{}\bigg( } \underbrace{P_x^2,P_y^2,V^2_x,V^2_y,efv^2_x,efv^2_y}_{State \, of \, UAV \, 2},\\[5pt]&\mathrel{\phantom{=}} \phantom{{}\bigg( } \dots, \\[5pt]&\mathrel{\phantom{=}} \phantom{{}\bigg( } \underbrace{P_x^N,P_y^N,V^N_x,V^N_y,efv^N_x,efv^N_y}_{State \, of \, UAV \, N} \bigg)\end{aligned}\end{equation}

(18)

\begin{equation}\begin{aligned}label^i &=\bigg( \underbrace{\hat{W}^1_1,\hat{W}^1_2,\dots,\hat{W}^1_N}_{Optimal \, influence \, weight \, of \, UAV \, 1},\\[5pt]&\mathrel{\phantom{=}} \phantom{{}\bigg( } \underbrace{\hat{W}^2_1,\hat{W}^2_2,\dots,\hat{W}^2_N}_{Optimal \, influence \, weight \, of \, UAV \, 2},\\[5pt]&\mathrel{\phantom{=}} \phantom{{}\bigg(Optima} \dots, \\[5pt]&\mathrel{\phantom{=}} \phantom{{}\bigg( } \underbrace{\hat{W}^N_1,\hat{W}^N_2,\dots,\hat{W}^N_N}_{Optimal \, influence \, weight \, of \, UAV \, N} \bigg)\end{aligned}\end{equation}

Neural network structure: NN has four hidden layers, each layer has 6 $\times$ N neurons, and the dimensions of the input and output layers are 6 $\,\times\,$ N and N $\,\times\,$ N, respectively. Its structure is shown in Fig. 2. Where, the activation function of hidden layer, output layer are shown in Equations (19), (20), respectively.

(19)

\begin{equation}Tansig(o) = \frac{2}{1+exp(-2o)}-1\end{equation}

(20)

\begin{equation}Purelin(o) = o\end{equation}

Training neural network: Each dimension of the input data is normalised to $[-1, 1]$ before training, the loss function is the mean square error function is expressed by Equation (21), the training function is trainscg (scaled conjugate gradient backpropagation), epochs=1,000, and The ratio between training set, validation set and test set is $7:1.5:1.5$ .

(21)

\begin{equation}mse = \frac{1}{N_d}\sum_{k=1}^{N_d}(label_k - output_k)^2\end{equation}

Figure 2. Neural network structure.

4.3 Analysis of generalisation capability of UAV flocking control based on NN

We discuss the generalisation capability of UAV flocking control based on NN from the following two aspects:

Position of UAVs: Deviation occurs in the calculation result when UAV position exceeds min-max normalisation range of the NN input vector. This problem can be solved by moving all UAVs in the flocking to min-max normalisation range without changing their relative positions.

Number of UAVs: The input vector of NN only supports the state of k UAVs, and can not be directly applied to larger scale UAV flocking control. This problem can be solved by distributed computing, that is, each UAV only calculates its next state according to the nearest k-1 UAV and its own state.

5.0 Simulation results

In this section, taking obstacle avoidance and velocity tracking as application scenarios, two numerical simulation examples are designed to verify the effectiveness of our proposed algorithm. The first simulation shows UAV flocking obstacle avoidance and velocity tracking, and the second simulation shows generalisation capability of UAV flocking control based on NN. All simulations were conducted on laptops running Windows 10 and MATLAB R2020a with Intel(R) Core(TM) i5-10210U CPU and 16GB memory.

5.1 Simulation of UAV flocking obstacle avoidance and velocity tracking

In the simulation, five UAVs form a flocking, they first spend 50s crossing obstacles, then track the velocity for 50s. Initial parameters of UAVs and obstacles are shown in Table 1, and other initial parameters are as follows: maximum generation MG = 58, number of particles NP = 20, maximum simulation time $T_{\max}=100$ s, sampling time ts=0.5s, $C_{\delta} = \infty$ , and calculate $\delta^{core}$ by Equation (22).

(22)

\begin{equation}\delta^{core}=\begin{cases}-\dfrac{\pi}{3} \sin\left( \dfrac{2 \pi (t-50)}{30} \right), & \text{if $t > 50 \, {\rm and} \, t \le 80,$}\\[12pt] a\tan \left( \dfrac{V^e_y}{V^e_x} \right), & \text{if $t > 80,$}\end{cases}\end{equation}

Figure 3 depicts the simulation results of five UAVs flocking obstacle avoidance and velocity tracking based on NN where, $RD_{ij}$ denotes the relative distance between UAV i and UAV j, $D_{l1}$ is the minimum distance to avoid collision between UAVs. As shown in Fig. 3(a), five UAVs successfully completed obstacle avoidance and velocity tracking tasks and flew out of a smooth trajectory curve. As shown in Fig. 3(b), the difference of horizontal airspeed among UAVs was less than 1.5480m/s, and the horizontal airspeed of the UAV flocking quickly converges to the desired horizontal airspeed after fluctuating within the allowable range. As shown in Fig. 3(c), the UAV flocking has a good velocity tracking effect, and a slight yaw angle difference. As shown in Fig. 3(d), relative distance between UAVs is converges rapidly and fluctuates less and always greater than the minimum distance to avoid collision between UAVs. As shown in Fig. 3(e) and (f), the geometry and velocity of the UAV flocking converge rapidly and stably.

Table 1. Initial parameters of the UAVs and obstacles

Figure 3. Simulation results of 5 UAV flocking obstacle avoidance and velocity tracking based on NN: (a) trajectory curves, (b) horizontal airspeed curves, (c) yaw angle curves, (d) relative distance curves, (e) $\sum obj_1^i$ curves, and (f) $\sum obj_2^i$ curves.

Table 2. Statistical results of two algorithms for 5 UAVs

^a Convergence time of horizontal airspeed.

^b Convergence time of yaw angle.

^c Total simulation time.

Figure 4. Simulation results of 5 UAV flocking obstacle avoidance and velocity tracking based on MOPSO: (a) trajectory curves, (b) horizontal airspeed curves, (c) yaw angle curves, (d) relative distance curves, (e) $\sum obj_1^i$ curves, and (f) $\sum obj_2^i$ curves.

5.2 Simulation of generalisation capability of UAV flocking control based on NN

In the simulation, we adopt the neural network for 5 UAVs in the previous experiment to verify its generalization capability in a flocking of 49 UAVs, which first track the velocity for 60s, then spend 60s crossing obstacles, and finally track the velocity for 60s. The initial velocity of the UAV and radius of obstacle is the same as that shown in Table 1, and the positions of UAVs and obstacles have been moved away from where they were during training NN (due to the large amount of data, no specific data is given here). Other initial parameters are as follows: maximum simulation time $T_{\max}=180$ s, sampling time ts=0.5s, calculate $C_\delta$ by Equation (23), calculate $\delta^{core}_1$ for the first velocity tracking by Equation (24), calculate $\delta^{core}_2$ for the second velocity tracking by Equation (25).

(23)

\begin{equation}C_\delta=\begin{cases}80, & \text{if $S_o \ne \emptyset,$}\\[5pt] \infty, & \text{else,}\end{cases}\end{equation}

(24)

\begin{equation}\delta^{core}_1=\begin{cases}\dfrac{\pi}{3} \sin\left( \dfrac{2 \pi t}{40} \right), & \text{if $t \le 40,$}\\[9pt] a\tan \left( \dfrac{V^e_y}{V^e_x} \right), & {\text{if} \, t > 40 \,\,\, \text{and} \,\,\, t < 50,}\end{cases}\end{equation}

(25)

\begin{equation}\delta^{core}_2=\begin{cases}-\dfrac{\pi}{3} \sin\left( \dfrac{2 \pi (t-120)}{40} \right), & {\text{if} \, t > 120 \,\,\, \text{and} \,\,\, t \le 160,}\\[12pt] a\tan \left( \dfrac{V^e_y}{V^e_x} \right), & \text{if $t > 160,$}\end{cases}\end{equation}

Figure 5. Simulation results of 49 UAV flocking obstacle avoidance and velocity tracking based on NN: (a) trajectory curves, (b) horizontal airspeed curves, (c) yaw angle curves, (d) relative distance curves, (e) $\sum obj_1^i$ curves, and (f) $\sum obj_2^i$ curves.

Figure 5 depicts the simulation results of 49 UAV flocking obstacle avoidance and velocity tracking based on NN. where, $RD_{\min}$ is the minimum relative distance between UAVs. As shown in Fig. 5(a), the UAV flocking can safely pass through the obstacle area through geometric transformation, and the UAV position exceeds min-max normalisation range of the NN input vector, but it has no adverse impact on flight. As shown in Fig. 5(b), the horizontal airspeed of the UAV flocking quickly converges to the desired horizontal airspeed after fluctuating within the allowable range. As shown in Fig. 5(c), the UAV flocking has a good velocity tracking effect, and the yaw angle quickly converges to the desired yaw angle after fluctuations. As shown in Fig. 5(d), during the whole flight, the minimum relative distance between UAVs is always greater than the minimum distance to avoid collision between UAVs. As shown in Fig. 5(e) and (f), the geometry and velocity of the UAV flocking converge rapidly and stably. To sum up, UAV flocking controller based on NN has good generalisation capability and can control the flocking geometry to a certain extent.

To illustrate the superiority of the UAV flocking obstacle avoidance and velocity tracking based on NN, we compare it with the UAV flocking obstacle avoidance and velocity tracking based on the MOPSO. The statistical results of the two algorithms are shown in Table 3. From the data in the table, the NN-based algorithm achieves faster convergence of horizontal airspeed, especially the simulation time has been greatly reduced. The simulation results of the 49 UAV flocking obstacle avoidance and velocity tracking based on the MOPSO is shown in Fig. 6. From the comparison between Figs 5(a) and 6(a), it can be seen that in the NN-based experimental results the UAV is further away from the obstacle and safer. The comparison of Figs 5(b) and 6(b) shows that the horizontal airspeed deviates from the desired value in a shorter time, smaller amplitude, more regular, and achieves convergence earlier in the NN-based experimental results. Contrast Fig. 5(c) with Fig. 6(c) indicates that the NN-based method has a fast response to changes in yaw angle. In conclusion, the experimental results based on NN have smaller fluctuation, faster convergence, and better obstacle avoidance and velocity tracking effects.

Table 3. Statistical results of two algorithms for 49 UAVs

^a Convergence time of horizontal airspeed.

^b Convergence time of yaw angle.

^c Total simulation time.

Figure 6. Simulation results of 49 UAV flocking obstacle avoidance and velocity tracking based on MOPSO: (a) trajectory curves, (b) horizontal airspeed curves, (c) yaw angle curves, (d) relative distance curves, (e) $\sum obj_1^i$ curves, and (f) $\sum obj_2^i$ curves.

6.0 Conclusions

In this paper, we propose a method to design UAV flocking controller based on NN. The designed controller has the following advantages: (i) It has good generalisation capability and can be directly used in UAV flocking control with larger scale and wider flight range. (ii) The UAV flocking convergence and velocity controllability can be achieved simultaneously in the environment with and without obstacles. (iii) The trained NN successfully approximates the mathematical relationship between the UAV flocking state and the flocking controller parameters. It not only improves the quality of solution and obtains repeatable results, but also reduces computation time.

Nevertheless, there are still some drawbacks in this paper: (i) The structure of NN is determined by simple algorithm without optimisation. (ii) The quality of training samples is not carefully evaluated. In the future, we will continue to deepen the research on UAV flocking technology, and ultimately verify the feasibility of the algorithm in the actual flight environment.

Acknowledgements

This research was funded by the Sichuan Science and Technology Program (No. 2020YFG0059, 2019YJ0195) and the Fundamental Research Funds for the Central Universities (No. ZYGX2019J128).

Footnotes

The two authors, Tao He and Lei Wang, are co-first authors.

References

Ling, H., Mclvor, G.E., Westley, J., van der Vaart, K., Vaughan, R.T., Thornton, A. and Ouellette, N.T. Behavioural plasticity and the transition to order in jackdaw flocks, Nat. Commun., 2019, 10, (1), pp 5174. https://doi.org/10.1038/s41467-019-13281-4 CrossRef Google Scholar PubMed

Dkhili, J., Berger, U., Idrissi Hassani, L.M., Ghaout, S., Peters, R. and Piou, C. Self-organized spatial structures of locust groups emerging from local interaction, Ecol. Modell., 2017, 361, pp 26–40. https://doi.org/10.1016/j.ecolmodel.2017.07.020 CrossRef Google Scholar

Becco, C., Vandewalle, N., Delcourt, J. and Poncin, P. Experimental evidences of a structural and dynamical transition in fish school, Phys. A Stat. Mech. its Appl., 2006, 367, pp 487–493. https://doi.org/10.1016/j.physa.2005.11.041 CrossRef Google Scholar

Arvin, F., Samsudin, K., Ramli, A.R. and Bekravi, M. Imitation of honeybee aggregation with collective behavior of swarm robots, Int. J. Comput. Intell. Syst., 2011, 4, pp 739–748. https://doi.org/10.1080/18756891.2011.9727825 Google Scholar

McGuire, K.N., De Wagter, C., Tuyls, K., Kappen, H.J. and de Croon, G.C.H.E. Minimal navigation solution for a swarm of tiny flying robots to explore an unknown environment, Sci. Robot., 2019, 4, (35), pp eaaw9710. https://doi.org/10.1126/scirobotics.aaw9710 CrossRef Google Scholar PubMed

Schranz, M., Umlauft, M., Sende, M. and Elmenreich, W. Swarm robotic behaviors and current applications, Front. Robot. AI, 2020, 7. https://doi.org/10.3389/frobt.2020.00036 CrossRef Google Scholar

Poljak, M. and Återbenc, A. Use of drones in clinical microbiology and infectious diseases: current status, challenges and barriers, Clin. Microbiol. Infect., 2020, 26, (4), pp 425–430. https://doi.org/10.1016/j.cmi.2019.09.014 CrossRef Google Scholar PubMed

Park, S. and Choi, Y. Applications of unmanned aerial vehicles in mining from exploration to reclamation: a review, Minerals, 2020, 10, pp 663. https://doi.org/10.3390/min10080663 CrossRef Google Scholar

Pham, H.X., La, H.M., Feil-Seifer, D. and Deans, M.C. A distributed control framework of multiple unmanned aerial vehicles for dynamic wildfire tracking, IEEE Trans. Syst. Man Cybern. Syst., 2020, 50, (4), pp 1537–1548. https://doi.org/10.1109/TSMC.2018.2815988 CrossRef Google Scholar

Márquez-Vega, L.A., Aguilera-Ruiz, M. and Torres-Treviño, L.M. Multi-objective optimisation of a quadrotor flock performing target zone search, Swarm Evol. Comput., 2021, 60, pp 100733. https://doi.org/10.1016/j.swevo.2020.100733 CrossRef Google Scholar

Wu, E., Sun, Y., Huang, J., Zhang, C. and Li, Z. Multi UAV cluster control method based on virtual core in improved artificial potential field, IEEE Access, 2020, 8, pp 131647–131661. https://doi.org/10.1109/ACCESS.2020.3009972 CrossRef Google Scholar

Dai, Y., Kim, Y.G., Wee, S.G., Lee, D.H. and Lee, S.G. A switching formation strategy for obstacle avoidance of a multi-robot system based on robot priority model, ISA Trans., 2020, 56, pp 123–134. https://doi.org/10.1016/j.isatra.2014.10.008 CrossRef Google Scholar

Olfati-Saber, R. Flocking for multi-agent dynamic systems: algorithms and theory, IEEE Trans. Automat. Contr., 2006, 51, (3), pp 401–420. https://doi.org/10.1109/TAC.2005.864190 CrossRef Google Scholar

Qiu, H. and Duan, H. A multi-objective pigeon-inspired optimisation approach to UAV distributed flocking among obstacles, Inf. Sci., 2020, 509, pp 515–529. https://doi.org/10.1016/j.ins.2018.06.061 CrossRef Google Scholar

Reynolds, C.W., herds, and schools: a distributed behavioral model, Proceedings of SIGGRAPH’87, 1987, vol. 21, pp 25–34. https://doi.org/10.1145/280811.281008 CrossRef Google Scholar

Fine, B.T. and Shell, D.A. Unifying microscopic flocking motion models for virtual, robotic, and biological flock members, Auton. Robots, 2013, 35, (2–3), pp 195–219. https://doi.org/10.1007/s10514-013-9338-z CrossRef Google Scholar

Zhu, B., Xie, L., Han, D., Meng, X. and Teo, R. A survey on recent progress in control of swarm systems, Sci. China Inf. Sci., 2017, 60, (7), pp 1–24. https://doi.org/10.1007/s11432-016-9088-2 CrossRef Google Scholar

Beaver, L.E. and Malikopoulos, A.A. An overview on optimal flocking, Annu. Rev. Control, 2021, 51, pp 88–99. https://doi.org/10.1016/j.arcontrol.2021.03.004 CrossRef Google Scholar

Vásárhelyi, G., Virágh, C., Somorjai, G., Nepusz, T., Eiben, A.E. and Vicsek, T. Optimized flocking of autonomous drones in confined environments, Sci. Robot., 2018, 3, (20), pp 1–14. https://doi.org/10.1126/scirobotics.aat3536 CrossRef Google Scholar PubMed

Zhao, W., Chu, H., Zhang, M., Sun, T. and Guo, L. Flocking control of fixed-wing UAVs with cooperative obstacle avoidance capability, IEEE Access, 2019, 7, pp 17798–17808. https://doi.org/10.1109/ACCESS.2019.2895643 CrossRef Google Scholar

Beaver, L.E. and Malikopoulos, A.A. Beyond Reynolds: a constraint-driven approach to cluster flocking, 2020 59th IEEE Conference on Decision and Control (CDC), 2020, pp 208–213. https://doi.org/10.1109/CDC42340.2020.9304333 CrossRef Google Scholar

Beaver, L.E., Kroninger, C. and Malikopoulos, A.A. An optimal control approach to flocking, 2020 American Control Conference (ACC), 2020, pp 683–688. https://doi.org/10.23919/ACC45564.2020.9147311 Google Scholar

Dierks, T., Brenner, B. and Jagannathan, S. Neural network-based optimal control of mobile robot formations with reduced information exchange, IEEE Trans. Control Syst. Technol., 2013, 21, (4), pp 1407–1415. https://doi.org/10.1109/TCST.2012.2200484 CrossRef Google Scholar

Yu, J., Ji, J., Miao, Z. and Zhou, J. Neural network-based region reaching formation control for multi-robot systems in obstacle environment, Neurocomputing, 2019, 333, pp 11–21. https://doi.org/10.1016/j.neucom.2018.12.051 CrossRef Google Scholar

Liu, Y., Huang, P., Zhang, F. and Zhao, Y. Distributed formation control using artificial potentials and neural network for constrained multiagent systems, IEEE Trans. Control Syst. Technol., 2020, 28, (2), pp 697–704. https://doi.org/10.1109/TCST.2018.2884226 CrossRef Google Scholar

Yang, S., Bai, W., Li, T., Shi, Q., Yang, Y., Wu, Y. and Chen, C.L.P. Neural-network-based formation control with collision, obstacle avoidance and connectivity maintenance for a class of second-order nonlinear multi-agent systems, Neurocomputing, 2021, 439, pp 243–255. https://doi.org/10.1016/j.neucom.2020.12.106 CrossRef Google Scholar

Nguyen, T.T., Nguyen, N.D. and Nahavandi, S. Deep reinforcement learning for multiagent systems: a review of challenges, solutions, and applications, IEEE Trans. Cybern., 2020, 50, (9), pp 3826–3839. https://doi.org/10.1109/TCYB.2020.2977374 CrossRef Google Scholar

Xu, D. and Chen, G. Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning, Aeronaut. J., 2022, pp 1–20. https://doi.org/10.1017/aer.2021.112 CrossRef Google Scholar

Hu, T.K., Gama, F., Chen, T., et al. Vgai: end-to-end learning of vision-based decentralized controllers for robot swarms, IEEE International Conference on Acoustics, Speech and Signal Processing, 2021, pp 4900–4904. https://doi.org/10.1109/ICASSP39728.2021.9414219 CrossRef Google Scholar

Schilling, F., Lecoeur, J., Schiano, F., et al. Learning vision-based flight in drone swarms by imitation, IEEE Robot. Autom. Lett., 2019, 4, (4), pp 4523–4530. https://doi.org/10.1109/LRA.2019.2935377 CrossRef Google Scholar

Hornik, K. Approximation capabilities of multilayer feedforward networks, Neural Networks, 1991, 4, (2), pp 251–257. https://doi.org/10.1017/S0001924000025173 CrossRef Google Scholar

Coello Coello, C.A. and Lechuga, M.S. MOPSO: a proposal for multiple objective particle swarm optimisation, Proceedings of 2002 Congress on Evolutionary Computation CEC 2002, 2002, vol. 2, pp 1051–1056. https://doi.org/10.1016/0893-6080(91)90009-T CrossRef Google Scholar