1. Introduction
Unmanned bicycle (UB), which is well known for their ability to maintain balance when steering, has been becoming increasingly popular and have been widely used in rescue and other tasks within a narrow space due to its great flexibility and ability in intelligent transportation [Reference Shafiekhani, Mahjoob and Akraminia1]. However, to accomplish particular tasks, the major challenging technical issue of UB is to ensure satisfying self-balancing performance in different driving scenarios, especially at low or even zero speeds.
Accurate dynamic modelling is the basis of realizing excellent self-balance of the UB system. Recently, many researches have been carried out from the perspectives of both mathematical models and control approaches [Reference Shafiekhani, Mahjoob and Akraminia1–Reference Nenner, Linker and Gutman5]. For example, in ref. [Reference Chen and Dao6], the dynamics of the UB using Lagrange’s equations for quasi-coordinates are developed, which considers pure rolling without slipping constraints between the ground and the two wheels. In ref. [Reference Astrom, Klein and Lennartsson7], the dynamics of bicycles such as self-stabilizing models and rear wheel steering models are considered and analysed from the perspective of control. In ref. [Reference Defoort and Murakami8], a dynamic model is presented, which considers the geometric-stabilization mechanisms due to bicycle trail. It is pointed out that the geometry is the most important factor that affects the stability of the bicycle. In ref. [Reference Xiong, Wang and Liu9], the dynamics of the Whipple bicycle is analysed, and a complete non-linear model is constructed, from which the equilibrium point of the bicycle in both straight and circular motions is determined. In ref. [Reference Tofigh, Mahjoob, Hanachi and Ayati10], the gyrostabilizer is used as the actuator to keep the UB balance, and the dynamic model of the UB together with the gyrostabilizer is developed using Newton–Euler method. Unfortunately, gyrostabilizers can dramatically increase the bicycle’s mass. Inspired by [Reference Grau, Kapitola, Weiss and Noack11], this work installs an independent reaction wheel (RW) in the UB. RW is usually used in spacecraft attitude control due to the advantages of simple structure, low cost and long duration. When the bicycle tilts, a high-performance motor-driven RW actively provides an anti-roll torque, which counteracts the tilting torque and balances the bicycle. Owing to the RW, the model of the UB is changed from a conventional single-input double-output system to a single-input single–output system, and as a result, the balancing control of the UB is greatly simplified.
On the other hand, the superior control strategy is also crucial for the UB self-balance. Many non-linear control strategies have been developed for non-linear systems, such as model-predictive control [Reference Buizza Avanzini, Zanchettin and Rocco12–Reference Nfaileh, Alipour, Tarvirdizadeh and Hadi13], fuzzy control [Reference Dimeas, Sako, Moulianitis and Aspragathos14–Reference Li, Zhang, Wu, Chen, Liu and Yao15], robust control [Reference Chocoteco, Morales, Feliu and Sira-Ramírez16], adaptive control [Reference Pierri, Muscio and Caccavale17] and sliding mode control (SMC) [Reference Sariyildiz, Mutlu and Yu18–Reference Villanueva, Luque-Vega, González-Jiménez and Arellano-Muro20]. Among these methods, as a classical non-linear control algorithm, SMC has been widely used in a variety of mechatronic systems due to its fast response, high precision and strong robustness against external disturbances. In ref. [Reference Liu, Liu and Wu21], a novel adaptive sliding controller based on a robust disturbance observer is designed for non-linear uncertain robotic systems to deal with the tracking control problem. In ref. [Reference Qiao and Zhang22], an adaptive non-singular integral terminal SMC for AUVs is presented to eliminate the need for pre-known uncertainty bounds, which offers the improved tracking accuracy, rapid convergence and robustness against uncertainties and disturbances. In ref. [Reference Qiao and Zhang23], an adaptive fast non-singular integral terminal SMC, based on desired trajectory, is proposed for AUVs, achieving faster convergence rates in AUV dynamics. In ref. [Reference Lin24], an adaptive non-singular terminal sliding mode tracking controller is designed for robotic systems using fuzzy wavelet networks to improve the control performance. In ref. [Reference Ma, Liu, Xiong, Li, Liu and Sun25], a novel predefined-time barrier function (BF) adaptive SMC strategy for robust control of disturbed systems is designed. This approach ensures that, should an escape event occur at any given time instant, the system trajectory is guaranteed to return to its ultimate bound within a predefined time frame. SMC has also been applied to UB due to its superior performance and strong robustness. In ref. [Reference Chiu and Wu26], a gyroscopic balancer based on fuzzy SMC is used to control the bicycle, wherein the roll angle is controlled by a backstepping algorithm. In ref. [Reference Hwang, Wu and Shih27], fuzzy SMC is designed, wherein the large uncertainty caused by complex ground conditions is well tackled by the inherent under actuated control scheme. Moreover, in the recent years, many artificial intelligence technologies are applied to the control field. In ref. [Reference Yao, Xu, Jiang and Yao28], reinforcement learning is introduced for high-accuracy tracking control of 6-degree-of-freedom (6-DOF) hydraulic robotic manipulators, demonstrating its effectiveness in providing system-level performance guarantees through experimental validation on a 6-DOF platform.
Based on the above discussions, for the purpose of achieving excellent balancing control of the UB, a BF-based adaptive integral terminal sliding mode (BFITSM) control scheme combined with an extreme learning machine (ELM) observer is proposed in this paper. The main contributions of this article are summarized as follows:
1) An adaptive integral terminal sliding mode controller combining ITSM and BF techniques is designed such that the reaching phase can be efficiently eliminated to improve the convergence performance. Since the BF-based adaptive gain is updated according to the lumped uncertainty, overestimation can be effectively suppressed. Moreover, the size of the region that the roll angle tracking error converges to can be exactly predefined. However, the parameter selection of BF is one of the difficulties in the controller design.
2) To counteract the effects of uncertain disturbances and reduce the chattering phenomenon, an ELM observer is developed to estimate the lumped uncertainty. The output weights of the proposed ELM are adaptively adjusted to ensure the Lyapunov stability of the closed-loop control. As such, the estimation of the ELM observer is used as compensating control input to further improve the closed-loop control performance. To achieve better training effect, the value of the $\boldsymbol{O}_{i}$ should be appropriately chosen. In this work, the output weight $\boldsymbol{O}_{\boldsymbol{i}}$ is chosen according to the Lyapunov function to keep the control system stable and achieve the excellent estimate.
3) Combined with the BF adaptive gain and ELM techniques, a finite-time control strategy that without any disturbance upper bound information and has excellent convergence performance is realized and applied to the UB experimental to verify the effectiveness. Experimental results demonstrate the excellent control performance of the proposed controller.
The reminder of this paper is organized as follows. In Section 2, the uncertain dynamic model of the RW-based UB is presented. An adaptive ITSM controller based on ELM observer is designed in Section 3, and the closed-loop stability analysis is rigorously given in detail. In Section 4, experiment studies on an actual UB are carried out by comparison with conventional control schemes. Section 5 finally concludes the paper.
2. Modeling of reaction wheel unmanned bicycle
Fig. 1(a) and 1(b) shows the top and rear views for the RW-based UB system, respectively. The dynamics of the UB can be described by the following second-order differential equation [Reference Astrom, Klein and Lennartsson7]:
For a better illustration, the signification of the notations appeared in Fig. 1 and Eq. (1) are given in Table I.
Since the roll angle of the bicycle is kept within a relatively small range in the process of motion, $\sin (\theta )$ in (1) can be approximated as $\theta$ [Reference Chen, Yan, Wang, Shao, Kurniawan and Wang29]. For simplification, $\frac{mf}{c}$ and $\frac{mh}{c}$ are denoted as $N$ and $P$ , respectively, and $mgh$ is denoted as $M$ . Then, at a small roll angle $\theta$ , (1) can be rewritten as
Since it is impossible to obtain accurate parameter values in a practical UB system, parametric uncertainties have to be considered in the physical model. The system parameters can be normally divided into a nominal part and an uncertain part as follows [Reference Sariyildiz, Mutlu and Yu18]:
where $J_{0}$ , $N_{0}$ , $P_{0}$ and $M_{0}$ are the nominal values determined from preliminary experiments, and $\Delta J$ , $\Delta N$ , $\Delta P$ and $\Delta M$ are the corresponding model parameter errors, respectively. Then, the dynamical Eq. (2) can be rewritten as
where $l$ is the lumped uncertainty, which is given by
According to Appendix A, the closed-loop control signal is chosen to be upper bounded by the following polynomial function in this paper:
Then the lumped uncertainty is bounded by [Reference Hu and Wang30], the detailed derivation process is given in Appendix B:
where $\overline{l}$ is the upper bound of $l$ , given by
where $\zeta _{0}$ , $\zeta _{1}$ , $\zeta _{2}$ , $\mu _{0}$ , $\mu _{1}$ and $\mu _{2}$ are unknown positive constants.
The control objective is to keep the roll angle of the UB within a small range under the impact of system uncertainty. To balance the UB, an appropriate reaction force $u$ can be produced such that the roll angle $\theta$ can be retained to be zero with high precision, fast response and strong robustness.
Remark 1. In practice, it is quite difficult to obtain the accurate bound information of the lumped uncertainty in (11). To deal with the effect of the lumped uncertainty, a conservatively large constant is usually selected in conventional SMC systems. However, the overly large bound may lead to severe control chattering and large control efforts. Although boundary layer technique can be normally used to alleviate this phenomenon, the control precision will be accordingly degraded. In this paper, in order to achieve better control performance, an ELM observer and BF-based adaptive ITSM controller will be designed such that not only the need for the prior uncertainty information can be eliminated wherein the control gain will be adaptively adjusted but also the control precision can be exactly ensured by tuning parameters.
3. Design of controller
In this section, we develop an ELM-based BFITSM (ELM-BFITSM) controller for the RW-based UB system. First, a feedback control component using ITSM is designed without considering the system uncertainty. Then, the compensating control component based on ELM and the reaching control component using BF-based adaptive law are designed to deal with the lumped uncertainty.
3.1. Design of ELM-BFITSM
In this work, the tracking error $e$ is defined as
where $\theta _{d}=0$ is the target roll angle which equals to zero under absolutely balanced state. Then, according to Eq. (7), the second derivative of error $e$ can be obtained as follows when $\theta _{d}=0$ :
To derive the controller, a novel integral terminal sliding mode variable $s$ is designed as [Reference Chen, Liu, Wang, Hu, Zheng, Ye and Zhang31–Reference Ye and Wang32]:
where $\alpha _{1}\gt 0$ and $\alpha _{2}\gt 0$ are constants, $q_{1}$ and $q_{2}$ are odd integers satisfying $q_{2}\gt q_{1}\gt 0$ to keep $e^{q_{1}/q_{2}}$ a real number and ensure finite time convergence of $e$ [Reference Feng, Yu and Man33]. The initial value $e_{I}(0)$ is set as
where $e(0)$ and $\dot{e}(0)$ are the initial values of the actual error status available for eliminating the reaching time. Ignoring the lumped uncertainty and combining $\dot{s}=0$ with (13), the equivalent control input of the proposed sliding mode controller can be designed as
Next, based on the ELM technique, the compensating control input $u_{1}$ of the closed-loop control system is designed as
where $\hat{l}$ is the estimate value of the lumped uncertainty via ELM that will be given later.
Furthermore, in order to achieve that the system converges from arbitrary initial state to the preset sliding mode surface quickly, a reaching control input is introduced as follows:
where $\hat{k}$ represents an adaptive control gain.
The schematic diagram of the proposed ELM-BFITSM is shown in Fig. 2. In this paper, we define a small positive number $\varepsilon$ by which to define the region of ( $-\varepsilon,\varepsilon$ ) that the sliding variable will converge to [Reference Shao, Zheng, Wang, Wang, Lu and Man34–Reference Labbadi, Hashim, Eltoukhy and Djemai35]. If the initial value of the sliding mode variable $s$ is outside the region $[{-}\frac{\varepsilon }{2},\frac{\varepsilon }{2}]$ , namely $\left| s(0)\right| \gt \frac{\varepsilon }{2}$ , it will converge to the region under a monotonically increasing gain. It defines $\overline{t}$ as the time instant when the initial state firstly converges into the above region. After the initial value of $s$ is in this region, namely $\left| s(0)\right| \leq \frac{\varepsilon }{2}$ , a BF-based gain will be activated. Explicitly, the proposed adaptive law is designed as
where the parameters $\sigma \geq 0$ , $\mu \gt 0$ , and the initial value of $\hat{k}(0)\gt 0$ . $\overline{t}$ is the time instant that the sliding variable converges to the region of $[{-}\frac{\varepsilon }{2},\frac{\varepsilon }{2}]$ , i.e., $\left| s(\overline{t})\right| =\frac{\varepsilon }{2}$ . $f_{b}(s)$ is the BF which is defined as
Remark 2. When $t\leq \overline{t}$ , under $\hat{k}=\mu \int _{0}^{t}\exp\! (\sigma | s| )| s| d\tau$ , the sliding variable $s$ will quickly converge to the region $[{-}\frac{\varepsilon }{2},\frac{\varepsilon }{2}]$ . When $t\gt \overline{t}$ , $\hat{k}$ is updated by the BF function $f_{b}(s)$ , whose output changes with the variation of $s$ . The reason why BF can reduce the chattering effect compared to constant gain is that: if $| s|$ gradually increases in $[0,\varepsilon )$ , $f_{b}(s)$ will also accordingly increase to a large number to pull $| s|$ back to the predefined region; if $| s|$ reaches the sliding surface, $\hat{k}$ will reduce to a small number. Meanwhile, in this way, it can guarantee that the control gain of the ELM-BFITSM is not overestimated.
Remark 3. In conventional SMC, an overlarge constant $k$ is usually selected as the gain of reaching law to keep the system stable, which requires the upper bound of $l$ at advance and leads to a serious chattering problem. The reason is that when $s$ crosses 0 frequently, the control law changes discontinuously in two values of $\pm k$ . In this paper, because of the introduction of BF, the final $u_{2}$ becomes continuous. Since the sliding variable $s$ is changing continuously, the $f_{b}(s)$ will be continuous either. It can be seen from Fig. 3 that when $s$ crosses 0, the final $u_{2}$ is continuously changed despite the involved signum function. Therefore, the signal chattering in control input can be effectively reduced. Moreover, $l$ is compensated by ELM ( $u_{1}$ ) such that the control gain needs to be greater than the estimation error and does not need to be greater than the upper bound, which will effectively reduce the chattering problem. Moreover, the BF function will further reduce chattering as we discussed above.
3.2. Design of ELM observer
The architecture of the ELM observer is shown in Fig. 4. The weight and bias of the input layer of ELM proposed by Huang [Reference Huang, Zhu and Siew36] are randomly generated without reverse adjustment, and the output layer weights are calculated directly through a least squares solution. Conversely, the radial basis function (RBF) necessitates the employment of iterative methodologies to ascertain the optimal values for the centres, widths, and weights of the network and require an iterative optimization process to adjust their parameters, which can be more time consuming. Therefore, compared with the conventional neural network algorithm, ELM has a lower algorithm complexity and a faster learning speed. The training process of ELM is mainly divided into two steps.
Step 1: Random generation of input weights $\boldsymbol{W}_{\boldsymbol{i}}$ and bias $\boldsymbol{b}_{\boldsymbol{i}}$
The output of the hidden layer is the input multiplied by the corresponding weight plus its deviation, and then it is summed by a non-linear function (activation function) of all the nodes. When the sample $X_{1}$ is input, the corresponding hidden layer output is as follows:
where $\boldsymbol{h}_{i}(X_{1})$ is the output of the i-th hidden layer node when the input sample $X_{1}$ is input, and G (·) is the activation function. We can obtain the output matrix H (X) of the hidden layer:
The network output of the feedforward neural network ELM with a single hidden layer is as follows:
where $\boldsymbol{T}=\left[\boldsymbol{t}_{1}^{\textrm{T}},\boldsymbol{t}_{2}^{\textrm{T}},\cdots,\boldsymbol{t}_{U}^{\textrm{T}}\right]^{\textrm{T}}\in R^{U\times m}$ , and $\boldsymbol{O}$ is the output weight, which will be obtained in Step 2.
Step 2: Obtain the output weight $\boldsymbol{O}_{\boldsymbol{i}}$ by minimizing the loss function
The goal of single hidden layer neural network learning is to minimize the output error, which is the difference between the network output $\boldsymbol{T}$ and the objective function $\boldsymbol{Y}$ . In order to achieve better training effect, it needs to choose the value of the $\boldsymbol{O}_{i}$ . That is, the weight $\boldsymbol{O}_{i}$ that connects the hidden layer. The output layer is solved by minimizing the approximate square variance. The objective function is as follows:
The above equation can also be written in the following form:
where $\boldsymbol{O}^{\textrm{*}}$ is the optimal solution that requires, and it can be solved by the following equation:
where $\boldsymbol{H}^{\dagger }$ is the generalized inverse of $\boldsymbol{H}$ .
Therefore, the estimated lumped uncertainty $\hat{l}$ in (18) can be obtained via the ELM mechanism modelled as
where $X_{in}(0)=[\theta,\dot{\theta }]^{\textrm{T}}$ is input vector and $\hat{\boldsymbol{O}}$ is an design function. The output matrix of the ELM is also designed as a piecewise function as shown below:
where $\rho$ is a positive constant and $\hat{k}$ is the adaptive gain given in (20). It should be noted that the initial condition of $\dot{\hat{\boldsymbol{O}}}^{T}$ depends on the initial condition $X_{in}(0)=[\theta (0),\dot{\theta }(0)]^{\textrm{T}}$ . In real applications, to keep a balanced state, we often select the initial condition as $\theta (0)=\dot{\theta }(0)=0$ .
The estimate value of lumped uncertainty is approximated by
where $l$ is the ideal output of single hidden-layer feedforward network, $\boldsymbol{O}^{*}$ is the optimal constant output weight vector, and $\zeta$ is the approximation error which is assumed to be bounded by
where $\tilde{l}=l-\hat{l}$ , and $k$ denotes the bound of the disturbance that is not compensated.
Remark 4. In this article, the inputs of ELM are chosen as $X_{in}=[\theta,\dot{\theta }]^{\textrm{T}}$ . The concealed layer input weights and biases are generated randomly according to the Gaussian distribution, and the output weights is obtained from the generalized inverse matrix. Different from the conventional ELM algorithm proposed for classification problems [Reference Huang, Lu, Tao, Li, Tan and Liu37–Reference Yan, Ji, Lu, Huang, Shen and Xue38], the proposed ELM algorithm estimates the lumped uncertainty and takes the estimate value as the compensating control input as shown in (18).
3.3. Stability analysis
Theorem 1: For system (2), if the sliding function is chosen as (14) and the controller is designed as
where $u_{0}$ is the equivalent control input (17), $u_{1}$ is the compensating control input (18) obtained by the ELM observer (28), and $u_{2}$ is the reaching control input (19). The proposed controller effectively avoids the singularity problem. Then, under the proposed controller (32), the sliding variable converges to ( $-\varepsilon,\varepsilon$ ) and the tracking error converges to $\left(-\dfrac{\varepsilon }{\alpha _{1}},\dfrac{\varepsilon }{\alpha _{1}}\right)$ , both in finite time.
Proof : According to (20), the proof will be accomplished by two steps, i.e., $0\leq t\leq \overline{t}$ and $t\gt \overline{t}$ .
Step 1. $\textbf{0}\leq \boldsymbol{t}\leq \overline{\boldsymbol{t}}$ for the control law (20)
Select a Lyapunov function $V_{1}$ as
where $L_{1}=\frac{s^{2}}{2}+\frac{\tau }{2}\tilde{k}^{2}$ with $\tau \gt 1$ , $\tilde{\boldsymbol{O}}$ is the estimation output error of the ELM output layer given by:
and $\tilde{k}$ is the adaptive error $\tilde{k}=\hat{k}-k$ .
Taking the derivative of $V_{1}$ with respect to time, we obtain:
Next, substituting (17), (18) and (19) into the above equation and combining (28) with (30) yield:
If $\dot{\hat{\boldsymbol{O}}}^{T}$ in (28) is chosen as
The disturbance in UB system is slow time varying, so there exists a slow-changing optimal parameter $\boldsymbol{O}^{\textrm{*}}$ , whose derivative is a very small number such that $\dot{\tilde{\boldsymbol{O}}}^{T}=-\dot{\hat{\boldsymbol{O}}}^{T}=-\rho s\boldsymbol{H}$ . Then, considering (20), (35) can be rewritten as
Till to now, it has been proved that for $0\leq t\leq \overline{t}$ the adaptive gain $\hat{k}=\eta \int _{0}^{t}\exp\! (\alpha | s| )| s| d\tau$ is bounded [Reference Shao, Zheng, Huang, Wang, Man and Fu39]. Then, defining $k$ as the upper bound of $\hat{k}$ that satisfies (31), we get $\hat{k}-k=-| k-\hat{k}|$ , and as a result,
Recalling (31) and considering that for any $\mu$ and $\sigma$ there exists a positive number $\tau$ such that $\tau \mu \exp\! (\sigma | s| )\gt 1$ , we have
for any $\left| s\!\left(t\right)\right| \gt \frac{\varepsilon }{2}$ , i.e., $\tilde{\boldsymbol{O}}\rightarrow 0$ which implies the convergence of the estimation. Particularly, for the Lyapunov function $L_{1}$ , by following a similar procedure, we can obtain:
By defining the following symbols:
where $\varrho _{1}$ , $\varrho _{2}\gt 0$ , we further obtain:
where $\vartheta _{1}=\min \{\sqrt{2}\varrho _{1},\sqrt{2}\varrho _{2}\}.$ Since $\vartheta _{1}\gt 0$ , there must exist a positive constant $T_{1}$ such that:
From (38), we have
Therefore, the inequality (40) satisfies the finite time stability criterion in Appendix C, which shows that there exists a finite time $\overline{t}$ for the sliding variable to converge to $[{-}\frac{\varepsilon }{2},\frac{\varepsilon }{2}]$ which is bounded by
Step 2: $\boldsymbol{t}\gt \overline{\boldsymbol{t}}$ for the control law (20).
To prove the stability, the following intermediate variable is introduced:
In order to prove that the sliding variable $s$ will be retained in the region of $({-}\varepsilon,\varepsilon )$ thereafter, we will first prove that $| s(t)| \leq \varnothing \lt \varepsilon$ will be ensured in finite time $t_{s}$ from $\left| s(\overline{t})\right| =\frac{\varepsilon }{2}\gt \varnothing $ . Considering the case of $\left| s(\overline{t})\right| =\frac{\varepsilon }{2}\gt \varnothing $ , a Lyapunov function $V_{2}$ is defined as
where $L_{2}=\frac{s^{2}}{2}+\frac{1}{2}\hat{k}^{2}$ . Taking the derivative of $V_{2}$ with respect to time yields.
According to Eqs. (17)-(20) and following a similar procedure in Step 1), we obtain:
where the fact of $\dot{s}=\frac{1}{J_{0}}\left(l-J_{0}\hat{k}\text{sign}(s)-\hat{l}\right)$ as indicated by (35) has been used. Furthermore, combining (28) with (30), we have
If $\dot{\hat{\boldsymbol{O}}}^{T}$ in (28) is chosen as
such that:
Then, (43) can be expressed as
According to (42), one can obtain that $\hat{k}(s)\gt \hat{k}(\varnothing )=| {J_{0}}^{-1}\zeta |$ . Then, we obtain:
for any $\varnothing \lt \left| s\!\left(t\right)\right| \leq \frac{\varepsilon }{2}$ which demonstrates that the estimation is converging. Moreover, by defining another intermediate variable $\varnothing^{\prime}=\varepsilon \frac{\left| \tilde{l}\right| }{\left| \tilde{l}\right| +J_{0}}$ and following a similar procedure, it can be obtained that:
Define the following symbols:
which are both positive since $\hat{k}(s)\gt \hat{k}(\varnothing^{\prime})=| {J_{0}}^{-1}\tilde{l}|$ . Then, we have
where $\vartheta _{2}=\min \{\sqrt{2}\varrho _{1},\sqrt{2}\varrho _{2}\}$ . Since $\vartheta _{2}\gt 0$ , there must exist a positive constant $T_{2}$ such that:
From (49), we have
Therefore, the inequality (51) satisfies the finite time stability criterion in Appendix C, which shows that there exists a finite time $\overline{t}$ for the sliding variable to converge to [- $\varnothing, \varnothing$ ] which is bounded by
According to [Reference Shao, Zheng, Wang, Wang, Lu and Man34–Reference Labbadi, Hashim, Eltoukhy and Djemai35], the sliding variable converges to $| s(t)| \leq \varnothing \lt \varepsilon$ in the finite time of $t=\overline{t}+t_{s}$ and will remain in that region ( $-\varepsilon,\varepsilon$ ). Furthermore, according to Appendix D, because of the ITSM (14)-(15), the tracking error $e$ will also be bounded in $\left| e\right| \lt \frac{\varepsilon }{\alpha _{1}}$ .
Here completes the proof.
Remark 5. The convergence of the sliding variable $s$ is divided into two steps: $\overline{t}$ represents the convergence time from the initial state to $\left| s(\overline{t})\right| =\frac{\varepsilon }{2}\gt \varnothing $ and $t_{s}$ represents the consumed time to further converge to $| s(t)| \leq \varnothing \lt \varepsilon$ . After the sliding variable converges, namely $| s| \leq \varnothing$ , the gain is finally bounded by $\hat{k}(s)\leq f_{b}( \varnothing )=| {J_{0}}^{-1}\zeta |$ as implied by (21). It can be seen from (42) that $ \varnothing $ accordingly increases (decreases) as the disturbance increases (decreases), and as a result, the tracking error can be limited as small as possible.
Remark 6. From (31), it can be concluded that when $s$ changes from 0 to $s=s_{2}\gt 0$ , $\hat{\boldsymbol{O}}$ in (30) needs to increase, so that $\zeta$ approaches 0. At the same time, $\hat{k}$ in (46) will also increase because of (20), therefore $\hat{\boldsymbol{O}}$ will increase. As a result, the disturbance compensation for the ELM observer can be more accurate.
3.4. Control parameter selection
The unmodelled dynamics, external disturbance and measurement noise in a practical system require that appropriate control parameters should be synthetically selected to balance the control speed, precision and signal smoothness. The parameters of ELM-BFITSM are shown in Table II.
-
1) Selections of $\alpha _{1}$ , $\alpha _{2}$ , $q_{1}$ , $q_{2} \,:\, \alpha _{1}$ affect the convergence rate of tracking error: the larger the value of $\alpha _{1}$ is, the faster the error will converge. However, an excessive $\alpha _{1}$ will increase the control amplitude as shown in (17). The integral term of the sliding variable is used to suppress the steady-state error, and the integral coefficient $\alpha _{2}$ is usually a small constant. $q_{1}/q_{2}$ will also affect the convergence rate of the error. Usually, $q_{1}$ and $q_{2}$ are odd integers, and $q_{1}/q_{2}$ is less than 1. In our experiment, we chose $\alpha _{1}$ = 2.4, $\alpha _{2}$ = 1, $q_{1}$ = 3, and $q_{2}$ = 5.
-
2) Selections of $\varepsilon$ , $\sigma$ , $\mu$ : The parameter $\varepsilon$ defines the convergence region of the tracking error. A smaller $\varepsilon$ implies a better control precision but too small $\varepsilon$ may cause control input saturation and lead to chattering problem. In the adaptive law (20), appropriate $\mu$ and $\sigma$ should be selected in order to make $s$ reach the region faster with an acceptable chattering. $\varepsilon$ = 0.05, $\mu$ = 1000, and $\sigma$ = 2.5 are selected.
-
3) Selections of $L$ , $\rho$ : The number of neural nodes $L$ in the hidden layer affects the estimation accuracy: the more the number of neural nodes in the hidden layer is, the more accurate the tracking result will be. However, considering that the calculation capacity of the STM32 microcontroller adopted in our bicycle is limited, the number of $L$ cannot be selected to be too large. The learning rate $\rho$ affects the estimation rate of uncertainty as shown in (36) and (45). A larger $\rho$ value can get a faster estimation result. However, too large $\rho$ value may also cause severe chattering. Finally, we chose $L$ = 5 and $\rho$ = 0.1.
4. Experimental verification
In this section, to demonstrate the superiority of the proposed controller, experiments compared with ITSM, AITSM and BFITSM controllers are carried out on a real UB system.
4.1. Experiment configurations
The UB system used in the experiment is shown in Fig. 5, in which Fig. 5(a) and 5(b) are its side and top views, respectively. The UB system mainly consists of a body frame, two wheels, and a RW. The RW is installed at the centre of the bicycle body, and its rotational direction is the same as the bicycle’s longitudinal direction. A servo motor is adopted to drive the RW with an equipped STM32 microcontroller. The UB is powered by an 11.1-volt lithium battery mounted at the rear bicycle. In addition, an encoder is used for measuring the speed of the RW, and an MPU-6050 motion sensor module is used to obtain the yaw angle of the bicycle.
The ITSM and AITSM controllers and the BFITSM controller in ref. [Reference Labbadi, Hashim, Eltoukhy and Djemai35] are used for comparison, presented as follows:
where the control parameters of above controller are chosen as the same as the proposed controller as given in Section 3.
4.2. Performance of stationary state (Case 1)
Ideally, we hope that the roll angle of the UB maintains at 0° under control. However, in practice, due to the inertial action of the UB and the dead-zone characteristics of the motor, the roll angle of the UB cannot absolutely be maintained at 0°. Therefore, the control goal is to reduce the roll angle as much as possible. In this case, the UB system is keeping static. The experimental results are shown in Fig. 6.
Fig. 6(a) shows the roll angle of the UB under the action of four controllers. It can be seen that although the UB is kept within an acceptable range, the roll angle under the ITSM controller is the largest, which reaches 0.0084 rad. The proposed ELM-BFITSM controller can restrain the roll angle at the smallest amplitude, and its maximum roll angle is only 0.0023 rad, which is smaller than AITSM control (0.0062 rad) and BFITSM control (0.0049 rad). Fig. 6 (b) shows the control inputs of the four controllers. It can be seen that the chattering range of ELM-BFITSM control is 0.0033 Nm, which is significantly smaller than ITSM control (0.0116 Nm), AITSM control (0.0061 Nm) and BFITSM control (0.0055 Nm). The reason is that the ELM observer and BF-based adaptive algorithm can well compensate for effect of the lumped uncertainty on the closed-loop system.
Fig. 6(c) shows the lumped uncertainty estimation of the UB system. It can be seen that the estimated disturbance varies during the experiment. As shown in Fig. 6(a), the tracking error under ELM-BFITSM is ensured in the predefined bound of ± $\frac{0.05}{2.4}$ = ±0.021 rad, which implies the proposed ELM-BFITSM can exactly predefine the region of the roll angle tracking error.
4.3. Performance of lateral force rejection (Case 2)
To verify the robustness of the ELM-BFITSM against external disturbances, one-tenth of the motor power is used to generate a lateral force. The lateral force is suddenly applied to the UB when it is at a stationary balanced state. The experimental results are shown in Fig. 7. It can be observed that the ITSM control has the largest maximum roll angle reaching 0.057 rad. In addition, the BFITSM control achieves a smaller maximum roll angle (0.042 rad) than AITSM control (0.047 rad). Compared with the above three controllers, the performance of ELM-BFITSM controller is significantly improved, whose maximum roll angle is 0.034 rad. The control signals are shown in Fig. 7(b), while the disturbance estimation in the proposed control by ELM observer is depicted in Fig. 7(c). The proposed ELM observer does not require any training process and can adaptively adjusted the output weights such that the estimated results can be changed in real time with the disturbance. It can be seen that due to the effective compensation of the disturbance by using ELM observer, not only the smallest value of the control amplitude for the proposed control is achieved but also the best control performance of the proposed control is obtained as clearly shown in Fig. 7(a).
4.4. Performance of S-curve driving (Case 3)
When the bicycle steers, the parametric uncertainty is activated due to the front wheel’s motion. In order to further verify the strong robustness of the ELM-BFITSM controller, the UB system is required to follow an S-curve path in this case. The tracking performance of the controllers is shown in Fig. 8. It can be seen from Fig. 8(a) that ITSM has the worst control performance with a maximum roll angle reaching 0.11 rad. Due to the introduction of BF, the maximum roll angle of BFITSM control reduces to 0.064 rad, which is smaller than AITSM (0.079 rad). Moreover, the tracking error of ELM-BFITSM is 0.053 rad, which indicates that the ELM-BFITSM achieves the best control performance under the changing disturbance condition. As seen from Fig. 8(b), the control signal of ELM-BFITSM is smoother than other three controllers, which does not have severe chattering problem. The estimate value of the uncertainty for the ELM-BFITSM control is shown in Fig. 8(c).
It can be seen from the above experimental results that, compared with ITSM, AITSM and BFITSM controllers, the proposed ELM-BFITSM control exhibits the best tracking performance. The reasons are given as follows: (i) Compared with the conventional ITSM control, better control performance is achieved due to the introduction of BF algorithm. (ii) Compared with adaptive ITSM and BFITSM control schemes, due to the fact that the ELM observer can compensate for the impacts of lumped uncertainty to effectively improve the control precision, the proposed control obtains the most superior control performance and robustness.
4.5. Performance comparisons
For performance analysis, root mean square error (RMSE) and maximum error (MAXE) values of the sampled tracking error $e(i)$ are taken into consideration for comparisons, which are defined as
where $G$ is the number of the sampled tracking error. Table III and Figs. 9(a) and 9(b) show the performance comparisons of four controllers in three cases. It can be seen that the proposed controller achieves the smallest RMSE and MAXE values, while the ITSM controller behaves with the worst performance, followed by the AITSM and BFITSM controllers. It can be concluded that compared with the ITSM and adaptive ITSM controllers, the proposed ELM-BFITSM controller can achieve excellent control performance of the UB system in practical applications.
5. Conclusion
In this paper, an ELM-BFITSM controller has been developed for the balancing control of an UB system. It has been shown that the proposed control strategy comprises a BF-based adaptive ITSM for eliminating the reaching phase and an ELM observer for further compensating the lumped uncertainty. As a result, the disturbance can be estimated and be fed forward by the inherent ELM observer. Moreover, due to the BF-based adaptive law, the control gain is automatically updated according the disturbance variations, and the overestimation is effectively suppressed. In addition, the proposed controller ensures that the roll angle of the UB converges to an exactly predefined region in finite time. The experimental results have verified the excellent control performance of the proposed controller compared to the conventional ITSM, conventional adaptive ITSM and BF-based ITSM controllers. Compared to ITSM, AITSM and BFITSM controllers, the RMSE of ELM-BFITSM control is improved by 72.6%, 57.4%, 52.2% in Case1; 31.4%, 20.1%, 7.5% in Case2 and 55.8%, 52.9%, 35% in Case3, respectively. In addition, the MAXE of the ELM-BFITSM control is improved by 72.6%, 62.9%, 53.1% in Case1; 40.4%, 27.7%, 19% in Case2 and 51.8%, 32.9%, 17.2% in Case3, respectively.
In practical applications, considering that some failures may occur such as actuator and sensor failures, our future research work will focus on sliding mode-based fault tolerant control of UB system.
Author contributions
Long Chen and Zhihui Jin conceived and designed the study. Guangyi Wang and Shuping He designed the hardware. Vladimir Stojanovic and Parisa Arabzadeh Bahri data gathering, simulation, and analyses. Zhihui Jin wrote this article. Ke Shao and Hai Wang provided guidance and supervision.
Financial support
This paper is partially supported by National Natural Science Foundation of China [Grant No. 61771178].
Competing interests
The authors declare no competing interest exist.
Ethical approval
None.
Appendix A
In this paper, the control law is designed as
Then, we have
where $v$ is the forward speed of the UB, $\delta$ is the steering angle of the front wheel relative to the positive direction of UB, which are set as the small value and change continuously in the experiment. $\hat{l}$ is the estimated result of ELM observer, which is bounded. $\hat{k}$ is the proposed adaptive law. If $\hat{k}=\mu \int _{0}^{t}\exp\! (\sigma | s| )| s| d\tau$ , the $\hat{k}$ is bounded because of the finite-time accessibility. If $\hat{k}=f_{b}(s)$ , the $\hat{k}$ is bounded because the $\hat{k}$ will increase quickly to pull $| s|$ back when $| s|$ gradually increases, so that the $\hat{k}$ is a large but bounded value.
Therefore, when $\zeta _{0}\gt | N_{0}v\dot{\delta }+P_{0}v^{2}\delta | +| \hat{l}| +| J_{0}\hat{k}|$ , $\zeta _{1}={max } (M_{0},J_{0}\alpha _{2})$ and $\zeta _{2}=J_{0}\alpha _{1}$ , the following inequality holds:
Appendix B
Based on (7), the $\ddot{\theta }$ is expressed as
The lumped uncertainty $l$ is given by
Then, we obtain:
Rearranging the terms, we get:
Therefore, we have
In this work, the disturbance $d$ and the uncertain items $\Delta N$ , $\Delta P$ , $\Delta M$ and $\Delta J$ are considered to be bounded. In addition, the steering angle $\delta$ is continuously changed such that its derivative is bounded as well [Reference Chen, Yan, Wang, Shao, Kurniawan and Wang29]. Therefore, the following inequalities for the bounded property are given as
Then, we yield:
As can be seen that the upper bound of the lumped uncertainty $l$ is associated with the structure of the designed controller. From (9), we have
Thus, we obtain the expression:
where:
This completes the proof.
Appendix C
Suppose a first-order non-linear differential inequality given by
where $\kappa \gt 0$ , $0\lt \eta \lt 1$ . $V(x)$ is a positive Lyapunov function with respect to the state $x\in R$ . Then, the function $V(x)$ will converge from any given initial condition $V(x(0))=V(0)$ to the origin in the finite time given by
The details can refer to ref. [Reference Moulay and Perruquetti40] and references therein.
Appendix D
The sliding function (14)-(15) can be rewritten as
Since $| s| \leq \varepsilon$ , if $\left| e\right| \gt \frac{\varepsilon }{\alpha _{1}}$ , we have $\alpha _{1}-\frac{\varepsilon }{e}\,:\!=\, \alpha _{1}^{\prime}\gt 0$ . It is clear that $e$ is decreasing to zero in finite time according to (78) [Reference Sariyildiz, Mutlu and Yu18]. In other words, $e$ will be bounded by
in finite time.