reliable docking control scheme for probe–drogue refueling...

10
Engineering Notes Reliable Docking Control Scheme for ProbeDrogue Refueling Jinrui Ren, Xunhua Dai, Quan Quan, Zi-Bo Wei, § and Kai-Yuan Cai Beihang University, 100191 Beijing, Peoples Republic of China DOI: 10.2514/1.G003708 I. Introduction A UTONOMOUS aerial refueling (AAR) is an important method to increase the voyage and endurance of unmanned aerial vehicles and avoid the conflict between the takeoff weight and the payload weight [1,2]. Among the aerial refueling methods in operation today, the probedrogue refueling (PDR) [3] is the most widely adopted one, owing to its flexibility and simple requirement for equipment. There are five stages in the process of PDR, and docking is the most critical and difficult stage because it is more susceptible to disturbances, which directly affects the success of AAR. The docking control task is to control the probe on the receiver link up with the drogue for fuel transfer. The docking control for PDR is a difficult task for two main reasons. The first reason is that the system model in the docking stage is a multi-input multi-output (MIMO) higher-order nonlinear system with nonminimum-phase, multiagent, and multidisturbance features, which is complex for control design. Moreover, the dynamics of the receiver is slower than that of the drogue, and so it is hard for the probe on the receiver to capture the moving drogue. The second reason is that the precision requirement for the PDR docking control is high. The docking error should be controlled within centimeter level, and the relative velocity between the probe and the drogue should be controlled within a small range, such as 1 1.5 ms [4]. Therefore, the docking controller design for PDR is important and challenging. Many research efforts have focused on developing docking control methods for PDR. First, the most commonly used method is the linear quadratic regulator [5,6] because it is simple, and the optimal feedback gain matrix can be obtained. It is a linear-model-based method, and the control quality may deteriorate in the presence of large disturbances. A second method is nonzero setpoint [7], which transforms a tracking problem into a stabilization problem. However, the reference state and reference input are limited to be constant; namely, the drogue is relatively static to the tanker. It means that the receiver just needs to perform a setpoint tracking task. Some improved work is then carried out to track a moving drogue [8], but the trajectory of the drogue is assumed to be known in advance. Third, to reject complex disturbances and uncertainties during the docking stage, active disturbance rejection control (ADRC) [9] and adaptive- based control [10,11] have received much attention recently. Generally, when ADRC is applied to the trajectory tracking of aircraft, the flight control system is designed by using scale separation. The considered system is usually divided into several loops, which leads to a complex control configuration. Adaptive- based control is also relatively complex. Moreover, fault-tolerant control is also studied. A fault-tolerant adaptive model inversion control for vision-based PDR was developed in Ref. [12]. Last, but not least, backstepping control was adopted in Ref. [13], which relies on the modeling precision of the PDR system. On the whole, there still exist some challenges to designing a docking controller for PDR. First, the PDR system is complicated. Moreover, as one of the main disturbances in the docking stage, the bow wave effect is a repetitive nonlinear disturbance and highly related to the states of the receiver and the drogue, which also makes docking control difficult [14,15]. Second, the problem of the slow dynamicsto track the fast dynamicsbecomes tougher if the bow wave effect is taken into account. Some feedback control methods may result in a chasing process between the receiver and the drogue, which may cause overcontrol. Besides, the chasing action may lead to impact and damage to the drogue and the probe, which is very dangerous and needs to be avoided according to ATP-56(B) issued by NATO (North Atlantic Treaty Organisation) [4]. Third, there are more than 20 states in the PDR system, and so it is quite demanding for sensors to detect so many states accurately. In practice, the Global Positioning System and vision-based navigation systems are common precise positioning methods for PDR [1]. Influenced by the environment, some unexpected sensor delay may happen. Besides, real-time online state feedback is time consuming. As a consequence, the stability and reliability of the docking operation may deteriorate. To cope with the aforementioned three challenges, a reliable docking control scheme for PDR is proposed in this study. Here, reliabilitymeans a small docking error, certain robustness against disturbances and uncertainties, and little dependence on the system model and sensors. The terminal iterative learning control (TILC, or terminal iterative learning controller, also designated TILC) method is used in the docking control scheme. TILC has captured extensive attention from many fields since it was first presented in Ref. [16]. Iterative learning control (ILC) [17] is an effective cycle-to-cycle control approach to achieve entire output trajectory tracking within a given time interval. When only the endpoint needs to be tracked, TILC is a good choice. In practice, when a pilot performs a docking operation, he/she adjusts the control input or the initial position of the receiver by learning from historical experience and sending feedforward commands to achieve a successful docking. Similarly, TILC gives feedforward control inputs or initial states based on the terminal output error from the last iteration. It is noteworthy that the three challenges mentioned earlier can be solved to some extent by TILC. First, TILC is basically a model-free control method, and little system model knowledge is required. Second, the control input of the receiver is a feedforward signal, which is calculated by the iterative learning process, and so the docking safety problem caused by some feedback control methods, which are mentioned in the last paragraph, can be avoided. Third, only terminal output information instead of all states is needed for TILC. Besides, TILC takes advantage of the repeatability of the docking stage. Given the advantages TILC has in tackling these challenges, TILC is a preferable way to solve the docking control problem for PDR. Received 18 March 2018; revision received 9 June 2019; accepted for publication 29 June 2019; published online 30 July 2019. Copyright © 2019 by the American Institute of Aeronautics and Astronautics, Inc. All rights reserved. All requests for copying and permission to reprint should be submitted to CCC at www.copyright.com; employ the eISSN 1533-3884 to initiate your request. See also AIAA Rights and Permissions www.aiaa.org/ randp. *Ph.D. Candidate, School of Automation Science and Electrical Engineering; [email protected]. Ph.D. Candidate, School of Automation Science and Electrical Engineering; [email protected]. Associate Professor, School of Automation Science and Electrical Engineering; [email protected]. § School of Automation Science and Electrical Engineering; weizibo@ comac.cc. Professor, School of Automation Science and Electrical Engineering; [email protected]. Article in Advance / 1 JOURNAL OF GUIDANCE,CONTROL, AND DYNAMICS Downloaded by BEIHANG UNIVERSITY on September 23, 2019 | http://arc.aiaa.org | DOI: 10.2514/1.G003708

Upload: others

Post on 31-Oct-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Reliable Docking Control Scheme for Probe–Drogue Refueling ...rfly.buaa.edu.cn/doc/2019RenReliable.pdfEngineering Notes Reliable Docking Control Scheme for Probe–Drogue Refueling

Engineering NotesReliable Docking Control Scheme

for Probe–Drogue Refueling

Jinrui Ren,∗ Xunhua Dai,† Quan Quan,‡ Zi-Bo Wei,§

and Kai-Yuan Cai¶

Beihang University, 100191 Beijing,

People’s Republic of China

DOI: 10.2514/1.G003708

I. Introduction

AUTONOMOUS aerial refueling (AAR) is an important method

to increase the voyage and endurance of unmanned aerial

vehicles and avoid the conflict between the takeoff weight and the

payload weight [1,2]. Among the aerial refueling methods in

operation today, the probe–drogue refueling (PDR) [3] is the most

widely adopted one, owing to its flexibility and simple requirement

for equipment. There are five stages in the process of PDR, and

docking is the most critical and difficult stage because it is more

susceptible to disturbances, which directly affects the success of

AAR. The docking control task is to control the probe on the receiver

link upwith the drogue for fuel transfer. The docking control for PDR

is a difficult task for two main reasons. The first reason is that the

system model in the docking stage is a multi-input multi-output

(MIMO) higher-order nonlinear system with nonminimum-phase,

multiagent, and multidisturbance features, which is complex for

control design. Moreover, the dynamics of the receiver is slower than

that of the drogue, and so it is hard for the probe on the receiver to

capture the moving drogue. The second reason is that the precision

requirement for the PDR docking control is high. The docking error

should be controlled within centimeter level, and the relative velocity

between the probe and the drogue should be controlled within a small

range, such as 1 ∼ 1.5 m∕s [4]. Therefore, the docking controller

design for PDR is important and challenging.

Many research efforts have focused on developing docking control

methods for PDR. First, themost commonly usedmethod is the linear

quadratic regulator [5,6] because it is simple, and the optimal

feedback gain matrix can be obtained. It is a linear-model-based

method, and the control quality may deteriorate in the presence of

large disturbances. A second method is nonzero setpoint [7], which

transforms a tracking problem into a stabilization problem. However,

the reference state and reference input are limited to be constant;

namely, the drogue is relatively static to the tanker. It means that thereceiver just needs to perform a setpoint tracking task. Someimproved work is then carried out to track a moving drogue [8], butthe trajectory of the drogue is assumed to be known in advance. Third,to reject complex disturbances and uncertainties during the dockingstage, active disturbance rejection control (ADRC) [9] and adaptive-based control [10,11] have received much attention recently.Generally, when ADRC is applied to the trajectory tracking ofaircraft, the flight control system is designed by using scaleseparation. The considered system is usually divided into severalloops, which leads to a complex control configuration. Adaptive-based control is also relatively complex. Moreover, fault-tolerantcontrol is also studied. A fault-tolerant adaptive model inversioncontrol for vision-based PDR was developed in Ref. [12]. Last, butnot least, backstepping control was adopted in Ref. [13], which relieson the modeling precision of the PDR system.On the whole, there still exist some challenges to designing a

docking controller for PDR. First, the PDR system is complicated.Moreover, as one of the main disturbances in the docking stage, thebow wave effect is a repetitive nonlinear disturbance and highlyrelated to the states of the receiver and the drogue, which also makesdocking control difficult [14,15]. Second, the problem of the “slowdynamics” to track the “fast dynamics” becomes tougher if the bowwave effect is taken into account. Some feedback control methodsmay result in a chasing process between the receiver and the drogue,whichmay cause overcontrol. Besides, the chasing actionmay lead toimpact and damage to the drogue and the probe, which is verydangerous and needs to be avoided according toATP-56(B) issued byNATO (North Atlantic Treaty Organisation) [4]. Third, there aremore than 20 states in the PDR system, and so it is quite demandingfor sensors to detect somany states accurately. In practice, the GlobalPositioning System and vision-based navigation systems arecommon precise positioning methods for PDR [1]. Influenced by theenvironment, some unexpected sensor delay may happen. Besides,real-time online state feedback is time consuming. As a consequence,the stability and reliability of the docking operation may deteriorate.To cope with the aforementioned three challenges, a reliable

docking control scheme for PDR is proposed in this study. Here,“reliability” means a small docking error, certain robustness againstdisturbances and uncertainties, and little dependence on the systemmodel and sensors. The terminal iterative learning control (TILC, orterminal iterative learning controller, also designated TILC) methodis used in the docking control scheme. TILC has captured extensiveattention from many fields since it was first presented in Ref. [16].Iterative learning control (ILC) [17] is an effective cycle-to-cyclecontrol approach to achieve entire output trajectory tracking within agiven time interval. When only the endpoint needs to be tracked,TILC is a good choice. In practice, when a pilot performs a dockingoperation, he/she adjusts the control input or the initial position of thereceiver by learning from historical experience and sendingfeedforward commands to achieve a successful docking. Similarly,TILC gives feedforward control inputs or initial states based on theterminal output error from the last iteration. It is noteworthy that thethree challenges mentioned earlier can be solved to some extent byTILC. First, TILC is basically a model-free control method, and littlesystemmodel knowledge is required. Second, the control input of thereceiver is a feedforward signal, which is calculated by the iterativelearning process, and so the docking safety problem caused by somefeedback controlmethods, which arementioned in the last paragraph,can be avoided. Third, only terminal output information instead of allstates is needed for TILC. Besides, TILC takes advantage of therepeatability of the docking stage. Given the advantages TILC has intackling these challenges, TILC is a preferable way to solve thedocking control problem for PDR.

Received 18 March 2018; revision received 9 June 2019; accepted forpublication 29 June 2019; published online 30 July 2019. Copyright © 2019by the American Institute of Aeronautics and Astronautics, Inc. All rightsreserved. All requests for copying and permission to reprint should besubmitted to CCC at www.copyright.com; employ the eISSN 1533-3884 toinitiate your request. See also AIAA Rights and Permissions www.aiaa.org/randp.

*Ph.D. Candidate, School of Automation Science and ElectricalEngineering; [email protected].

†Ph.D. Candidate, School of Automation Science and ElectricalEngineering; [email protected].

‡Associate Professor, School of Automation Science and ElectricalEngineering; [email protected].

§School of Automation Science and Electrical Engineering; [email protected].

¶Professor, School of Automation Science and Electrical Engineering;[email protected].

Article in Advance / 1

JOURNAL OF GUIDANCE, CONTROL, AND DYNAMICS

Dow

nloa

ded

by B

EIH

AN

G U

NIV

ER

SIT

Y o

n Se

ptem

ber

23, 2

019

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/1

.G00

3708

Page 2: Reliable Docking Control Scheme for Probe–Drogue Refueling ...rfly.buaa.edu.cn/doc/2019RenReliable.pdfEngineering Notes Reliable Docking Control Scheme for Probe–Drogue Refueling

In this Note, a TILC is designed for the PDR system (a MIMO

higher-order nonlinear system), and a convergence analysis is carried

out without linearization. The proposed controller selects the

combination of control inputs and initial values as its learning object,

which is consistent with the pilots’ operation. Besides, the iterative

optimization method is adopted to generate a suitable basis function

for the proposed TILC to achieve fast convergence, as well as to

control the docking process. By combining the generation of the basis

function and the proposed TILC, a reliable docking control scheme

for PDR is established.

A previous paper by Dai et al. [18] employed terminal iterative

learning to forecast the terminal position of the drogue and then

generate the tracking trajectory for the original autopilot system.

However, it is not a conventional TILC, and the low-level driving

signals still come from the original autopilot. There are significant

differences between the work presented in this Note and that

presented in Ref. [18]:1) The TILC works as an additional unit for the trajectory

generation of the original autopilot system in Ref. [18], whereas thedesigned TILC in this study is a low-level controller without a giventrajectory.2) The proposed TILC is a hybrid controller with control input

learning and initial position learning, whereas only the docking errorlearning was used in Ref. [18].3) A basis function is introduced into TILC for faster convergence.4) The controlling of the x-axis relative velocity is achieved by the

trajectory planning of the x-axis relative position rather than asaturation bound.

The main contributions of the study are twofold:

From the practical perspective, the contribution of this study is that

a reliable docking control scheme is proposed based on TILC. With

this control scheme, the PDR system can achieve a successful

docking quickly, accurately, and safely. TILC is a kind of model-free

feedforward learning control, which can avoid the chasing action andreduce sensor burden, and thus possesses some advantages overmodel-based control and real-time feedback control in terms of thedocking control problem.From the theoretical perspective, the contribution of this study is

that the TILC design problem for a class of MIMO higher-ordernonlinear system is addressed with two key problems solved. One isthat a convergence analysis is carried out without linearization; theother is that a suitable basis function is selected by the iterativeoptimization method. Moreover, the partial state controllability isalso considered in the TILC design.The Note is organized as follows. The model description and

problem statement are presented in Sec. II to transform the practicaldocking control problem into a theoretical TILC control problem.Section III gives a reliable docking control scheme including thegeneration of the basis function and the TILC implementation. InSec. IV, a suitable basis function is generated for the TILC. Section Vis devoted to the details of the TILC design with the convergenceanalysis. Illustrative simulations are provided in Sec. VI to show theeffectiveness of the proposed scheme. Section VII concludesthe Note.

II. Model Description and Problem Statement

A. PDR System Model in the Docking Stage

Figures 1 and 2 show the PDR system, which consists of a tankeraircraft with a flexible hose that trails behind and below the tanker, acone-shaped drogue mounted at the end of the hose, and a receiveraircraft equipped with a rigid probe protruding from its nose. Thedocking control task is to control the probe link upwith the drogue forfuel transfer. When establishing the PDR system model in thedocking stage, three commonly used coordinate frames are theground frame (og–xgygzg), the tanker frame (ot–xtytzt), and thedrogue equilibrium-point frame (od–xdydzd), which are shown inFig. 1. The definition of these coordinate frames can be foundin Ref. [15].According to Refs. [19,20], the receiver aircraft (F-16 aircraft is

considered) is modeled as

8<:

_xr � Arxr � Brur �Grfa�pr

vr

�� Crxr

(1)

�pp

vp

��

�pr

vr

��

�pp∕r0

�(2)

tx

t t( )o p

ty

gtv

tz

gth

gogx

gy

gz

d d( )o pdx

dy

dzpp

Tanker

Hose Drogue

Receiver

Fig. 1 Coordinate frames for the PDR system [15].

ppdp

dp

pp

P dp p

dr

dr

dr

a) Left view b) Rear view

d) Geometrical modelc) Top view

d px x

d py y

d pz z

pvd,xv

Fig. 2 Three views and the geometrical model of the PDR system.

2 Article in Advance / ENGINEERING NOTES

Dow

nloa

ded

by B

EIH

AN

G U

NIV

ER

SIT

Y o

n Se

ptem

ber

23, 2

019

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/1

.G00

3708

Page 3: Reliable Docking Control Scheme for Probe–Drogue Refueling ...rfly.buaa.edu.cn/doc/2019RenReliable.pdfEngineering Notes Reliable Docking Control Scheme for Probe–Drogue Refueling

where xr ∈ R12 is the state vector of the receiver; and ur ∈ R4 is thecontrol input vector consisting of the throttle δT ∈ R, elevatorδe ∈ R, rudder δr ∈ R, and aileron δa ∈ R. The receiver mainlysuffers from the atmospheric turbulence force fa ∈ R3. Themodeling and simulation methods for fa have been extensivelystudied in the existing literature [5,21]. The vector pr � �xryrzr�T ∈R3 denotes the position of the receiver’s center ofmass under the tankframe, which is combined with the velocity of the receiver vr ∈ R tobe the system output. The vector pp � �xpypzp�T ∈ R3 and vp ∈ Rare the position and velocity of the front end of the probe, andpp∕r ∈ R3 denotes the relative position from pr to pp. Note that allthe system matrices Ar, Br, Gr, and Cr are time invariant and ofappropriate dimensions. Moreover, stability augmentation control(SAC) is often adopted for the receiver to place the poles of thereceiver system to reasonable positions in the left-half s plane [19]. Tothis end, a state feedback matrix Kr ∈ R4×12 can be designed, andthen matrix Ar becomes

�Ar � Ar − BrKr (3)

According to Ref. [15], the drogue dynamics is expressed by atransfer function, for which the corresponding state-spacerepresentation is 8><

>:_xd � Adxd �Gd�fb � fa��pd

vd,x

�� Cdxd

(4)

where xd ∈ R10 is the state of the drogue, for which the dimension isdetermined by the order of the fitting model in the identification; andthe input fb, fa ∈ R3 are the bow wave effect force [22] and theatmospheric turbulence force. Thevectorpd � �xdydzd�T ∈ R3 is theposition of the drogue under the tank frame, which is combined withthe velocity of the drogue in the x-axis direction of the tank framevd,x ∈ R to be the system output. Similar to the receivermodel, all thesystemmatrices Ad,Gd, andCd are time invariant and of appropriatedimensions. Furthermore, the bow wave effect can be represented bya nonlinear function [15]

fb � ϕ0�pd − pp� (5)

Apart from the disturbance forces fb and fa, there are no othercontrol inputs in the drogue dynamics (4). Therefore, the drogueposition is passively affected by the aerodynamic disturbances fromthe receiver (in a close range) and the atmospheric environment,which is a difficulty for the docking control design.On the whole, by combining Eqs. (1–5), a comprehensive model

for the PDR system is described as

8>>>>>>>><>>>>>>>>:

�_xd_xr

�|{z}

_x �

�Ad 00 �Ar

�|���{z���}

A

�xdxr

�|{z}

x �

�0Br

�|{z}

B

ur|{z}u �

�Gdϕ0�pd−pp�

0

�|������������{z������������}

ϕ�y� �

�Gdfa

Grfa

�|���{z���}

φ�pd−pp

vd,x−vp

�|�����{z�����}

y ��Cd −Cr �|����{z����}

C

�xdxr

�|{z}

x �

�−pp∕r0

�|���{z���}

d

(6)

When Eq. (6) is established in the drogue equilibrium-point frame,the initial states are

xd�0� � 0, xr�0� � xr0 (7)

The physicalmeaning of Eq. (7) is that, when t � 0, the drogue is stilland steady, and the receiver stays away from the drogue at an initialposition xr0.In practice, it is expected that the docking process lasts from tstart to

tend. The docking stage terminates when

tdock � argtmin jxd�t� − xp�t�j (8)

which is expected to be equal to tend (desired docking moment). Atthis moment, if

edock � k�yd�tdock�zd�tdock��T − �yp�tdock�zp�tdock��Tk < rd,

− vmax < vd,x�tdock� − vp�tdock� < −vmin (9)

the docking attempt is regarded as successful [23], where vmax andvmin are the thresholds of relative velocity to open the fuel valve at thedockingmoment and rd is the radius of the drogue as shown in Fig. 2.

B. TILC Problem Statement

Regarding the control design, there are four outputs of xd∕p, yd∕p,zd∕p, and vdx=p (pd − pp ≜ �xd∕pyd∕pzd∕p�T , vd,x − vp ≜ vdx=p) to becontrolled. The controlling of vdx=p can be achieved by the trajectoryplanning of xd∕p. Thus, the following system from Eq. (6) is left to beconsidered:

�_x � Ax� Bu� ϕ� �y� � φ�y � �Cx� �d

(10)

where �y � �xd∕pyd∕pzd∕p�T , u � �δTδeδaδr�T , �C is generated from Cby removing the last row, and �d is generated from d by removing thelast row.According to Sec. I, TILC is a preferable way to solve the docking

control problem. Thus, system (10) is rewritten as a model forcontroller design:

�_xk�t� � Axk�t� � Buk�t� � ϕ� �yk�t�� � φk

�yk�t� � �Cxk�t� � �d,xk�0� � x0,k(11)

where t ∈ �0,T� is the time with the cycle period T � tend − tstart, thesubscript k ∈ N� is the cycle number, xk ∈ R22, uk ∈ R4, and�yk ∈ R3. The desired terminal output �yd�T� is known. Henceforth,for convenience, the variables t and k will be omitted except whennecessary. The following assumptions are made on system (11).Assumption 1: The initial state xk�0� can be reset at every

iteration k.Assumption 2: The atmospheric turbulence force is bounded

and kφk − φk−1k ≤ D.Assumption 3: The function ϕ� �y� satisfies the local Lipschitz

condition onM, namely,

kϕ� �yk� − ϕ� �yk−1�k ≤ lϕk �yk − �yk−1k

where

M � f �yj �y − �yd�T� ∈ B�03×1,δ�g

is an open connected set with

B�03×1,δ� ≜ fξ ∈ R3jkξ − 03×1k ≤ δ,δ ∈ R�g

denoting a neighborhoodwith the radius δ around the origin 03×1, andlϕ is a positive Lipschitz constant.In practice, if a docking attempt fails, the receiver will retreat to the

standby position for the next attempt [18], which means the initialstate xk�0� can be reset at every iteration k, namely, Assumption 1.The initial state xk�0� can be measured by various types of sensors:for example, vision-based systems [12]. Assumption 2 is anassumption on the disturbance φ. It is well known that ILC canremove repetitive disturbances. Thus, one only needs to know thevariations of disturbances in any two consecutive cycles.Assumption2 describes the bounds of such variations. Because the dockingoperation is just allowed in calm days, atmospheric turbulence forcecan be bounded [5,21]. As for Assumption 3, the nonlinear termϕ� �y�comes from the bow wave effect, for which the concrete expressionsatisfies the local Lipschitz condition [15]. Thus, all the threepreceding assumptions are reasonable for the PDR docking controlproblem.

Article in Advance / ENGINEERING NOTES 3

Dow

nloa

ded

by B

EIH

AN

G U

NIV

ER

SIT

Y o

n Se

ptem

ber

23, 2

019

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/1

.G00

3708

Page 4: Reliable Docking Control Scheme for Probe–Drogue Refueling ...rfly.buaa.edu.cn/doc/2019RenReliable.pdfEngineering Notes Reliable Docking Control Scheme for Probe–Drogue Refueling

For an actual PDR system, the drogue position and the relativeposition between the probe and the drogue are usually measured byvision-based sensors for which the measurement precision dependson the relative distance (higher precision in the closer distance).Therefore, compared with the trajectory data, the terminal positionsof the probe and the drogue are usually easier to measure in practice.Then, the controller directly uses the terminal docking error (relativeposition between the drogue and the probe at the dockingmoment) togive the next control input. Filters can also be used to remove thesensor noise.Objective: The control objective is to construct a sequence of

control uk�t�, t ∈ �0,T� for system (11) such that

k �yd�T� − �yk�T�k < rd as k → ∞ (12)

where �yk�T� is the corresponding output at the terminal time T drivenby uk�t�.

III. Reliable Docking Control Scheme

With control problem (12) in hand, a docking control schemeaiming for fast and reliable PDR docking is put forward in thissection, which is depicted in Fig. 3. The control scheme includestwo parts:1) Design a TILC to achieve the control objective (12).2) Generate a basis function for the designed TILC to obtain fast

convergence, as well as to control the docking process.The generation of the basis function matrix belongs to the

preparation stage of docking control, and the designedTILCworks inthe implementation stage of docking control. In the preparation stage,there is no need to consider the complex PDR model directly. Thereceiver system with SAC (13) is considered to get a suitable basisfunction for the TILC design based on the actual PDR system in theimplementation stage. Through the preparation work, a satisfactorybasis function can be attained offline before the docking operation.During the implementation stage, the designed TILC works in arepetitive and learning way to achieve a successful dockingeventually. Based on the established basis function matrix, the TILCis simple and easy to apply. The details of the two parts of theproposed docking control schemewill be described in the subsequentsections.

IV. Offline Generation of Basis Function

A better basis function leads to a faster convergence for TILC.Because the iterative process of the docking control for PDR isexpected to be successfully finished within 2 ∼ 3 attempts, the basisfunction Ub�t� should be carefully selected here. Regarding theselection of the basis function, the common practice is as follows:First, the output profile is parameterized into a polynomial or otherfunctions. Then, the unknown parameters in the proposed functionare calculated based on known conditions, which means the outputprofile is determined. Finally, with the known output profile and a

simplified system model, one may obtain the corresponding input,which can be used to determine a suitable basis function [16]. In thissection, a suitable basis function matrix is chosen by an iterativeoptimization method for better performance. First, a referencedocking trajectory pr,r�t� should be provided. Then, it is aimed toobtain the corresponding reference inputs leading to the givenreference docking trajectorypr,r�t�, and the obtained reference inputsmake up the desired basis function matrix Ub�t�.A reference docking trajectory is a reasonable smooth flight

trajectory for the receiver to finish the docking task. To this end, theexisting work mainly adopts three methods: a low-pass filter method[24], a polynomial interpolation method [6], and a terminal guidancemethod [25]. To establish basis function matrix Ub�t�, a referenceoutput trajectory pr,r�t� is predesigned to satisfy the dockingrequirements of PDR systems, and especially to guarantee theterminal docking error. Such a reference output trajectory also needsto give a docking relative velocity of 1 ∼ 1.5 m∕s. The generation ofthe reference docking trajectory is well studied and omitted here.As mentioned in Sec. I, ILC is a method to generate inputs that can

achieve a perfect tracking of the reference docking trajectory.However, the receiver is nonminimumphase, and so adjoint-type ILCis considered here. For the receiver with SAC

�_xr � �Arxr � Brurpr � �Crxr

(13)

an adjoint-type ILC is designed as

ur,k�1 � ur,k � αkG��pr,r�t� − pr�t�� (14)

where ur � �δTδeδaδr�T , �Cr is generated from Cr by removing thelast row, G is the operator of system (13), and G� is the adjointoperator of G. One can refer to Ref. [26] for the selection of αk andother details. After 20 offline iterations, the actual outputs track thereference docking trajectory completely, as shown in Fig. 4. Thecorresponding reference inputs δT,r, δa,r, δe,r, and δr,r are depicted inFig. 5. Then, the desired basis function matrix is obtained as

Ub�t� �

2664δT,r�t� 0 0 0

0 δa,r�t� 0 0

0 0 δe,r�t� 0

0 0 0 δr,r�t�

3775 (15)

In practice, in the presence of successful docking experiences, onecan directly select a practical docking trajectory as a referencedocking trajectory pr,r�t� and then obtain the corresponding basisfunction Ub�t�, or one can select a practical pilots’ operation as thebasis function Ub�t�.Until now, the desired basis function matrix Ub�t� has been

generated; namely, the preparation work of docking control has beendone. Next, a TILC based on the generated basis function matrixUb�t� is to be designed.

V. TILC Design

This section presents the TILC design for MIMO higher-ordernonlinear system (11) to achieve the control objective (12). For mostof the time, traditional TILC controllers use the control input or initialstate value as their learning object, whereas a hybrid TILC aiming toimprove the controller performance by a combined learning object isdeveloped in thisNote. Taking the proposed TILC into consideration,the overall closed-loop block diagram of a PDR system during thedocking process is depicted in Fig. 6.In general, only partial initial states are controllable, such as, in the

docking stage of PDR, the initial position of the receiver pr iscontrollable. The initial position of the receiver can be changed inevery docking attempt to perform a successful docking. According tothe state controllability, the initial state can be rewritten asFig. 3 TILC-based reliable docking control scheme.

4 Article in Advance / ENGINEERING NOTES

Dow

nloa

ded

by B

EIH

AN

G U

NIV

ER

SIT

Y o

n Se

ptem

ber

23, 2

019

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/1

.G00

3708

Page 5: Reliable Docking Control Scheme for Probe–Drogue Refueling ...rfly.buaa.edu.cn/doc/2019RenReliable.pdfEngineering Notes Reliable Docking Control Scheme for Probe–Drogue Refueling

a) b)

c) d)Fig. 4 Iterative tracking for the given reference docking trajectory pr;r�t� by the adjoint-type ILC.

a) b)

c) d)Fig. 5 Reference inputs leading to the given reference docking trajectory pr;r�t�.

Fig. 6 Closed-loop system block diagram of a PDR system with the proposed TILC.

Article in Advance / ENGINEERING NOTES 5

Dow

nloa

ded

by B

EIH

AN

G U

NIV

ER

SIT

Y o

n Se

ptem

ber

23, 2

019

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/1

.G00

3708

Page 6: Reliable Docking Control Scheme for Probe–Drogue Refueling ...rfly.buaa.edu.cn/doc/2019RenReliable.pdfEngineering Notes Reliable Docking Control Scheme for Probe–Drogue Refueling

x0,k � �ξTηTk �T � M1ξ�M2ηk (16)

where ξ ∈ R19 denotes the uncontrollable initial state, and ηk ∈ R3

(pr) denotes the controllable initial state, which will be used in the

next iteration;M1 � �IT190T3×19�T , andM2 � �0T19×3IT3 �T . The systemforwhich the initial state does not fit the order can be transformed into

the form of Eq. (16) by changing the state order.The learning control law for system (11) is designed as

uk�t� � Ub�t�qk (17)

where qk ∈ R4 is a constant parameter vector; andUb�t� ∈ R4×4 is a

diagonal basis function matrix, which was generated in the last

section.Because the combination of the control input and the initial state

value is chosen as the learning object, the learning update law of qkand ηk is proposed as

�qkηk

��

�qk−1ηk−1

��

�L1

L2

�ek−1�T� (18)

where L1 ∈ R4×3 and L2 ∈ R3×3 are constant parameter matrices,

and ek�T� ≜ �yd�T� − �yk�T� is the docking error. Then, the followingtheorem provides the convergence condition of the designed

controller including Eqs. (17) and (18).Theorem 1: For system (11), suppose that 1) Assumptions 1–3 are

satisfied; and 2) the control law and update law of TILC are designed

as Eqs. (17) and (18), respectively. If the following inequality

α� γ < 1 (19)

holds, then the docking error

kek�T�k →ε

1 − α − γ(20)

as k → ∞, where

α ���I3 − �CM2L2 −

ZT

0

�CBUb�τ�L1dτ��,

γ �Z

T

0

�k �CAk � lϕk �Ck2�β�τ�dτ,

β�t� � kM2L2ke�kAk�lϕk �Ck�t �Z

t

0

e�kAk�lϕk �Ck��t−τ�kBUb�τ�L1kdτ,

ε � DTk �Ck �Z

T

0

�k �CAk � lϕk �Ck2�DTe�kAk�lφk �Ck�τdτ

Moreover, if there is no atmospheric turbulence disturbance (namely,D � 0 and ε � 0), then

��ek�T��� → 0.Proof: Refer to the Appendix. □

Because of the introduction of basis function matrix Ub�t�, theiterative learning object is transformed from the control input uk�t� toa parameter vector qk, which makes the controller implementation

simpler. Besides, because the basis function matrix Ub�t� isgenerated by using adjoint-type ILC to solve the nonminimum-phase

problem in Sec. IV, the designed TILC control law [Eq. (17)] can

apply to the PDR system with a nonminimum-phase feature.As the real PDR system is computer controlled, a sampled-data

controller is preferred, which can be obtained by discretizing the

designed continuous controller. According to the controller

emulation design [27], a continuous-time controller is first designed

based on the continuous-time plant model (6). Then, the obtainedcontinuous-time controller is discretized according to the sampling

period Ts.For the proposed continuous-time terminal iterative controller

[Eqs. (17) and (18)], controller discretization is relatively easy.

Because qk, ηk,L1,L2, and ek�T� are all constantmatrices or vectors,

only the basis function Ub�t� needs to be discretized to Ub�jTs�,

j � 0,1, : : : N, T � NTs. Furthermore, the basis function Ub�t� isestablished in the preparatory stage and remains unchanged in theimplementation stage; thus, it can be directly discretized bysampling. The final sampled-data controller is the N-samplesequence of the control input as follows:

uk�jTs� � Ub�jTs�qk,�qkηk

��

�qk−1ηk−1

��

�L1

L2

�ek−1�T� (21)

Agenetic algorithm (GA) is a little bit similar to TILCbecause theyboth work in an iterative way. However, they have different origins,different definitions, and different applications. A GA is ametaheuristic inspired by the process of natural selection that belongsto the larger class of evolutionary algorithms. Genetic algorithms arecommonly used to generate high-quality solutions to optimizationand search problems by relying on bioinspired operators such asmutation, crossover, and selection. The iterative convergence speedof TILC is much higher than a GA. This makes TILC a better methodfor PDR because the iterative process of the docking control for PDRis expected to be successfully finished within two to three attempts[4]. Moreover, a GA is related to probability, but that is not the casefor TILC. There is also a connection between them that a GA can beused to determine the optimal TILC parameters.

VI. Simulation

In this section, the feasibility and the performance of the proposedTILC-based control scheme are investigated through the simulation.

A. Simulation Configuration

A MATLAB/SIMULINK-based simulation environment with athree-dimensional virtual-reality display has been developed by theauthors’ research laboratory to simulate the docking stage of PDR.The detailed information about the modeling procedure, modelparameters, and simulation environment can refer to previous worksofDai et al. [14] andWei et al. [15]. It is noteworthy that, although thedrogue dynamics [Eq. (4)] is considered in the controller design, thelink-connected model of the hose–drogue system is adopted in thesimulation environment. A hose–drum unit is also included toimprove the fitness of the simulation model. Moreover, the actuatordynamics of the aircraft is also considered.The TILC parameters used in this Note are set as

q1 �

26641

1

1

1

3775, L1 �

26640.1 0 0

0 −0.4 0

0 0 −10 14 0

3775,

L2 �24 0.3 0 0

0 0.4 0

0 0 0.4

35

The values forL1 andL2 used in this Note are selected by manualtuning, in which the physical meaning of position error feedforwardcan be used. Some other optimization methods (for example, agenetic algorithm, an ant colony algorithm, and machine learning)can be used to obtain optimal parameters for the proposed controllers.These methods are powerful but time consuming. Furthermore, theoptimal parameter may not be achievable in practice because the fullaccurate knowledge about themodel is not available. Thevideo of thedocking control performance by using the proposed control schemecan be viewed online.**

B. Basic Simulation

First, traditional TILC controllers that use control inputsor initial state values as their learning objects are considered

**The video titled “AReliable Docking Control Scheme for Probe-DrogueRefueling,” is available online at https://youtube/gTqAhAPlNvQ [retrieved10 June 2019].

6 Article in Advance / ENGINEERING NOTES

Dow

nloa

ded

by B

EIH

AN

G U

NIV

ER

SIT

Y o

n Se

ptem

ber

23, 2

019

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/1

.G00

3708

Page 7: Reliable Docking Control Scheme for Probe–Drogue Refueling ...rfly.buaa.edu.cn/doc/2019RenReliable.pdfEngineering Notes Reliable Docking Control Scheme for Probe–Drogue Refueling

for comparison. By letting q1 � � 1 1 1 1 �T , L1 ��0.1 0 0; 0 − 0.4 0; 0 0 − 1; 0 14 0�, and L2 � 03×3, the TILC

with control input learning (controller 2) is obtained. By letting

q1 � � 1 1 1 1 �T , L1 � 04×3, and L2 � diag�0.9, 0.5, 0.5� theTILC with the initial value learning (controller 3) is obtained. The

convergence of the terminal docking error for the proposed hybrid

TILC (controller 1) and the two traditional TILC controllers is

compared in Fig. 7, from which one can see that the convergence

speed of the three controllers can be ordered as: controller 1 >controller 2> controller 3. It is consistentwith the theoretical analysis

that the hybrid TILCwith both control input learning and initial value

learning has the best convergence property. As iteration number kincreases, the terminal docking errors decrease. A successful docking

can be achieved at the second docking attempt, which meets the

docking requirements of the PDR system.

In the following, the simulation verification for the proposed

hybrid TILC will be focused on. The iterative process of the system

outputs is displayed in Fig. 8, which shows that the change of initial

position incurs a trajectory translation during the first half (before the

20 s). However, in the second half, not just a trajectory translation

effect is incurred because of the system nonlinearity and the change

of control inputs. Actually, the docking terminates at tend � 30 s,and it is for a better display to give the later outputs in 30 ∼ 50 s(which do not exist in practice). It also can be seen that the drogue

deviates from the equilibrium position under the bow wave effect

when the receiver approaches the drogue. It is demonstrated in

Fig. 8d that the docking relative velocity stays around−1.25 m∕s atthe docking moment. Because a good basis function is selected, the

velocity overshoot is small. The final successful three-dimensional

docking trajectory is illustrated in Fig. 9. The probe successfully

docks into the drogue in the end, and the solid red line shows the

motion of the moving drogue.

C. Simulation with Uncertainties and Disturbances

In this part, in order to evaluate the robustness of the proposed

hybrid TILC, the following system uncertainties and disturbances are

considered:Uncertainty 1: Add uncertainties to actuators bymultiplying actuatorinputs by a factor of 0.6, 0.8, 1.2, and 1.4, respectively.Uncertainty 2: Change the actual bow wave effect to 1.2 times themodeled bow wave effect.

Fig. 7 Comparison of convergence of terminal docking error betweenproposed TILC (controller 1) and two traditional TILC controllers.(Because rd � 0.305 m as shown in Fig. 2, a docking attempt is regarded

as successful if docking error is less than 0.25 m here.)

a) b)

c) d)Fig. 8 Iteration of the relative position and relative velocity between the drogue and the probe during the docking process.

m

m

m

Fig. 9 Successful three-dimensional docking trajectory of the probe and

the drogue.

Article in Advance / ENGINEERING NOTES 7

Dow

nloa

ded

by B

EIH

AN

G U

NIV

ER

SIT

Y o

n Se

ptem

ber

23, 2

019

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/1

.G00

3708

Page 8: Reliable Docking Control Scheme for Probe–Drogue Refueling ...rfly.buaa.edu.cn/doc/2019RenReliable.pdfEngineering Notes Reliable Docking Control Scheme for Probe–Drogue Refueling

Disturbance 3: Add a side wind disturbance to the atmosphericenvironment.Disturbance 4: Add the Dryden wind-turbulence model to the PDRsystem.

It is noteworthy that Uncertainties 1 and 2 and Disturbance 3 are

repetitive, whereas Disturbance 4 is stochastic and nonrepetitive. The

aforementioned four changes just apply to the plant, whereas the

controller design is still based on themodel built previously. Based on

Uncertainties 1 and 2 and Disturbances 3 and 4, three scenarios are

considered:Scenario 1: Only Uncertainty 1 is considered.Scenario 2: The actuator uncertainty factor is fixed to 0.6, andUncertainty 2 and Disturbance 3 are considered.Scenario 3: The actuator uncertainty factor is fixed to 0.6, andUncertainty 2 and Disturbances 3 and 4 are all considered.

Under these scenarios, the convergence of the terminal docking

error of the proposed TILC is depicted in Figs. 10 and 11. Figure 10

shows that the convergence speed of the proposed TILC decreases

when there exist actuator uncertainties. The bigger the actuator

uncertainty, the more the convergence speed decreases. Figure 11

shows that the docking error of the proposed TILC can still converge

to zero when there exist Uncertainties 1 and 2 and Disturbance 3,

which are repetitive.When there also exists the stochastic uncertainty

(Disturbance 4), the docking error cannot converge to zero, but it is

bounded. The proposed TILC can achieve successful docking at the

third attempt, which illustrates that the proposed controller can

guarantee the docking performance under these uncertainties anddisturbances.

D. Discussion

The simulation exhibits favorable results. The proposed TILCdoes not adopt a real-time tracking method to track the movingdrogue, and so the problem of the slow dynamics to track the fastdynamics is solved. Because a good basis function matrix isconstructed in the preparation stage, the overshoot at the dockingmoment is under control. Additionally, TILC is basically model freeand robust, which can achieve successful docking without knowingthe drogue dynamics and the bow wave model. The proposedcontroller can also provide successful docking in the presence ofsystem uncertainties and disturbances. These make the dockingprocess safer and more reliable.

VII. Conclusions

With the aim of achieving precise docking for the probe–droguerefueling system, a reliable control scheme with a iterative learningcontrol (TILC) docking control method is proposed to make thereceiver contact with the nonstationary drogue. The combination ofcontrol inputs and initial state values is chosen as the learning objectof TILC, and the convergence condition is derived. The simulationresults illustrate that the docking precision is guaranteed after a fewiteration cycles. Compared with two traditional TILC controllers, theproposed hybrid TILC gives the highest convergence speed.

Appendix: Proof of Theorem

First, define variables ~xk�t� ≜ xk�t� − xk−1�t�, ~uk�t� ≜ uk�t�−uk−1�t�, ~qk ≜ qk − qk−1, and ~ηk � ηk − ηk−1. Starting from ek�T�,one has

ek�T� � ek−1�T� − �C ~xk�T� (A1)

The next step is to calculate ~xk�t�. Integrating both sides of theequation

_xk�t� � Axk�t� � Buk�t� � ϕ� �yk�t�� � φk

gives

xk�t� � x0,k �Rt0�Axk�τ� � Buk�τ� � ϕ� �yk�τ�� � φk� dτ (A2)

Then,

~xk�t� � M2 ~ηk �Z

t

0

�A ~xk�τ� � B ~uk�τ� � ϕ� �yk�τ�� − ϕ� �yk−1�τ��

�φk − φk−1� dτ(A3)

According to the learning control law [Eq. (17)] and the learningupdate law [Eq. (18)], one has

~uk�t� � Ub�t� ~qk � Ub�t�L1ek−1�T�

and ~ηk � L2ek−1�T�. Then, Eq. (A3) becomes

~xk�t��M2L2ek−1�T��Z

t

0

�A ~xk�τ��BUb�τ�L1ek−1�T��ϕ� �yk�τ��

−ϕ� �yk−1�τ���φk−φk−1�dτ (A4)

Substituting ~xk�T� into Eq. (A1) gives

ek�T� ��I3 − �CM2L2 −

ZT

0

�CBUb�τ�L1dτ

�ek−1�T�

− �C

ZT

0

�A ~xk�τ��ϕ� �yk�τ��−ϕ� �yk−1�τ���φk −φk−1�dτ (A5)Fig. 11 Convergence of the terminal docking error when there existuncertainties and disturbances (Scenario 2 and Scenario 3).

Fig. 10 Convergence of the terminal docking error when there existactuator uncertainties (Scenario 1).

8 Article in Advance / ENGINEERING NOTES

Dow

nloa

ded

by B

EIH

AN

G U

NIV

ER

SIT

Y o

n Se

ptem

ber

23, 2

019

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/1

.G00

3708

Page 9: Reliable Docking Control Scheme for Probe–Drogue Refueling ...rfly.buaa.edu.cn/doc/2019RenReliable.pdfEngineering Notes Reliable Docking Control Scheme for Probe–Drogue Refueling

where I3 ∈ R3×3 is an identity matrix. According to Assumptions 2and 3, taking the norm on both sides of Eq. (A5) results in

kek�T�k ≤ αkek−1�T�k �Z

T

0

�k �CAk � lϕk �Ck2�k ~xk�τ�kdτ

�DTk �Ck (A6)

where

α ���I3 − �CM2L2 −

ZT

0

�CBUb�τ�L1dτ��

The next step is to calculate

k ~xk�t�k

Taking the norm on both sides of Eq. (A4) yields

k ~xk�t�k ≤ kM2L2ek−1�T�k �Z

t

0

��kAk � lϕk �Ck�k ~xk�τ�k

� kBUb�τ�L1kkek−1�T�k� dτ�DT (A7)

Applying the Gronwall–Bellman inequality [17] to Eq. (A7) resultsin

k ~xk�t�k ≤ �kM2L2ek−1�T�k �DT�e�kAk�lϕk �Ck�t

�Z

t

0

e�kAk�lϕk �Ck��t−τ�kBUb�τ�L1kkek−1�T�k dτ

≤ β�t�kek−1�T�k �DTe�kAk�lϕk �Ck�t (A8)

where

β�t� � kM2L2ke�kAk�lϕk �Ck�t �Z

t

0

e�kAk�lϕk �Ck��t−τ�kBUb�τ�L1kdτ

Substituting Eq. (A8) into Eq. (A6) gives

kek�T�k ≤ �α� γ�kek−1�T�k � ε (A9)

where

γ �Z

T

0

�k �CAk � lϕk �Ck2�β�τ�dτ

ε � DTk �Ck �Z

T

0

�k �CAk � lϕk �Ck2�DTe�kAk�lϕk �Ck�τdτ

When

α� γ < 1

compressed mapping holds for inequality (A9), and so

kek�T�k →ε

1 − α − γ

as k → ∞.

Acknowledgment

This work was supported by the National Natural ScienceFoundation of China under grant 61473012.

References

[1] Thomas, P. R., Bhandari, U., Bullock, S., Richardson, T. S., and DuBois, J. L., “Advances in Air to Air Refuelling,” Progress in AerospaceSciences, Vol. 71, Nov. 2014, pp. 14–35.doi:10.1016/j.paerosci.2014.07.001

[2] Nalepka, J. P., and Hinchman, J. L., “Automated Aerial Refueling:Extending the Effectiveness of Unmanned Air Vehicles,” AIAA

Modeling and Simulation Technologies Conference and Exhibit, AIAAPaper 2005-6005, Aug. 2005,doi:10.2514/6.2005-6005

[3] Bhandari, U., Thomas, P. R., Bullock, S., Richardson, T. S., and duBois,J., “Bow Wave Effect in Probe-and-Drogue Aerial Refuelling,” AIAA

Guidance, Navigation, and Control Conference, AIAA Paper 2013-4695, Aug. 2013,doi:10.2514/6.2013-4695

[4] “ATP-56(B) Air-to-Air Refuelling,” NATO TR ATP 3.3.4.2, Brussels,Belgium, 2010.

[5] Tandale, M. D., Bowers, R., and Valasek, J., “Trajectory TrackingController for Vision-Based Probe and Drogue Autonomous AerialRefueling,” Journal of Guidance, Control, and Dynamics, Vol. 29,No. 4, 2006, pp. 846–857.doi:10.2514/1.19694

[6] Fravolini, M. L., Ficola, A., Campa, G., Napolitano, M. R., and Seanor,B., “Modeling and Control Issues for Autonomous Aerial Refueling forUAVs Using a Probe-Drogue Refueling System,” Aerospace Science

and Technology, Vol. 8, No. 7, 2004, pp. 611–618.doi:10.1016/j.ast.2004.06.006

[7] Valasek, J., Gunnam, K., Kimmett, J., Junkins, J. L., Hughes, D., andTandale, M. D., “Vision-Based Sensor and Navigation System forAutonomous Air Refueling,” Journal of Guidance, Control, and

Dynamics, Vol. 28, No. 5, 2005, pp. 979–989.doi:10.2514/1.11934

[8] Kimmett, J., Valasek, J., and Junkins, J. L., “Vision Based Controller forAutonomous Aerial Refueling,” Proceedings of the 2002 InternationalConference on Control Applications, Vol. 2, IEEE Publ., Piscataway,NJ, Sept. 2002, pp. 1138–1143.doi:10.1109/CCA.2002.1038765

[9] Su, Z., Wang, H., Shao, X., and Yao, P., “Autonomous AerialRefueling Precise Docking Based on Active Disturbance RejectionControl,” IECON 2015—41st Annual Conference of the IEEE

Industrial Electronics Society, IEEE Publ., Piscataway, NJ, Nov. 2015,pp. 4574–4578.doi:10.1109/IECON.2015.7392813

[10] Wang, J., Patel, V. V., Cao, C., Hovakimyan, N., and Lavretsky, E.,“Verifiable L1 Adaptive Controller for Aerial Refueling,” AIAA

Guidance, Navigation and Control Conference and Exhibit, AIAAPaper 2007-6313, Aug. 2007,doi:10.2514/6.2007-6313

[11] Wang, J., Patel, V. V., Cao, C., Hovakimyan, N., and Lavretsky, E.,“Novel L1 Adaptive Control Methodology for Aerial Refueling withGuaranteed Transient Performance,” Journal of Guidance, Control, andDynamics, Vol. 31, No. 1, 2008, pp. 182–193.doi:10.2514/1.31199

[12] Valasek, J., Famularo, D., and Marwaha, M., “Fault-Tolerant AdaptiveModel InversionControl for Vision-BasedAutonomousAir Refueling,”

Journal of Guidance, Control, and Dynamics, Vol. 40, No. 6, 2017,

pp. 1336–1347.

doi:10.2514/1.G001888[13] Wang,H., Dong,X., Xue, J., and Liu, J., “DynamicModeling of aHose-

Drogue Aerial Refueling System and Integral Sliding Mode

Backstepping Control for the Hose Whipping Phenomenon,” Chinese

Journal of Aeronautics, Vol. 27, No. 4, 2014, pp. 930–946.

doi:10.1016/j.cja.2014.06.010[14] Dai, X.,Wei, Z., andQuan, Q., “Modeling and Simulation of BowWave

Effect in Probe and Drogue Aerial Refueling,” Chinese Journal of

Aeronautics, Vol. 29, No. 2, 2016, pp. 448–461.doi:10.1016/j.cja.2016.02.001

[15] Wei, Z.-B., Dai, X., Quan, Q., and Cai, K.-Y., “Drogue Dynamic ModelUnderBowWave inProbe-and-DrogueRefueling,” IEEETransactions on

Aerospace and Electronic Systems, Vol. 52, No. 4, 2016, pp. 1728–1742.

doi:10.1109/TAES.2016.140912[16] Xu, J. X., Chen, Y., Tong, H. L., and Yamamoto, S., “Terminal Iterative

Learning Control with an Application to RTPCVD Thickness Control,”Automatica, Vol. 35, No. 9, 1999, pp. 1535–1542.doi:10.1016/S0005-1098(99)00076-X

[17] Ahn, H. S., Moore, K. L., and Chen, Y. Q., Iterative Learning Control:Robustness and Monotonic Convergence in the Iteration Domain,

Springer–Verlag, London, 2007, pp. 4–25.

doi:10.1007/978-1-84628-859-3[18] Dai, X., Quan, Q., Ren, J., Xi, Z., and Cai, K.-Y., “Terminal Iterative

Learning Control for Autonomous Aerial Refueling Under

Aerodynamic Disturbances,” Journal of Guidance, Control, and

Dynamics, Vol. 41, No. 7, 2018, pp. 1577–1584.

doi:10.2514/1.G003217

Article in Advance / ENGINEERING NOTES 9

Dow

nloa

ded

by B

EIH

AN

G U

NIV

ER

SIT

Y o

n Se

ptem

ber

23, 2

019

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/1

.G00

3708

Page 10: Reliable Docking Control Scheme for Probe–Drogue Refueling ...rfly.buaa.edu.cn/doc/2019RenReliable.pdfEngineering Notes Reliable Docking Control Scheme for Probe–Drogue Refueling

[19] Stevens, B. L., and Lewis, F. L., Aircraft Control and Simulation:

Dynamics, Controls Design, and Autonomous Systems, Wiley,Hoboken, NJ, 2015, pp. 287–322.doi:10.1002/9781119174882

[20] Stepanyan, V., Lavretsky, E., and Hovakimyan, N., “Aerial RefuelingAutopilot Design Methodology: Application to F-16 Aircraft Model,”AIAA Guidance, Navigation, and Control Conference and Exhibit,AIAA Paper 2004-5321, Aug. 2004,doi:10.2514/6.2004-5321

[21] Ro, K., and Kamman, J. W., “Modeling and Simulation of Hose-Paradrogue Aerial Refueling Systems,” Journal of Guidance, Control,and Dynamics, Vol. 33, No. 1, 2010, pp. 53–63.doi:10.2514/1.45482

[22] Dogan, A., Blake, W., and Haag, C., “Bow Wave Effect in AerialRefueling: Computational Analysis andModeling,” Journal of Aircraft,Vol. 50, No. 6, 2013, pp. 1856–1868.doi:10.2514/1.C032165

[23] Dibley, R. P., Allen, M. J., and Nabaa, N., “Autonomous AirborneRefueling Demonstration Phase I Flight-Test Results,” AIAA

Atmospheric Flight Mechanics Conference and Exhibit, AIAAPaper 2007-6639, Aug. 2007,doi:10.2514/6.2007-6639

[24] Fravolini, M., Ficola, A., Napolitano, M., Campa, G., and Perhinschi,M., “Development of Modeling and Control Tools for Aerial Refuelingfor UAVs,” AIAA Guidance, Navigation, and Control Conference and

Exhibit, AIAA Paper 2003-5798, Aug. 2003,doi:10.2514/6.2003-5798

[25] Ochi, Y., and Kominami, T., “Flight Control for Automatic AerialRefueling via PNG and LOS Angle Control,” AIAA Guidance,

Navigation, and Control Conference and Exhibit, AIAA Paper 2005-6268, Aug. 2005,doi:10.2514/6.2005-6268

[26] Wei, Z.-B., Quan, Q., and Cai, K.-Y., “Output Feedback ILC for a Classof Nonminimum Phase Nonlinear Systems with Input Saturation: AnAdditive-State-Decomposition-Based Method,” IEEE Transactions on

Automatic Control, Vol. 62, No. 1, 2017, pp. 502–508.doi:10.1109/TAC.2016.2555081

[27] Nesić, D., and Teel, A. R., “Sampled-Data Control of NonlinearSystems: An Overview of Recent Results,” Perspectives in Robust

Control, Vol. 268, Lecture Notes in Control and Information Sciences,Springer, New York, 2001, pp. 221–239.doi:10.1007/BFb0110609

[28] “A Reliable Docking Control Scheme for Probe-Drogue Refueling,”https://youtube/gTqAhAPlNvQ [accessed 10 June 2019].

10 Article in Advance / ENGINEERING NOTES

Dow

nloa

ded

by B

EIH

AN

G U

NIV

ER

SIT

Y o

n Se

ptem

ber

23, 2

019

| http

://ar

c.ai

aa.o

rg |

DO

I: 1

0.25

14/1

.G00

3708