Performance Optimization Study of a Common... Vehicle Using a Legendre Pseudospectral ...

Performance Optimization Study of a Common Aero
Vehicle Using a Legendre Pseudospectral Method
by
Kimberley A. Clarke
B.S. Aerospace Engineering, Pennsylvania State University, 2001
Submitted to the Department of Aeronautics and Astronautics
in partial fulfillment of the requirements for the degree of
Master of Science in Aeronautics and Astronautics
at the
MASSACHUSETTS INSTITUTE OF TECHNOLOGY
June 2003
© Kimberley A. Clarke, MMIII. All rights reserved.
The author hereby grants to MIT permission to reproduce and
distribute publicly paper and electronic copies of this thesis
document in whole or in part.
Author
...........................................
,epVtnnt ofAeronautics and Astronautics
May 23, 2003
Certified by....
..................
Anil V. Rao, Ph.D.
Senior Member of the Technical Staff
The Charles Stark Draper Laboratory, Inc.
Technical Supervisor
Certified by ............
................
Jonathan P. How, Ph.D.
Professor, Department of Aeronautics and Astronautics
Thesis Advisor
......................
Accepted by .........
Edward M. Greitzer, Ph.D.
H.N. Slater Professor of Aeronautics and Astronautics
Chair, Committee on Graduate Students
MASSACHUSETTS INSTITUTE
OF TECHNOLOGY
AEROBRES
LIBRARIES
[This page intentionally left blank.]
Performance Optimization Study of a Common Aero Vehicle
Using a Legendre Pseudospectral Method
by
Kimberley A. Clarke
Submitted to the Department of Aeronautics and Astronautics
on May 23, 2003, in partial fulfillment of the
requirements for the degree of
Master of Science in Aeronautics and Astronautics
Abstract
The problem of performance optimization of a Common Aero Vehicle (CAV) is
considered. In particular, the CAV is modeled as an unpowered high lift-to-drag
ratio Earth penetrating re-entry vehicle. The CAV mission design problem is to
determine a steering command that takes the CAV from a known initial state to
a target on the surface of the Earth while optimizing a given performance index
and satisfying all of the constraints imposed during flight. The CAV mission design problem is formulated as an optimal control problem. The optimal control
problem is transformed to a nonlinear programming problem using a Legendre Pseudospectral Method. The nonlinear programming problem is then solved
using a sparse nonlinear optimization algorithm. Once a solution to the CAV
mission design problem is obtained, three main studies are conducted. First,
the accuracy of the Legendre Pseudospectral Method is evaluated and the desirable characteristics of the solution to the CAV mission design problem are
defined. Second, a study is conducted to demonstrate the effect of the parameters on the performance of the CAV. This parametric study demonstrates the
use of the Legendre Pseudospectral method as a design tool and provides insight to the behavior of the CAV. Third, a preliminary investigation is performed
on the real-time application of the Legendre Pseudospectral Method to the CAV
mission design problem. This real-time analysis includes an assessment of the
robustness of the solution to realistic environmental disturbances.
Technical Supervisor: Anil V. Rao, Ph.D.
Title: Senior Member of the Technical Staff
The Charles Stark Draper Laboratory, Inc.
Thesis Advisor: Jonathan P. How, Ph.D.
Title: Professor, Department of Aeronautics and Astronautics
3
[This page intentionally left blank.]
Acknowledgments
I am very grateful for everyone who has made the completion of my masters
degree possible. Without the unique network of the Draper staff, MIT faculty,
family, and friends, I would not be where I am today.
I would like to thank the Charles Stark Draper Laboratory for providing me
with the funding and support necessary for the completion of my degree from
MIT. In particular I would like to thank the GCB2 staff as well as the Education
Office. I would also like to individually thank Doug Fuhry and Anil Rao. Doug,
even though we only worked together briefly, I learned a lot from you. Special
thanks to Anil Rao for the guidance and support not only on my project, but also
with my job search. It has been two years of laughter, frustration, and growth.
Oh and I will especially miss your corny, but funny engineering jokes.
Thanks to the MIT professors and the entire Aero/Astro staff. The most
amazing part about studying at MIT is the intelligence of the professors and
their first hand experiences that are integrated into the classroom. Professor
How, I am grateful for your patience and thank you for being my thesis advisor.
Furthermore, I would like to thank professors Battin and Ramnath for being a
reference for me on job applications.
Now to my MIT friends, these past two years have been years of personal
growth. Each and everyone of you has expanded my horizon and I thank you for
that. In particular, I would like to thank the "forget your lunch Friday" Draper
crew who I have directly shared the past two years with. Christine, thanks for the
bathroom breaks, Thursday night dinners, and most importantly, the girl time.
Jen, thanks for the kick-board chats, Friday morning breakfast, and trips to donate blood. "Coach" Geoff, thanks for bringing out the child in me by playing
games while waiting in line for rides at "Great Adventure" and by stopping on a
six hour car trip to ride go-carts. We have come a long way since Texas. Stephen,
thank you for being my e-mail buddy, introducing me to Strongbad, and normalcy. Dave Benson, thank you for attending our review sessions, teaching me
5
how to make bread, and being my party buddy. Daveed, thanks for the pingpong breaks, late night e-mails, and 6 a.m. breakfast. Heidi, thanks for listening
to my complaints, sailing, and your sanity. Stuart, thanks for all of your help
and good luck with your music career. To the first year Draper fellows, Dave,
Steve, and Drew, thanks for the fresh faces and I wish you the best of luck.
I would also like to thank those who rescued me from my graduate studies.
To my roommates, Sarah and Libby, thanks for providing me with food, clean
clothes, and clean dishes these past couple of months. To "the girls" from Penn
State, I want to thank you for your open ears and for understanding why I have
not kept in touch recently. Nick, thanks for picking me up when I lost motivation and always knowing the right thing to say. Preston, thanks for moving to
CT, leading me through the trees on ski trips, spooning, and most importantly,
for making me laugh. Parker, thanks for giving me something to smile about
despite my frustrations with writing my thesis. Last but not least, I would like
to thank my family for their love and support. Thanks for putting up with my
moods and helping me through these past two years. I could not have asked for
more.
This thesis was prepared at The Charles Stark Draper Laboratory, Inc., under
Internal Research and Development, Project Advanced Guidance and Trajectory
Design, 03-2-5037.
Publication of this thesis does not constitute approval by Draper or the sponsoring agency of the findings or conclusions contained herein. It is published
for the exchange and stimulation of ideas.
Kim berley A. Clarke..............................
6
.............
Contents
17
1 Introduction
1.1
Motivation .........................................
17
1.2 Common Aero Vehicle ............................
19
1.3 Mission Design Problem ..........................
. 21
1.4 Mission Design Approach ...........................
23
1.5 Research Objectives ..............................
24
1.6 Thesis Overview ................................
25
2 Common Aero Vehicle Problem Formulation
2.1 Overview ..........................................
2.2 Dynamic Model ................................
2.2.1 Coordinate System
..........................
2.2.2 Equations of Motion .............................
27
27
. 28
28
29
2.3 Boundary Conditions .............................
34
2.4 Path Constraints ................................
35
Perform ance Index ..............................
. 36
2.5
3 Optimal Control: Problem Formulation and Solution Methods
3.1
39
39
Overview ..........................................
3.2 Optimal Control Problem .............................
40
3.2.1
Dynamics ................................
40
3.2.2
Path Constraints ...............................
41
3.2.3
Boundary Conditions ............................
41
7
. . . . . . . . . . . . . . . . . . . . . . . . . .
3.2.4
Performance Index
3.2.5
General Form of an Optimal Control Problem ..........
42
.42
3.3 Methods for Solving Optimal Control Problems . . . . . . . . . . . . .
43
Analytic Methods for Solving Optimal Control Problems . . .
43
3.3.1
3.3.2 Numerical Methods for Solving Optimal Control Problems .
48
3.4 Direct Transcription of Optimal Control Problem Via Pseudospectral Methods
3.5
50
..................................
3.4.1 Pseudospectral Methods ..........................
50
3.4.2 Legendre Pseudospectral Method . . . . . . . . . . . . . . . . .
52
Summary of Optimal Control ........................
59
4 Numerical Optimization Study of the Common Aero Vehicle Problem
Using the Legendre Pseudospectral Method
61
4.1 Overview ..........................................
61
4.2 Discretization via the Legendre Pseudospectral Method ........
62
. 62
4.2.1 Optimization Vector ........................
4.2.2 Discretization of the Dynamic Constraints ............
.65
4.2.3 Discretization of the Path Constraints and the Terminal Con66
straints .....................................
68
4.2.4 Discretization of the Performance Index .............
4.3 Common Aero Vehicle Nonlinear Programming Problem .......
4.3.1
.69
Summary of the Common Aero Vehicle Nonlinear Program69
ming Problem .................................
4.3.2 Structure of the Common Aero Vehicle Nonlinear Program-
71
ming Problem .................................
4.3.3 Scaling of the Common Aero Vehicle Nonlinear Programming
Problem ......................................
72
4.4 Numerical Optimization via SNOPT .......................
74
4.4.1 Description of SNOPT ...........................
75
4.4.2 User Requirements and Options for SNOPT ...........
8
.76
77
4.5 Numerical Optimization Study .......................
4.5.1
Specification of the Required Inputs . . . . . . . . . . . . . . . .
77
4.5.2
Determination of an Adequate Number of Nodes
. . . . . . .
81
4.5.3 Choice of Weighting Factors Used in the Performance Index .
88
Summary of the Numerical Optimization Study . . . . . . . . . . . . .
97
5 Parametric Optimization Study of the Common Aero Vehicle Problem
99
4.6
5.1
99
Overview ..........................................
5.2 Key Features of the Trajectory and Control
5.3
100
...............
Effects of Dynamic Pressure on the Trajectory and Control
.....
.105
5.4 Effects of the Stagnation Point Heat Load on the Trajectory and
Control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
5.5 Effects of the Lift-to-Drag Ratio on the Trajectory and Control
5.6
117
. .
123
Summary of the Parametric Study .....................
6 Preliminary Study of the Real-Time Application of the Legendre Pseu125
dospectral Method
6.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125
6.2 Common Aero Vehicle Flight Simulation . . . . . . . . . . . . . . . . . 126
6.3 Assessment of the Accuracy of the Legendre Pseudospectral Method 129
. 134
6.4 Sum m ary....................................
137
7 Conclusions
7.1
.137
Summary......................................
7.2 Conclusions ..................................
. 139
A Notation
143
B Matrix Derivatives
145
C Constraint Jacobian and Objective Gradient Derivation
149
C.1 Constraint Jacobian .................................
150
C.2 Objective Gradient ..................................
176
9
D Initial Guess
179
E Earth Relative Downtrack and Crosstrack
183
10
List of Figures
1-1 Common Aero Vehicle Mission Profile ...................
.
22
2-1 Earth-Centered Earth-Fixed Coordinate System . . . . . . . . . . . . .
28
2-2 Free Body Diagram of the Common Aero Vehicle .............
31
2-3
..............
Bank Angle .....................
3-1 Distribution of LGL points for a given number of nodes
33
.......
54
4-1 Sparsity Pattern of the Common Aero Vehicle Nonlinear Programming Problem .....................................
73
4-2 Angle of Attack vs. Time for M=(25, 50, 75, 100) .............
83
4-3 Bank Angle vs. Time for M=(25, 50, 75, 100) ................
83
4-4 Earth Relative Downtrack Distance vs. Earth Relative Crosstrack
Distance for 50 Nodes ...............................
84
4-5 Earth Relative Downtrack Distance vs. Earth Relative Crosstrack
84
Distance for 75 Nodes ...............................
4-6 Earth Relative Downtrack Distance vs. Earth Relative Crosstrack
Distance for 100 Nodes ............................
85
4-7 Earth Relative Speed vs. Time for 50 Nodes ................
85
4-8 Earth Relative Speed vs. Time for 75 Nodes ................
86
4-9 Earth Relative Speed vs. Time for 100 Nodes
. . .. . . . . .. . . . .
4-10Angle of Attack vs. Time for ki = (0.1, 1.0, 10, 100), k 2 = k3= 1.0
4-11 Angle of Attack Rate vs. Time for kI = (0.1, 1.0, 10, 100), k 2
86
90
=k3
91
1.0.............................................
11
4-12 Bank Angle Rate vs. Time for ki
(0.1, 1.0, 10, 100), k 2 =k
4-13 Angle of Attack vs. Time for k 2
(0.1, 1.0, 10, 100), ki
4-14 Angle of Attack Rate vs. Time k 2
4-15 Bank Angle Rate vs. Time for k2
4-16Angle of Attack vs. Time for k 3
=
=
=
1.0 .
91
1.0
92
3
=k3
(0.1, 1.0, 10, 100), ki
1.0
93
1.0 .
93
1.0
94
=
(0.1, 1.0, 10, 100), k =k
(0.1, 1.0, 10, 100), ki
3 =
k2
=
.
.
4-17Angle of Attack Rate vs. Time for k 3 = (0.1, 1.0, 10, 100), ki = k2
1.0. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
95
4-18 Bank Angle Rate vs. Time for k3 = (0.1, 1.0, 10, 100), ki = k 2 = 1.0.
95
5-1 Altitude vs. Energy for M=100, ki = k 2 = 1,k 3 = 0.1 . .
100
- . . .. . . .
5-2 Altitude and Dynamic Pressure vs. Time for M=100, ki = k2 =
1,k 3 = 0.1 . . . . .
.. . . .
- - -.
-.
..
. . -.
.
. . -.
.. .
101
5-3 Earth Relative Crosstrack Distance vs. Earth Relative Downtrack
Distance for M=100, ki = k2 = 1,k 3 = 0.1 .................
5-4 Angle of Attack vs. Time for M=100, k 1 =
5-5 Bank Angle vs. Time for M=100, ki = k2
1,k 3 = 0.1
102
......
1,k3 = 0.1 .........
103
104
5-6 Altitude vs. Energy for qmin = (11.97, 23.94, 35.91, 47.88) kPa . . . . 106
5-7 Earth Relative Speed vs. Time for qmin = (11.97,23.94,35.91,47.88)
kPa . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 107
5-8 Earth Relative Crosstrack Distance vs. Earth Relative Downtrack
Distance for qmin = (11.97, 23.94, 35.91,47.88) kPa ..........
108
5-9 Angle of Attack vs. Time for qmin = (11.97, 23.94, 35.91, 47.88) kPa
108
5-10Value of the Performance Index vs. Minimum Allowable Dynamic
Pressure for qmin = (11.97, 23.94, 35.91, 47.88) kPa ..........
109
5-11 Total Heat Load vs. Time for Qmnax = (1100, 1300, 1400,1700,2000,2300)
MJ/m 2
........
. . . . . . . . . . . . . . . . . . . . . . . . . . . . .... . . . . . .
110
5-12 Heating Rate vs. Time for Qmax = (1100, 1300, 1400, 1700, 2000, 2300)
MJ/m 2
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . .
11
5-13Altitude vs. Time for Qrmax = (1100, 1300,1400,1700,2000,2300)
MJ/m 2
.......
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . .
12
112
5-14 Earth Relative Speed vs. Time for Qmax = (1100, 1300, 1400, 1700, 2000, 2300)
M J/m2
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
5-15 Earth Relative Crosstrack Distance vs. Earth Relative Downtrack
Distance for Qmnax = (1100, 1300, 1400, 1700,2000,2300) MJ/m
2
. .
114
5-16Angle of Attackvs. Time for Qmax = (1100, 1300, 1400, 1700,2000,2300)
MJ/m 2
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ..
115
5-17 Bank Angle Rate vs. Time for Qmax = (1100, 1300,1400,1700,2000,2300)
M J/m 2
. . .. . . .. . .. . .. . . . . .. . . . . .. . . . . . . .. . . . .
115
5-18 Value of the Performance Index vs. Qmax for Qmax = (1100, 1300, 1400, 1700,2000, 230
M J/m 2
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
5-19 Altitude vs. Energy for (L/D)max = (2, 2.1, 2.2, 2.3, 2.4, 2.5) . . . . . . 119
5-20 Earth Relative Crosstrack Distance vs. Earth Relative Downtrack
Distance for (L/D)max = (2, 2.1, 2.2, 2.3, 2.4, 2.5) . . . . . . . . . . . . 120
5-21 Earth Relative Speed vs. Time for (L/D)max = (2, 2.1, 2.2, 2.3, 2.4,2.5) 120
5-22 Stagnation Point Heat Load vs. Time for (L/D)max = (2, 2.1, 2.2, 2.3, 2.4,2.5)121
5-23 Angle of Attack vs. Time for (L /D)max = (2, 2.1, 2.2, 2.3, 2.4, 2.5) . . 121
5-24 Value of the Performance Index vs. (L/D)max for (L /D)max = (2, 2.1, 2.2, 2.3, 2.4, 2.5)12
6-1 Flight Simulation Block Diagram . . . . . . . . . . . . . . . . . . . . . . 127
D-1 Spherical Representation of Position with Respect to a Cartesian
ECEF Coordinate System . . . . . . . . . . . . . . . . . . . . . . . . . . . 180
D-2 Spherical Representation of Velocity with Respect to a Set of Axes
Defined in the Cartesian ECEF Coordinate System . . . . . . . . . . . 181
E-1 Earth Relative Downtrack Plane and Earth Relative Crosstrack Plane 184
13
[This page intentionally left blank.]
List of Tables
4.1
Numerical Values Used for Numerical Optimization ..........
4.2 Numerical Values for the Bounds on the Optimization Variables
4.3
.78
79
.
Numerical Values for the Bounds on the Path Constraints ......
.79
4.4 Options Set in SNOPT ................................
4.5
80
Terminal errors produced by integration for M = (50, 75, 100) . ...
87
4.6 Results from Varying the Weighting Factors (ki, k 2 , k 3 ) . . . . . . . .
6.1
Terminal Errors from the Simulation with Perturbations
.......
97
.133
6.2 Computational Performance of the Simulation with Perturbations . 134
D.1 Values Used to Generate an Initial Guess .................
15
179
[This page intentionally left blank.]
Chapter 1
Introduction
1
Motivation
Gulf War II has brought attention to the importance of space applications on
current warfare tactics. GPS navigation, high resolution imagery, and near-real-
time missile detection via communication satellites are a few of the many critical
technological capabilities that result from space applications. U.S. and coalition
forces can gain significant advantages on the battlefield from space-based capabilities. At this point, it is evident that the U.S. can not afford the loss of space
assets or even the current time delay of months to fix or replace failed systems
[24]. Recognizing this, the Air Force Space Command (AFSPC) has shifted its
attention to "quick-response-space". The AFSPC is currently conducting an Operationally Responsive Spacelift Analysis of Alternatives (ORS/AOA) to address
the issue of responsiveness in terms of space applications [241. This analysis will
evaluate the application of ORS to military space assets for force enhancement,
space support, force application, and counterspace. In particular, the United
States currently has a high level of interest in developing global power projection capabilities because of instabilities in the international environment. While
it may be desirable to place armed forces in enemy territory, such a strategy
may be difficult to implement given current conditions. Consequently, it may
be beneficial to conduct military operations remotely. The ultimate goal of the
17
AFSPC is to develop the abilities to launch satellites within hours or days of the
given command, quickly repair a damaged system in space, and strike an enemy
anywhere on the globe in less than one hour with conventional weapons [241.
Space-based global strike refers to the ability to project power with conventional weapons from the United States to any point on the globe in less than one
hour. With this new desired ability comes the challenge of demonstrating the
technological feasibility of such an approach through vehicles capable of delivering the required conventional weapons. U.S. political and military leaders are
re-examining the entire realm of space-based capabilities along with strategic
weapons for new counteractive tactics [241. While B-2 bombers have demonstrated the ability to conduct global reach operations, the Air Force is not investing in more long-range bombers [25]. The existing intercontinental ballistic
missiles (ICBMs) and sea-launched ballistic missiles (SLBMs) are also capable of
striking any point on the globe, but they carry nuclear armed weapons. In order
to project power without the use of nuclear weapons, ICBMs and SLBMs must
be modified to carry conventional weapons and are referred to as conventional
ballistic missiles (CBMs). However, rearming the current ICBMs and SLBMs would
jeopardize the incredible accuracy and reliability of the already existing systems.
In addition, CBMs can easily be mistaken as nuclear armed weapons [251. Therefore, advanced reusable space launch vehicles are being considered as a means
for global projection.
The distance and speed requirements associated with space-based global
strike capabilities necessitates the design of a vehicle with space launch and
Earth reentry capabilities [23]. Proposed space-based global strike vehicles include the Space Operations Vehicle (SOV) and the Common Aero Vehicle (CAV).
The SOV is a fully reusable launch vehicle capable of flying sub-orbital pop-up
trajectories. This type of trajectory allows the SOV to carry a significantly greater
amount of weight through space. For example, an SOV capable of putting 6,000
lbs into orbit can carry 40,000 lbs through space in a pop-up trajectory [251.
There are four basic capabilities behind the SOV motivating its design. First is
18
the desire for aircraft like characteristics including the reliability and maintainability of today's aircrafts [8]. The second desired capability is launch on short
notice which corresponds to the feasibility of near-real-time to real-time com-
pletion of missions. The last two capabilities, expeditious deployment rates and
rapid transition between missions, reduce the cost of SOVs [8]. The CAV, currently being considered in the ORS/AOA, is a reentry vehicle that departs from
a launch vehicle or other booster and returns to Earth with the purpose of delivering weapons, payloads, or cargo to a specified location. It is essentially a
shell weighing 1300-2400 lbs (fully loaded) with a cross-range maneuverability
of at least 2400 Nautical Miles [23]. Yet to be determined is whether or not to design a steerable ballistic CAV similar to the existing Maneuvering Reentry Vehicle
(MaRV), Advanced Maneuvering Reentry Vehicle (AMaRV), or High Performance
Maneuvering Reentry Vehicle (HpMaRV) concepts. Another design option is a
fully powered steerable CAV with considerable maneuverability both in space
and in the Earth's atmosphere. Nonetheless, the purpose of the CAV is to create
rapid response for global reach from within the continental United States and to
operate under abnormal conditions [23].
1.2
Common Aero Vehicle
The deployment of a CAV involves launch, atmospheric reentry, release of cargo,
payload, or weapon depending on the mission, and disposal of the CAV. Three
different launch vehicles are currently being considered for the CAV: an expendable ground launched rocket, an expendable air launched rocket, and the SOV.
Both the expendable ground launched and air launched rockets will most likely
However, in the long-term the SOV is
be utilized for near term applications.
most desirable because of its aircraft-like operability [231. Regarding launch scenarios, the following two trajectories are being considered: a pop-up trajectory
and an orbital trajectory. As mentioned above, the pop-up trajectory allows for
more weight to be carried by the launch vehicle. The orbital trajectory refers
19
to either a low Earth orbit (LEO) or one that orbits the Earth once. After the
CAV is launched, it must re-enter the Earth's atmosphere, reduce speed, and rely
on guidance to steer the vehicle to a specified release point. At this point the
cargo, payload, or weapon is released for the purpose of either force application
or force enhancement. Force application is utilized during combat before U.S.
forces arrive in attempt to stall adversarial advances. For example, Small Smart
Munitions can locate and identify stationary or mobile targets. Force application is also used to attack highly valued, heavily protected, or time critical targets. Examples of CAV payloads used for such missions include a single Unitary
Penetrator that is used to destroy deeply-buried targets such as underground
bunkers and storage facilities; an Agent Defeat Weapon that neutralizes biological or chemical weapons; Highly Effective Area Attack Submunitions that can
attack multiple dispersed targets; and precision area attack weapons such as
Low Cost Autonomous Attack Systems used to attack moving targets [23]. Force
enhancement is used to strengthen and provide services for military operations.
For example, Unmanned Aerial Vehicles are used for reconnaissance or surveillance purposes and CAVs may provide a means of delivering urgent cargoes to
remote locations in near-real-time [23]. Finally, the CAV may either be reusable
or expendable. If the CAV is reusable, it must return to a suitable recovery area
and if it is expendable, it must be destroyed without a trace.
Many technical challenges stand between concept and development of the
CAV. These challenges include designing a thermal protection, a propulsion system, a guidance and control system, and a payload release system. The CAV
must be able to structurally withstand the forces created from rapid acceleration and deceleration as well as excessive heat build-up. The propulsion system
is crucial for reentry positioning and must be safe and reliable. The guidance
and control system must be able to accurately and reliably guide the CAV using inertial navigation and/or GPS. Finally, the CAV must be able to release the
payload without disturbing its flight [23].
20
1.3
Mission Design Problem
The particular problem considered in this thesis is the application of a CAV as
a kinetic energy weapon, where instead of using explosives, its kinetic energy
upon impact is used to destroy a ground target. Consequently, the CAV is itself
the weapon and its deployment is simplified to launch and atmospheric reentry. An unpowered bank-to-turn high lift-to-drag ratio vehicle model is chosen.
Furthermore, the particular application is that of an Earth penetrator used to
strike hardened deeply-buried targets (HDBTs). The high maneuverability of the
CAV allows for contingencies such as avoiding adversarial anti-missile missiles
(AMM's) or in flight re-targeting. The mission design problem is to steer the CAV
from a fully specified initial state at or near atmospheric entry to a specified
target on the surface of the Earth.
The mission profile for the CAV considered, as shown in Fig. 1-1, consists
of atmospheric entry, a skip maneuver, a glide maneuver, speed depletion, and
Earth impact. The skip maneuver is characterized by a rise in altitude that en-
ables the vehicle to fly in a low density region in order to make the required
range. During the skip maneuver, control authority is lost as the vehicle rises
in altitude. Thus, it is desirable to prevent the vehicle from exiting the Earth's
atmosphere. In order to restrict the maximum altitude attained during the skip
maneuver, the initial condition is taken to be at the point after atmospheric
entry, but before the altitude of the vehicle starts to increase. The maximum
altitude attained during the skip maneuver is limited by imposing a minimum
allowable dynamic pressure constraint. During a glide maneuver, the vehicle
flies along a trajectory without using much control effort. As the vehicle nears
the target, it must deplete speed in order to meet the large but bounded terminal speed requirements for striking HDBTs. Finally, the mission terminates
when the vehicle strikes the target on the surface of the Earth. Typical terminal
conditions associated with HDBTs include position accuracy to within several
meters, a speed large enough for Earth penetration, but low enough so that the
21
Figure 1-1: Common Aero Vehicle Mission Profile
vehicle does not vaporize upon re-entry, and a nearly zero angle of incidence
[26]. These terminal conditions require that the vehicle approach the target with
negative lift. However, the CAV considered has one-sided angle of attack control
(i.e. the angle of attack must remain positive throughout flight). Thus, the vehicle
must rotate and fly upside-down in order to generate negative lift. Furthermore,
the natural behavior of the vehicle is to maintain a larger terminal speed and a
larger incidence angle at impact.
The terminal conditions associated with striking a HDBT pose a great challenge for the guidance and control system due to the conflict that arises between
high maneuverability and the need to achieve such tightly prescribed terminal
conditions. Early in-flight, the high maneuverability is desirable for both reachability and contingency plans; however, as the vehicle nears the target this maneuverability becomes a liability. Small errors in vehicle attitude can produce
extremely large errors in lift force that will, in turn, drive the vehicle away from
22
the desired target. Moreover, since this type of vehicle has one-sided angle of attack control, uncertainties in the environment can further increase errors. Con-
sequently, the demands on the guidance and control systems increase greatly as
the vehicle approaches the target. In this particular application, the ability to
withstand unexpected events during flight is a critical requirement in meeting
the boundary conditions with extremely high accuracy. In an attempt to obtain a
solution robust to in-flight dispersions, it is beneficial to design a trajectory and
control that has as much control margin as possible. The control margin is the
magnitude of the difference between the actual control and the control limits.
In addition to satisfying the initial and terminal conditions, the vehicle has
thermal, structural, and operational constraints during re-entry. Thermal constraints include maximum limits on heating rate and total heat load, structural
constraints include maximum limits on sensed acceleration, and operational
constraints include limits on control authority (i.e. limits on steering and steering rate capability).
For any set of initial and terminal conditions, a wide range of feasible trajectories and controls may exist. In order to obtain the most desirable performance, it is preferable to choose a particular performance index and determine
the particular trajectory and control that optimizes this performance index. This
results in the need to determine an optimal mission plan. The optimal mission
planning problem is then described as follows: determine a steering command
as a function of time that takes the vehicle from a specified initial state to a
target on the surface of the Earth while optimizing the given performance index
and satisfying all of the constraints imposed during flight.
1.4
Mission Design Approach
The optimal mission planning problem described in the preceding section is an
optimal control problem [3, 171. In general, aerospace optimal control problems
are nonlinear and do not have analytic solutions. Consequently, a numerical
23
method must be used to obtain a solution to these optimal control problems.
Numerical methods for solving optimal control problems can be categorized as
either indirect methods or direct methods. Indirect methods solve a HamiltonianBoundary-Value-Problem (HBVP) which is often difficult to solve numerically [11].
Direct methods discretize the optimal control problem at particular time points
which leads to a nonlinear programming problem (NLP). The resulting NLP is
solved using one of the many available optimization algorithms. While direct
methods have a wider range of convergence, the control time histories are not
as accurate as those obtained via an indirect method [201. A method that combines the advantages of both indirect methods and direct methods is desirable.
Pseudospectralmethods of Ref. [12, 16] utilize an approach to solve optimal control problems that has positive attributes of both indirect and direct methods.
In a pseudospectral method, the optimal control problem is discretized at specified time points using a basis of global orthogonal polynomials. This discretization procedure provides an efficient transcription of the continuous-time optimal control problem to a NLP. The solution of the resulting NLP provides an
accurate approximation to the continuous-time optimal control problem. In this
thesis, the Legendre PseudospectralMethod of Refs. [7, 9, 10, 11] is applied to
the Common Aero Vehicle optimal mission planning problem.
1.5
Research Objectives
This thesis seeks to demonstrate the application of the Legendre Pseudospectral Method to the problem of performance optimization of the Common Aero
Vehicle (CAV). In doing so, the accuracy of the Legendre Pseudospectral method
is assessed as well as the desirable traits of the trajectory and control for the
CAV. Furthermore, a parametric study is conducted to illustrate the use of the
Legendre Pseudospectral Method as a design tool as well as to gain a better
understanding of the behavior of the CAV under discussion. Finally, a preliminary investigation is performed of the real-time application of the Legendre
24
Pseudospectral Method to the CAV. The accuracy of the Legendre Pseudospec-
tral Method is considered in regards to the simulation of the flight of the CAV
and the robustness of the control to vehicle and environmental perturbations is
considered.
1.6
Thesis Overview
Chapter 2 mathematically describes the mission design problem stated in section 1.3. The equations of motion which govern the flight of the Common Aero
Vehicle are derived and the known initial and terminal conditions are defined.
Also included in this chapter are the constraints imposed throughout the flight
of the vehicle and the development of a performance index which reflects the
goal of maximizing the control margin.
Chapter 3 provides the theory and motivation behind the mission design approach discussed in section 1.4. A formal definition of an optimal control problem is stated and from which it is seen that the CAV mission design problem
is an optimal control problem. Also included is a discussion of analytic and
numerical methods for solving optimal control problems, which motivates the
use of a pseudospectral method. This discussion leads to an overview of pseudospectral methods and is followed by a detailed description of the Legendre
Pseudospectral Method used to solve the CAV optimal control problem.
Chapter 4 demonstrates the application of the Legendre Pseudospectral method
to the CAV optimal control problem. The CAV optimal control problem is discretized and the resulting nonlinear programming problem is discussed. A brief
overview is provided of the optimization algorithm SNOPT, which is used to
solve the nonlinear programming problem. A numerical optimization study is
then conducted to determine the number of nodes required to meet the accuracy requirements of the CAV mission design problem. Also included in the
numerical optimization study is the determination of the values of the weighting factors in the performance index that produce the most desirable trajectory
25
and control.
Chapter 5 presents a parametric optimization study of the CAV optimal control problem. This demonstrates the use of the Legendre Pseudospectral Method
for both vehicle design and trajectory design. The key features of the trajectory
and control generated from the Legendre Pseudospectral Method are discussed
to provide insight on the behavior of the CAV. Parameters in the problem are
then varied to determine the effect on the trajectory and control. The characteristics of the trajectory and control, using the control margin as a performance
metric, are then evaluated.
Chapter 6 describes a preliminary investigation into the real-time application
of the Legendre Pseudospectral Method to the CAV. This is done by simulating the flight of the CAV using the control obtained from the Legendre Pseudospectral Method. The assumptions used to develop the simulation along with
a description of the simulation itself is given in this chapter. The accuracy of
the Legendre pseudospectral solution is assessed by comparing the trajectory
obtained from the optimizer to the trajectory obtained via numerical integration. Perturbations which reflect realistic model uncertainties are then added to
the simulation in an attempt to assess the robustness of the solution. Finally,
Chapter 7 provides a summary of the material presented in this thesis and the
conclusions drawn from the results obtained.
26
Chapter 2
Common Aero Vehicle Problem
Formulation
2.1
Overview
This chapter gives a quantitative description of the optimal mission planning
problem stated in the introductory chapter. Recall that the optimal mission
planning problem is to determine a steering command as a function of time that
takes the CAV from a specified initial state to a target on the surface of the
Earth while optimizing a given performance index and satisfying all of the constraints imposed during flight. First, a mathematical model of the CAV, which is
an unpowered high lift-to-drag ratio vehicle in atmospheric flight, is developed.
Second, boundary conditions are specified to indicate the known initial and terminal conditions for the vehicle. Third, constraints during flight are identified
and quantified. Fourth, the desired performance index that is to be optimized is
developed.
27
2.2
2.2.1
Dynamic Model
Coordinate System
In this particular application, the target is a point on the surface of the Earth.
Consequently, it is most desirable to describe the motion of the vehicle using a
coordinate system that rotates with the Earth. Furthermore, in this research we
are interested only in vehicle performance. Therefore, it is adequate to model
the vehicle as a point mass and consider only the translational motion of the
center of mass (i.e. rotational effects are ignored). In this research, a Cartesian Earth-centered Earth-fixed (ECEF) coordinate system is used. Fig. 2-1 shows
schematically, the position, r, and inertial velocity, v, of the center of mass of
the vehicle represented in an ECEF coordinate system where 0 marks the cen-
Prime Meridian
Nv
G/
r
0 1
R
Equatorial Plane
Earth
Figure 2-1: Earth-Centered Earth-Fixed Coordinate System
ter of the Earth, N is the North Pole, and G is the location of the Greenwich
Observatory in the United Kingdom. The three principle-axis directions of the
ECEF frame are OQ, OR, and ON where OQ, OR, and ON are defined as follows.
The OQ-axis is the first principle direction, lies in the plane (OG,ON), and passes
through the equator (along the Prime Meridian). The ON-axis is the third principle direction and passes through the North Pole. Finally, the OR-axis completes
28
the right-handed system (OQ,OR,ON). Furthermore, the Earth rotates about the
ON-axis with a constant magnitude 0.
2.2.2
Equations of Motion
The three degree-of-freedom equations of motion in ECEF coordinates for a vehicle modeled as a point mass in flight over a spherical rotating Earth are derived
as follows. The position of the vehicle is given as
r = r(t) = xex + yey + zez
(2.1)
where ex is the unit vector in the direction of OQ, ey is the unit vector in the
direction of OR, and ez is the unit vector in the direction of ON. Differentiating
the position with respect to time, the absolute velocity, v, is given as
v = v(t) = ar
at
(
Noting that w =
+ w xr
(2.2)
(ex,ey,ez)
2ez, we have that
v = *ex + pje, + ez +Qez x (xex + yey + zez)
(2.3)
= (* - G2y)ex + (p + Qx)ey +
zez
Differentiating v(t) the absolute acceleration, a, is given as
a=(a(t)
(at)(ex,ey,ez)WXV
=(lzr\(r
at
at2(ex,ey,ez)
+2w x (ar)
+ w x (w x r)
2
t (ex,ey,ez)
= (k - 7p)ex + (p'+ i)ey + 2ez +
=
(
-29
-Y
2
x)ex +((jy)ey
+2k
-+
ez x ((k - Qy)ex + (p + i2x)ey + ez)
+ez
(2.4)
29
Now, let vr represent the Earth relative velocity, i.e.
(
vr
=ar ar
at
(ex,ey,ez)
= xex +
pe,
(2.5)
+ ez
= vxex + vye, + vzez
Substituting Eq. (2.5) into Eq. (2.4), the motion of the vehicle can be expressed in
terms of the position and Earth relative velocity as
at
/av
= Vr
ex,ey,e)
(,
at )(ex,ey,ez)
(2.6)
a -2w x v,
)=
-
w x (w
x r)
The following notation change is made and used in the remainder of this thesis:
)
(ex,ey,ez)
Applying Newton's second law to the vehicle, (i.e. F = ma) where m is the mass
is the absolute acceleration of the vehicle, and F is the
dt
total force acting on the vehicle, we obtain
of the vehicle, a
=d
-
= Vr
r
F
= - - 2w x vr - w x (w x r)
(2.8)
Throughout flight the vehicle is under the influence of gravitational and aerodynamic forces. The free body diagram of the vehicle shown in Figure 2-2 depicts
the individual affects of each component of the total force applied. The gravitational force is denoted by g, the lift force is denoted by L, and the drag force is
30
AL
g
Figure 2-2: Free Body Diagram of the Common Aero Vehicle
denoted by D. The total force on the vehicle is then given as:
(2.9)
F =g +L+ D
where the following equations represent each of the forces with respect to their
ECEF components.
gzez
g
= gxex+gyey+
L
=
D
= Dxex + Dye, + Dzez
Lxex + Lye, + Lzez
(2.10)
The gravitational force is inversely proportional to the square of the distance
between the center of the Earth and the vehicle given as
g = -m P r
r3
(2.11)
where p is the Earth's gravitational parameter and r = lirll 2 is the radius.
The aerodynamic model used to define the lift and drag forces is taken from
Ref. [29] which assumes air is a uniform gas. A drag polar is used and is given
as [221
CD
=
CDO + KCL
CL
=
CL,Cxo
31
(2.12)
where
CD
is the drag coefficient,
CDO
is the zero-lift coefficient of drag, K is
the drag polar parameter, CL is the lift coefficient, at is the angle of attack, and
CLa is the lift slope. The angle of attack is defined as the angle between the
Earth relative velocity and the zero lift line. It can be seen that using the above
assumptions, no lift is produced when c = 0.
The lift and drag forces are defined as follows
L
(2.13)
= LwL
D. =
-Dv
V
(2.14)
where L is the magnitude of the lift force, WL is the unit vector in the lift direction, D is the magnitude of the drag force, and v = lIVr IIis the Earth relative
speed of the vehicle. L and D are defined as
where q
=
L
= qSCL
(2.15)
D
=
(2.16)
qSCD
pv 2 /2 is the dynamic pressure and S is the reference area of the
vehicle. The atmospheric density, p, is modeled as an exponential function of
the radius as shown below
p = po exp [-(r - Re)/H]
(2.17)
where po is the density at sea level, Re is the radius of the Earth, and H is the
density scale-height [281.
The lift direction WL lies in the (r,vy)-plane and rotates with the vehicle while
32
the drag direction is opposite vr. The lift direction, WL, is computed as follows:
y
Wi
=
W2
=
V
r
x vr
rxVr1
(2.18)
W3
Wi X W 2
WL
sin 0w
2+
cOS Ow
3
where o- is the bank angle taken from Ref. [291 as depicted in Figure 2-3. Therefore,
L3
W,
W,
r
Figure 2-3: Bank Angle
the vehicle is controlled aerodynamically via a and o-. While in theory it is possible to control o( and - directly, in practice it is not possible to apply these
controls instantaneously. Consequently, it is necessary to impose rate limits on
a and -. Rate limits are imposed by augmenting the following two differential
equations to the dynamics of Eq. (2.8)
de
=
u,
J
= Uo.
(2.19)
(2.20)
where u, and u- are pseudocontrols that define the angle of attack rate and
the bank angle rate, respectively. The resulting augmented dynamics for an
33
unpowered vehicle in atmospheric flight over a spherical rotating Earth are given
in Cartesian Earth-centered Earth-fixed coordinates as
*c
=vx
9;
=v,
2
=v
X
9x + Lx + Dx + 20v, +
2
9y + Ly + Dy -2Qv±
2
m
X
y
(2.21)
m
2.3
z + Lz + Dz
Iz
=
di
=
u"
0-
=
uo.
m
Boundary Conditions
The desired trajectory steers the Common Aero Vehicle from a fully specified
initial position and velocity to a fully specified terminal position with terminal constraints on speed, the Earth relative flight path angle, and angle of attack. The Earth relative flight path angle, y, is computed from r and vr as
y
=
arcsin r vr). The initial conditions are then given as
r(to)
=
ro(2.22)
Vr(to)
=
Vr,o
while the terminal conditions are given as
=
r(tj)
rf
Vf
y(tf)
=
Yf
c(t5)
=
af
34
S=
(2.23)
2.4
Path Constraints
Flight path constraints are imposed throughout the entire trajectory. Trajectory
constraints include restrictions placed on the radius, speed, and dynamic pressure while vehicle constraints include restrictions placed on the structural load-
ing, thermal loading, and the control authority. Physically, the vehicle cannot fly
below the surface of the Earth. Therefore, it is necessary to impose an inequality
constraint on radius. Because the CAV has no propulsive capability, the speed
will never increase during re-entry. In order to enhance the performance of the
optimizer, a path constraint is placed on the speed. A dynamic pressure constraint is imposed during entry in order to maintain control authority. The CAV
has a limit on the maximum sensed acceleration it can withstand. Therefore, a
path constraint is placed on the sensed acceleration, a, which is defined as
a = VD 2 + L 2
(2.24)
During entry the vehicle absorbs heat. Because the amount of heat that the vehicle absorbs is limited by the material used in construction, a maximum allowable
heat load constraint is imposed. In this research, the heat load is taken to be the
stagnation point heat load [5] given as
Q=
to
Qdt
(2.25)
where
Q
K(PIPo)os(VIV)
3
.S
(2.26)
ve is the speed of a vehicle in circular orbit at a radius equal to the radius of
the Earth, po is the atmospheric density at sea level, and K is a known constant.
Operational path constraints include limits on the angle of attack and rate limits on the angle of attack and bank angle. The resulting inequality constraints
35
imposed during flight are given quantitatively as
r
>
Vmin :
V
0
a,nin
Uo-,min
2.5
Re
vmax
q
ii
a
amax
Q
Qmax
5
O
Uc
Uo-
(2.27)
5 amax
5 Ua,max
Uaomax
Performance Index
The CAV mission design problem includes steering the vehicle from a known initial state to a specified terminal state. Thus, it is important for the guidance and
control system to be able to reach the target. This requires a guidance and control system capable of not only steering the vehicle with precision and accuracy,
but also designing a trajectory that is capable of handling environmental disturbances experienced throughout flight. In attempt to minimize the demands on
the guidance and control systems, it is desirable to keep the controls away from
their upper and lower limits. This allows for more flexibility in the controls to
account for off-nominal perturbations experienced during flight. Defining the
control margin as the magnitude of the difference between the actual control
and the control limits, the goal of the performance index is to maximize the
control margin. Therefore, a performance index is constructed that attempts to
keep x in the middle of its capability and penalizes large control rates. Mathematically, a penalty is imposed on deviations in a from &, where 6 = VCDo/K.
This value of 6t corresponds to the angle of attack at the maximum L/D ratio
and lies in the middle of the bounds placed on the angle of attack. Minimization
of the control rates (ux, u,) keeps the controls smooth and within their allowable limits. While many possible performance indices can be constructed, the
36
following performance index is used in this thesis:
=ua
to
L
+ ks
(max /
2/
u,max
u
2]
o-U,max
dt
(2.28)
where ki, k 2 , and k 3 are positive constants. Each term is weighted by its respective maximum value for easier interpretation of the constants and squared to
account for the possibility of a negative value.
37
[This page intentionally left blank.]
Chapter 3
Optimal Control: Problem Formulation
and Solution Methods
3.1
Overview
Consider a dynamical system that is subject to constraints. Furthermore, con-
sider a system whose state can be affected by the choice of various inputs or
controls. Any input to the dynamical system that satisfies the constraints is
called a feasible control. The time history of the state that results from the application of a feasible control is called a feasible trajectory.For many problems it is
desired to determine the feasible control and feasible trajectory that optimizes a
specified performance index for a dynamical system subject to constraints. Such
a problem is called an optimal control problem.
It can be seen from Chapter 2 that the CAV mission design problem is an optimal control problem. While in principle any well-posed optimal control problem
has a solution, finding such a solution is often a difficult task. In this chapter an
overview is given of the basic theory of optimal control. Furthermore, a survey
of various numerical methods for solving optimal control problems is discussed.
Finally, the method used to solve the CAV mission design problem, the Legendre
Pseudospectral Method of Refs. [7, 9, 10, 11], is described.
39
3.2
Optimal Control Problem
An optimal control problem consists of four parts: (1) a mathematical model
describing the dynamics of the vehicle (equations of motion), (2) the boundary
conditions that specify the initial and terminal states, (3) path constraints that
are enforced during the trajectory, and (4) a performance index that measures
the optimality of the solution.
3.2.1
Dynamics
In general, a mathematical model for the dynamics of a particular system is comprised of three quantities: the state, the control, and the independent variable
(generally speaking, time). The state, denoted x(t), is a vector whose components individually define the variables that are required to describe the behavior
of the system at any instant of time. Denoting the dimension of the state by n,
the state is given mathematically as
xit)
x(t)
=
x 2 (t )
E R"
(3.1)
xn (t )
Similarly, the control, denoted u(t), is a vector whose components individually
define the inputs to the system at any instant of time. Denoting the dimension
of the control by m, the control is given mathematically as
Sui(t)
u(t) =
u 2 (t)
Um(t)
40
E R'
(3.2)
The dynamics of the system are described via a system of ordinary differential
equations of the form
i t) = f (X(t), U (t), t)
(3.3)
where i(t) is the time derivative of the state vector and f : R" x R" x R - R". In
general, the dynamics of Eq. (3.3) are nonlinear.
3.2.2
Path Constraints
Virtually all problems in dynamical systems are subject to constraints during
the evolution of the system. Such constraints are called path constraints. Denoting the number of path constraints by p, the path constraints are described
mathematically as
gmin s! g(X (t), u (t), t) s! gmax
(3.4)
where g: RI x Rm x R - RP and gmin E RP and gmax C RP are constant vectors.
3.2.3
Boundary Conditions
The boundary conditions describe events that occur at either the beginning or
the end of the trajectory. The boundary conditions are split into initial conditions that occur at the initial time, to, and terminal conditions that occur at
the terminal time, tj. Denoting the number of initial conditions by qo and the
number of terminal conditions by qf, the boundary conditions can be expressed
mathematically as
ho(x(to), to)
=
0
(3.5)
hf(x(tf), tf)
=
0
(3.6)
where ho : R" x R - Re and hf : Rn x R - Rqf.
41
3.2.4
Performance Index
The performance index is the functional (i.e. it is a function of a function) that
is to be optimized in the optimal control problem. The performance index produces a scalar output. Since the goal is to minimize (or maximize) the performance index, an accumulation (or depreciation) in value of the resulting scalar
can be thought of as a cost penalty. Often referred to as the cost functional,
the performance index can be broken into three parts: an initial cost, a terminal
cost, and an integrated cost. As the name implies, the initial cost Jo is associated
with the initial state, x(to), and the initial time, to. Similarly, the terminal cost Jf
is associated with the terminal state, x(tj), and the terminal time, tf. The initial
and terminal cost are given, respectively, as
Jo
= A4(x(to),to)
Jf =
(3.7)
M(X(tf), tf)
where AI : R" x R - R and N : R" x R - R. The integrated cost is a cost that
accumulates throughout the trajectory and is given as
t5
Ji =
.1:
(3.8)
£(x(t), u(t), t)dt
where f : R" x R"I x R - R. The total cost is then given as
tf
J = M(x(to), to) + N (x(tf), tf) +
3.2.5
fto L(x(t), u(t), 0 dt
General Form of an Optimal Control Problem
Using the definitions in Sections 3.2.1-3.2.4, an optimal control problem is now
stated formally as follows. Determine the control u* (-), and the state x* (-) on
the interval t c [to, tf] that minimizes the cost functional
+T
J = _'l(x (to), to) + X (x(ty ), tf ) + toI(x
42
t), u t), t d t
(3.9)
subject to the dynamic constraints
x
=
f(x(t), u(t), t)
(3.10)
the path constraints
gmi n
g(x(t), u(t),t) s gmax
(3.11)
and the boundary conditions
3.3
ho(x(to), to)
=
0
(3.12)
hf(x(tf),tf)
=
0
(3.13)
Methods for Solving Optimal Control Problems
A solution to an optimal control problem is obtained using either an analytic
or a numerical method. Typically, optimal control problems cannot be solved
using analytic methods and thus solutions are obtained via numerical methods.
Nonetheless, it is important to understand both analytic methods and numerical
methods.
3.3.1
Analytic Methods for Solving Optimal Control Problems
Analytic solutions to optimal control problems are generally determined by one
of two approaches: calculus of variations and dynamic programming. Calculus of variations involves setting the first variation of the cost functional (or an
augmented cost functional) equal to zero which leads to a set of first-order necessary conditions for a solution to the optimal control problem. Pontryagin's
Minimum Principle [17] is used to determine the optimal control. Application of
dynamic programming leads to the Hamilton-Jacobi-Bellman (HJB) equation [17].
The HJB equation is a partial differential equation which governs the dynamics
of the optimal cost functional. Calculus of variations, in combination with the
43
principle of optimality, leads to a derivation of the HJB equation. The intention here is to provide the reader with a brief explanation of analytic methods
and the difficulties that hinder the implementation of analytic methods. For a
complete explanation and derivation of both calculus of variations and dynamic
programming please refer to Refs. [31 and [171.
Calculus of Variations
Calculus of variations, in terms of optimal control problems, is motivated by the
desire to determine the feasible control and feasible trajectory that minimizes a
performance index. In the unconstrained case, the optimal control problem simplifies to a functional minimization problem. Consider the functional J(x(t)). A
local minimum of J exists at x* (t) if
(3.14)
J(x(t)) - J(x* (t)) > 0
for all admissible x(t) in some neighborhood around x*(t), (i.e. Ilx(t) - x* (t)I <
c). If the neighborhood can be extended to the entire domain of x(t), then x* (t)
is a global minimum. A necessary condition for x* (t) to be a local minimum of
J is
6J (x*(t), 6x(t)) = 0
for any
6x(t)
where 6J is the first variation of the functional. In order to determine if the
stationary point is indeed a minimum, the second variation of the functional is
considered. By doing so, second order sufficient conditions for a local minima
are established. Please refer to Ref. [31 for a complete explanation and derivation
of the second order sufficient conditions.
By definition, an optimal control problem has constraints and thus the application of functional minimization to an unconstrained problem must be extended to handle a constrained problem. Consider the following optimal control
44
problem: Minimize
J =
tf
L(x(t), u(t), t)dt
T
(x(tf), tj) +
subject to the system equations
x = f(x(t),u(t), t)
and control constraints
u(t) E U(t)
where x(to) and to are fixed, tf is free, and there are simple form terminal constraints. In order to impose the state differential equations, an augmented cost
functional Ja is considered where
Ja = X(x(tf), tj) +
I L(x(t),u(t),
t) + A(t)T[f(x(t),u(t), t)
t0
-
i]
dt
A(t) E R" is the co-state. In taking the first variation in Ja, it is convenient to
define the Hamiltonian,H [171:
H(x(t), u(t), A(t), t) = L(x(t),u(t), t) + AT (t)f(x(t), u(t), t)
(3.15)
In terms of the Hamiltonian, the necessary conditions for an extremal trajectory
are
f(x, u, t)
(3.16)
x(to) = Xo
(3.17)
i
_aH T
(3.18)
H(tj) + at (tf) = 0
(3.19)
x (tf) = xf,i
(3.20)
Ai(tj) =
axi (tj)
45
(3.21)
where Eq. (3.16) is the dynamic constraints, Eq. (3.17) is the initial conditions,
Eq. (3.18) is the co-state equations, Eq. (3.19) is the transversality conditions,
and Eqs. (3.20) and (3.21) are the terminal conditions. In a neighborhood of a
locally optimal solution, where the state and co-state differential equations and
all the boundary conditions are satisfied, the first variation becomes
6]a
f
Hu(t)6u(t)dt
to
H, is the functional gradient of the augmented cost functional with respect to
the control at every point in time. If the extremal solution is a minimum, then
any variation from that point will yield a positive variation.
Hu(t)6u(t) > 0
for all admissible
5u(t)
The goal is to minimize H over the admissible range of u. From Pontryagin's Minimum Principle [17], the admissible control that minimizes H can be determined
and is given as
mn
u* (t) = arg
H(x*(t),u(t), A*(t), t)
(3.22)
u(t)EU(t)
Only in simple cases can a solution that satisfies the necessary conditions stated
in Eqs. (3.16)-(3.23) be obtained. The combination of nonlinear differential equations and split boundary values creates difficulty in finding an analytic solution
to the optimal control problem.
Dynamic Programming
This approach uses calculus of variations and the principle of optimality to develop a partial differential equation which governs the optimal cost functional.
Consider the following nonlinear system with general terminal constraints and
control constraints. Minimize
tJ
J (x (t), U (t), t) = X (x(tj), tf ) + ft I(x (t), u(t), t d t
46
subject to
xkt) = f (X(t), U(t), t)
hf(x(tf), tf) = 0
u(t) E U(t)
Given the initial state and control history, the state history is computable and the
optimal cost is a result of the optimal control history. As a result, the optimal
cost does not depend on the control, J*(x(t), u(t), t) = J*(x(t), t).
Consider a control problem where given an initial state x(to), the goal is to
drive the system to a terminal state, x(tf). Suppose that the optimal solution
passes through some intermediate point x(ti). The principle of optimality states
that the solution to the optimal control problem starting at x(t
1
) and terminating
at x(tf) is a segment of the solution to the optimal control problem that starts
at x(to) and terminates at x(tj)'[17]. In other words, any portion of an optimal
solution is itself an optimal solution.
Using the principle of optimality and assuming that J is twice differentiable
with respect to x(t) and t, a Taylor series expansion of J* about (x(t), t) yields
the Hamilton-Jacobi-Bellman (HJB) equation [17]:
a*(x(t), t) =
at
min I (x (t), U(t), t) + a*(X (t), t f (X(t), U(t), t)
u~U t)
ax
subject to the constraints
J* (x(t), t)
=
N(x(t), t)
on
hf(x(t), t) = 0
The HJB equation is both necessary and sufficient for optimality [17]. If the
above equation can be solved to obtain J* [x(t), t], then the optimal control is
determined as a feedback law and is given as
u* (t) = arg
mn
[L(x(t), u((t, t) +
aj*
(x(t), U(t), t
(3.23)
While the result of Eq. (3.23) applies to problems with general dynamics, a gen47
eral cost functional, and constraints, it is rarely possible to obtain analytic solutions to the HJB equation. HJB theory could be used to generate an optimal
feedback law numerically, but this is not usually done. Instead the HJB equation is used to test the optimality of a control whose form was either guessed or
obtained by some other method 1171.
3.3.2
Numerical Methods for Solving Optimal Control Problems
In general, the optimal control problem of Sec. 3.2 cannot be solved analytically,
so the solution must be attained using a numerical method. Numerical methods
fall under two main categories: indirect methods and direct methods.
In an indirect method, the Hamiltonian boundary-value problem (HBVP) that
arises from the first-order necessary conditions via the calculus of variations is
solved numerically. The general procedure for solving the HBVP [3] begins with
making an initial guess for the unspecified initial (or terminal) conditions. An
iterative procedure is then used to modify the estimate of the unknown initial
(or terminal) conditions, where each modification should improve the solution.
An improvement occurs when the current solution is "closer" to satisfying the
necessary conditions than the previous solution. If the iterative procedure converges, it will produce a solution that satisfies all of the necessary conditions.
Obtaining an initial guess is not a trivial procedure and thus more often than
not the solution that results will violate at least one of the necessary conditions. Examples of such iterative procedures include steepest descent methods,
neighboring extremal methods, and quasilinearization methods. Please refer to
Refs. [3] and [17] for an explanation of each procedure.
An advantage of using indirect methods is that an accurate co-state can be
found [201, which is beneficial because this co-state is then used to compute an
accurate control. Unfortunately, it is often impossible to obtain an initial guess
for the unknown conditions at one end which will produce a solution sufficiently
close to the optimal solution. As a result, it is often difficult to solve the optimal
48
control problem using an indirect method.
In a direct method, the optimal control problem is discretized at particular
time points called nodes. This discretization leads to a nonlinear programming
problem (NLP). The number of nodes is chosen large enough so that the time
steps are small enough to adequately represent the solution characteristics and
the implicit integration of the system equations produce sufficiently accurate
results. Provided that the time steps adequately represent the solution, the accuracy of the implicit integration depends on the specific quadrature rule used
[12]. In terms of parameterizing the problem, there are two approaches taken:
differential inclusion and collocation. Both of these methods involve implicit
integration of the system governing equations. However, differential inclusion
only discretizes the state variable time history while collocation discretizes both
the state and control variable time histories.
Differential inclusion methods replace bounded controls with bounds on admissible values of the state variable time rates of change. Elementary implicit
integration rules are then used to write the time rates of change as functions of
only the state variables. Since the control variables are eliminated, the number
of variables in the resulting NLP is reduced which, in turn, significantly reduces
the computation time required to solve the NLP [4]. Differential inclusion is restricted to problems with linearly appearing controls and the state rate must be
determined by the least accurate quadrature rule, Euler integration [9].
In collocation methods, the state and control are known at the node points
and the system governing equations are satisfied by including nonlinear constraint equations at the node points. The time histories of the state and control
variables are obtained using interpolation and the state differential equations are
satisfied using implicit integration. In most collocation methods, linear or cubic
splines are used as the interpolating polynomial and Gauss-Lobatto quadrature
rules, such as trapezoidal and Hermite-Simpson, are used for implicit integration [10]. The NLP resulting from using a collocation method typically has many
more variables and constraints; however, collocation methods are more accurate
49
than differential inclusion [4]. Collocation methods can use implicit integration
schemes with a higher order of accuracy and the number of nodes needed to
obtain the same level of accuracy as in differential inclusion is much smaller.
Finding a solution to the NLP that results from employing a direct method
is significantly easier than solving a HBVP [19]. As a result, direct methods are
capable of solving complex problems with a relatively poor initial guess. However, the co-state is not as accurate as that obtained via an indirect method.
Consequently, it is difficult to implement a direct method in real time.
3.4
Direct Transcription of Optimal Control Problem
Via Pseudospectral Methods
Spectral collocation methods, also referred to as pseudospectralmethods [12, 27],
combine the advantages of differential inclusion and collocation methods. Pseudospectral methods use differential inclusion, but retain the desired accuracy of
using higher order quadrature rules. Partitioning of the time interval is based
on the Gaussian quadrature formula, which results in an unequal distribution
of time points. The state and control are approximated by global orthogonal
polynomials while the derivative is approximated by a discrete differentiation
operator. Gauss-Lobatto quadrature rules are then used to approximate the integral with a summation. Despite the fact that both methods use essentially
the same technique, pseudospectral methods are faster and more accurate than
traditional collocation methods [6]. The solution to the CAV optimal control
problem considered in this thesis is solved using a pseudospectral method.
3.4.1
Pseudospectral Methods
In pseudospectral methods the time interval is divided into segments where the
nodes correspond to the locations of knots in Gaussian quadrature formulas.
The knots in Gaussian quadrature formulas are chosen such that the approxi50
mation of the function is exact for polynomials of higher order [12]. According
to the approximation theory, nodes that are the roots of orthogonal polynomials
will yield the best approximation [9].
The state and control constraints are satisfied at the nodes using global orthogonal polynomials. Orthogonal polynomials are closely related to Gauss-type
integration rules which yields an easy transformation of the state and control
constraints to algebraic equations [10]. Letting Ti, for i = 0, 1, 2, ... , N represent
the nodes, the function
is approximated as
y((T)
N
y(T) ~yN(T)
=
>y4
1T
i=O
where y is the value of y at
T1
and <pi(T) are the interpolating polynomials such
as Chebyshev or Legendre [121. The set of interpolating functions satisfy
<hi (Tj) = Sij
1i=j
0 i =j
Thus the value of yN(T) at the point Ti, for i = 0, 1, 2, .... ,N is equal to the value
of the function y
(T)
Y (Ti) = yN (i
According to this definition of interpolation, the approximation is exact at the
nodes.
Typically the derivatives are approximated using finite difference or finite element methods. In pseudospectral methods, the state differential constraints
are imposed by collocating the differentiation matrix at the nodes. The differentiation matrix is determined by taking the analytic derivative of the interpolating
polynomials as shown below
N
f(T)
~9N(T)
yN
(Ti)
j=0
51
Denoting the differentiation matrix by D whose elements are Dij = <j (Ti), we
have that
j,N(T)
(3.24)
= DyN(T)
In terms of accuracy, as N increases the convergence rate of finite difference
or finite element methods decreases on the order of N-" where m is a constant
that depends upon the order of the approximation and the smoothness of the
solution. Spectral methods will converge faster than any finite negative power
of N [16].
The integral performance functional is approximated using Gauss-Lobatto
integration rules [9]. Consider the integral of y(T) with respect to a weighting
function 0(r)
Iy
=f
(T)Y(T)dT
N
Iy
IyN
f
o1T)
Y Yi~
1
(T)dT
i=O
Discrete weights wi, for i
0,1, 2,...,N, which correspond to a set of orthogo-
nal polynomials, are defined as
wi
j
(T)pi(T)dr
which results in the following summation approximation to the integral
N
IyN _
Wjy,
i=0
3.4.2
Legendre Pseudospectral Method
The Legendre Pseudospectral Method of Refs. [7, 9, 10, 11] is a direct method
that converts the optimal control problem into a nonlinear programming problem. One of the many available software programs is then used to solve the
resulting nonlinear programming problem (NLP). The collocation points for the
Legendre Pseudospectral Method are the Legendre-Gauss-Lobatto (LGL) points
52
where the state and control parameters are the unknown values of the states
and controls at the LGL points. The continuous time problem is transformed to
a set of algebraic expressions using Nth degree Lagrange interpolating polynomi-
als to approximate the state and control parameters and the performance index
is descretized using the Gauss-Lobatto quadrature rule. The remainder of this
section provides a detailed description of the Legendre Pseudospectral Method
taken from Ref. [9].
Optimal Node Spacing
When determining collocation points it is advantageous to choose a distribution
that gives the best polynomial approximation. LGL points produce the smallest
L2 interpolation error [9] and thus yield better results than approximations obtained using equidistant points [11]. The LGL points lie on the interval [-1,11 and
are defined as
-1
To
TI
= roots of LN(T)
TN
=1
for l = 1, 2,...,N - 1
where LN(t) is the time derivative of the Nth degree Legendre polynomial, LN(t)As depicted in Fig. 3-1, this particular node distribution creates a clustering
of points near the endpoints. Denoting the initial time by to and the terminal time by tj, let t represent actual time and
T
E [To,
TN
T
represent LGL time where
-1, 1]. The actual time is mapped to LGL time by the follow-
ing affine transformation:
2(t - to) - (tf - to)
t5 - to
(3.25)
and conversely, LGL time is obtained from actual time via the inverse affine
transformation:
t - (tf - to)T + (tf + to)
2
53
(3.26)
46
a
3
±.~
35
*4**
0
* .4.4,
*
:4
4-
* *40
-
0
4-,25
0
~20
+4
+
4-. 4,4
.4 . ...
4.
*.4#
44
+-
15|-
1
-1
* -0.8' e
e-0.4'
'
-0.6
's
-0.2
I
*
0
0.2
0.4
0.6
0.8
"
Location of LGL Points
Figure 3-1: Distribution of LGL points for a given number of nodes
Taking the differential of Eq. (3.26), we obtain
(3.27)
dt = (tj - to)
2
d
Subsequently, in terms of LGL time, the optimal control problem of Eqs. (3.9)(3.12) becomes: Minimize
J
=
'M(x(-1), t 0 ) + N(x(1), tj) + tf 2 to
f
L(x(T), u(T), T, to, tj)d r
(3.28)
subject to the dynamic constraints
t5 - to
x = tf2
f(x(T), u(T),
T, to, tj)
(3.29)
the path constraints
gmin : g(x(T), u(T), T, to, tf)
54
Ymax
(3.30)
and the boundary conditions
ho(x(-1), to)
=
0
(3.31)
hf(X(1), tf)
=
0
(3.32)
Lagrange Interpolation with Legendre Polynomials
The state and control functions are approximated at the LGL points using Nth
degree Lagrange interpolating polynomials. Obtained from orthogonal Legendre
polynomials, the Lagrange polynomials are the trial functions while the state
and control variables at the LGL points are the unknown coefficients. Legendre
polynomials have a weight function c(t) = 1 and are orthogonal over the interval
[ -1, 1] [9]. In terms of Lagrange polynomials <q (T) for I = 0, 1, 2,..., N, the state
and control variables are approximated as:
N
x(T)
XN
(3.33)
XT4T
_
1=0
N
u(T)
UN(T)
=
YU Ti
1=0
P(T
(3.34)
where the general equation for the Lagrange interpolation scheme at the LGL
pointsTi,
1=0,1,2,...,Nis
(T - TO)... (T - TI- 1 )(T - T11)... (T - TN)
(Tj - TO) ...(Tj - Tj- 1) (Tj -- Tj+1)... (Tj - TN)
More concisely, Eq. (3.35) can be expressed as
N
<(T) =
T - Tn
M=1 T - Tm
m:j
55
=1,...,N(3.36)
(3.35)
In order to obtain an expression of the Lagrange polynomial in terms of Legendre
polynomials, a function w (T) is defined as
N
w(T)=
7
(T - Tm)
(3.37)
m=O
Evaluating the time derivative of w at Tj,
j = 0,1,2,...,Nwehave that
N
'W)(Tj)
H (-r1
M=1
mij
-
(3.38)
Tm)
Consequently, the Lagrange interpolating function can be rewritten as
w(T)
1
(3.39)
(T - Ti) W (Tj)
Referring to the definition of the LGL points, the derivative of the Nth degree
Legendre polynomial can be expressed as
LN (T)
=
(T
(T - TN-1)
- TI) (T - T2) -.-.
(3.40)
Combining Eqs. (3.37) and (3.40) with the fact that To = -1 and TN = 1, we obtain
N
w(T) = 1
(T - Tm)
=
(T - To)(T - T 1 ) ...
(T -
=
(T - To)LN(T) (T -
TN)
=
(T2 _ 1)LN(T)
TN-1)(T - TN)
m=O
(3.41)
In addition, the Legendre polynomials are the eigenfunctions of the differential
equation
d
dt (1
dt
-
T 2 )LN]
+ N(N + 1)LN(t) = 0
56
(3.42)
Using this property, the following expression shows the relationship between
1W(T) and LN(T).
N(N + 1)LN(Tj)
=
d [(T2_ 1) LNIT=T = W(TI)
(3.43)
The equation for the Nth degree Lagrange interpolating polynomial in terms of
the Legendre polynomial of degree N is
(T2 _ 1)iN (T)
-(3.44)
<pl(T ) =
(T - TI)
1
N(N + 1)LN(TI)
It can be shown that
1 if
1= k
0 if
1
=6ik=
<pI(Tk)
k
(3.45)
which leads to
XN(Tk)
uNTk
X(Tk)
=
k = 0,1,...,N
(3.46)
U(Tk)
Derivative Approximation
To impose the state differential equations at the LGL points, a differentiation
matrix is calculated by taking the analytic derivative of the interpolating polynomial. Since only the derivative at the node is desired, the following expression is
used
N
N(Tk
x(TlOL(Tk
(3.47)
1=0
N
=
ZDkIx(TI)
1=0
57
(3.48)
Dkl is the (N + 1) x (N + 1) differentiation matrix where
LN(Tk)
1
LN(TI) Tk
-
k =
Ti
N(N + 1)
k
1
0
4
DkIl=
(3.49)
N(N +1)
k
4
0
1=N
otherwise
Integral Approximation
Using the Gauss Lobatto integration rule, the cost functional is transformed to
an algebraic expression in terms of the state and control as follows
j
jN
=
Al(XN(-1), to) + N(xN(1), t)
=
'(x(-1),to)
+ t
C(XN
N(T), UN(T), T)dt
-
N
+
+ tft -
(x(1), t)
£(X(Tk), U(Tk), Tk, to, tf)Wk
k=o
where Wk are the weights corresponding to the Legendre polynomials [91 and are
expressed as
2
1
Wk = N(N + 1) LN(Tk ) 2
(3.50)
Nonlinear Programming Problem
The optimal control problem of Equations (3.9)-(3.12) is approximated by the
following nonlinear programming problem. Minimize the cost functional
N
J = 'M(x (-1), to) + X(x(1), t)
+ t2
to
I (X(Tk), U(Tk), Tk, to, tf)Wk (3.51)
k=O
over the variables
R",
k =0, 1,.., N
u(Tk) E RmI,
k =0, 1,..., N
x(TO)
to E R
tE C R
58
(3.52)
(.2
subject to
N
Z DkLX(T)
tf-t
-
2 t
f(x(Tk),U(Tk), Tk, to, tf)
=
0,
k = 0, 1,...N
1=0
gmax,
gmin : g(X(Tk ), u(Tk), Tk, to, tf)
ho(xo, to)
hf(xN, tf)
3.5
=
k=0,1,...,N
0
0
Summary of Optimal Control
According to the definition of an optimal control problem presented in Section
3.2, the CAV optimal mission problem formulated in Chapter 2 is an optimal
control problem. Because of the complexity of the optimal control problem,
it is necessary to obtain a solution numerically. Numerical methods fall into
two main categories: indirect methods and direct methods. Indirect methods
produce an accurate control, but they require an initial guess that produces a
solution close to the optimal solution. It is often difficult to obtain such an
initial guess. Direct methods have a wider range of convergence than indirect
methods, but produce a less accurate control than that which is obtained via
indirect methods. Pseudospectral methods comprise a class of newly developed
direct methods for solving optimal control problems which have a wide range
of convergence and produce an accurate control. In the subsequent chapters,
the Legendre Pseudospectral Method is applied to the Common Aero Vehicle
mission design problem.
59
[This page intentionally left blank.]
Chapter 4
Numerical Optimization Study of the
Common Aero Vehicle Problem Using
the Legendre Pseudospectral Method
4.1
Overview
The purpose of this chapter is to provide a detailed description of the steps involved in obtaining a solution to the CAV optimal control problem via the Legendre Pseudospectral Method. In particular, the discretization of the CAV optimal
control problem is described in detail. Properties of the resulting nonlinear programming problem are discussed in terms of characteristics that have an impact
on the optimization algorithm. The optimization algorithm used to solve the
NLP, SNOPT, is introduced with a brief explanation of how it solves the NLP. The
user inputs required for the implementation of SNOPT are also included in this
discussion about the optimization algorithm. Then, the specific values used for
the vehicle dynamic model and the bounds on the variables and constraints described in the discretization process along with the inputs to SNOPT are listed.
However, the number of nodes required to obtain a sufficiently accurate solution
is unknown a prior. Similarly, the choice of values for the weighting factors in
61
the performance index that will produce the most desirable trajectory and control is also not known a priori. Consequently, the last two sections are devoted
to the analysis used to choose appropriate values for the number of nodes and
the weighting factors. In doing so, the accuracy of the results obtained via the
Legendre Pseudospectral Method is assessed and the desirable characteristics
of the solution are noted. It may be useful to review Appendix A and B before
proceeding.
4.2
Discretization via the Legendre Pseudospectral Method
The Legendre pseudospectral transcription, described in Chapter 3, is applied to
the CAV optimal mission design problem formulated in Chapter 2. Implementing
the Legendre Pseudospectral Method requires discretization of the dynamics,
boundary conditions, path constraints, and performance index. The resulting
NLP is comprised of a bounded optimization vector, a bounded vector of equality
and inequality constraints, and a cost functional.
4.2.1
Optimization Vector
The optimization vector is comprised of the variables manipulated by the NLP
programming solver to determine the optimal solution. These variables are referred to as decision variables and include the state and control variables at
the LGL points as well as any undefined time points.
The augmented state
variables at the M (= N + 1) LGL points are the ECEF Cartesian components
of position (x c RM, y
(vx e RM, vY e
(o- E RM).
E
M Vz E
RM, z
E
RM), the Earth relative velocity components
RM), the angle of attack (o E RM), and the bank angle
The control variables at the M LGL points are the angle of attack
rate (u, e Rm) and the bank angle rate (u, E RM) and the final time, tf E R, is the
only undefined time point. In terms of these decision variables, the optimization
62
vector, xopt E R(f10M+1), for the CAV optimal mission design problem is
xpt =
X
y
Vx
Z
Vy
Of
Vz
o
Ua
Uof-
tf]
(4-1)
Naturally there is a range of admissible values pertaining to each of the decision variables. This leads to a lower bound vector bi c RM and an upper bound
vector b, E RM for each state and control decision variable and a lower bound
bEt5 c R and an upper bound bu,tf c R for the final time. An additional subscript
on b, and b, indicates which decision variable pertains to the respective bound.
Boundary conditions are then imposed by setting the upper and lower bounds
equal to the same value. For instance, initial conditions are imposed by setting
the upper and lower bounds equal to the appropriate initial value at the first LGL
point. The vectors of lower bounds on each decision variable are:
bi,x
=
X0
bi,,
=
YO Ymin
bl,z
=
Zo
bi,vx
Xmin
Zmin
v
V
...
Xmin
...
Ymin
zmin
...
x5
Yj
Z]
x,min
...
Vx,min
Vx,min
...
Vy,min
Vy,min
bi,vy
VyO
Vy,min
bi,vz
Vzo
Vz,min
bi,a
0 0 ...
...
Vz,min
Vz,min
0 0
bi,o,
=
rmin
bi,ux
=
Ucmin
Ucn-min
...
Ua,min
Ua,min
bi,uo-
=
uo,min
Uo-,min
...
U,min
Uo-min
bL,tf
=
o-min
(4.2)
-min
...
0min
0
Recall that the CAV considered has one-sided angle of attack control which is
indicated by a lower bound vector of zeros. Also note that there are no initial
conditions on the angle of attack and bank angle. This means that the optimizer
is free to choose their respective initial values.
63
Combining the lower bound
vectors corresponding to each of the decision variables yields a lower bound
vector for the optimization vector denoted by
BL,xopt
--
bl,x bl,y
bl,z bl,vx
BL,xopt E
R(1OM+1) as shown below.
bl,vy bl,vz bl,of bl,or
bl,uaf bl,uog bl,tf
(4.3)
Similarly, the upper bound vectors for each decision variable are
bu,x
X0
Xmax
...
Xmax
Xf
...
Ymax
YJ
bUy
=
Yo
Ymax
bu,z
=
Zo
Zmax
zmax
...
Zf
bu,vx
VxO
Vx,max
...
Vx,max
Vx,max
bu,vy
vyo vy,max
...
Vy,max
Vy,max
bu,vz
Vzo
Vz,max
...
Vz,max
aXmax
bu,o
=
amax
L(max
...
bua
=
rmax
o-max
...
-
max
(4.4)
Vz,max
0]
0-max
bu,ua
U a,max
U ,max
...
U ,max
U ,max
bu,uo
Uo,max Uo,max
...
U,max
U,max
bu,t
=
tfmax
Notice that the terminal velocity is not fully specified and thus the bounds on the
components of velocity at the last LGL point are simply the minimum and maximum values respectively. The resulting upper bound vector for the optimization
e,xopt
R(OM+l) as shown below
vector is B
Bu,xopt
=
bu,x
bu,y bu,z bu,vx bu,vy bu,vz
bu,t
bu,o
bu,ua
bu, 0-u bu,tf
(4.5)
64
Discretization of the Dynamic Constraints
4.2.2
In terms of the optimization vector, the equations of motion given in Eq. (2.21)
are expressed as
(4.6)
xcem = f (xop)
where
vx
vy
vz
gx + Lx + Dx + 2Qv, + Q 2x
f(xOPt) =
(4.7)
gy +g~±L~±Ly + Dy -2v+0y
2Qvx +Q 2 y
m
gz + Lz + Dz
m
Using the differentiation matrix
DN,
the continuous time equations are trans-
formed into an algebraic expression. The dynamic constraints defined in Eq. (4.7)
are denoted by C E RM and a subscript that indicates which decision variable
corresponds to that particular constraint equation. Rewriting the equations in
constraint form yields
vx
Cx
DNX
Cy
DNY
Cz
DNZ
vy
vz
Dx + Lx + gx
+ 2wvy +
Cvx
DNVx
tf - to
Cvy
DNVY
2
Coz
DNVz
Ca
DNa
Ca
DNO'
m
+
±Y
±Ygy +
m
2wvx + (2
Dz + Lz + gz
m
65
2
= 0
(4.8)
Together, these constraints comprise the dynamic constraint vector, Cde e R8M,
as shown below
Cdc
[ Cx
Cy
Cz
CVy
Cvx
CVz
C
C
]
(4.9)
Since the dynamic constraints are equality constraints, the lower bound vector,
BL,dc
E
4.2.3
R8M, and the upper bound vector, Bu,dc
E
p8M, are each a vector of zeros.
BL,dc
=
0
(4.10)
BU,dc
=
0
(4.11)
Discretization of the Path Constraints and the Terminal
Constraints
Path constraints confine the optimizer to stay within a set region when determining the trajectory. Referring to the path constraints listed in Eq. (2.27), the
path constraints corresponding to decision variables (a, u, u,) are included in
the optimization bound vectors. The remaining constraints are placed on the
radius, (r E RI), speed (v c RM), dynamic pressure (q c Rm), and sensed acceleration (a E RN) at every LGL point. Also included in this discretization is the
terminal constraint on the total heat load (Q c R) at the final LGL point. Recall
from Chapter 2 that the heat load is expressed as an integral. Discretization of
the integral results in the following summation:
Q
=
2 t
Wk
(4.12)
k=O
The terminal boundary condition on the flight path angle has yet to be imposed
and thus a constraint is placed on the sine of the flight path angle (Cy E R) at the
final LGL point. The resulting constraint vectors, denoted by C and a subscript
66
which indicates the corresponding constraint, are:
Cr
r
CV
V
Cq
q(4.13)
Ca
a
CO
Q
CY
S
where
s = sin(y(tj))
(4.14)
These constraints combine to form a constraint vector C1 tc
C R(4M+2)
as shown
below
Ch
CPtC
Cs C,
Ca CQ
Cy
(4.15)
where the subscript ptc is used to indicate path constraints and terminal constraints. Similar to the dynamic constraints, there are lower and upper bounds
on each path constraint and terminal constraint. The bounds on the path constraints imposed throughout flight are denoted by b E RI where the subscript
includes an 1 for lower bound or a u for upper bound and an additional letter to
indicate which path constraint corresponds to the bound vector. The terminal
constraints on the total heat load and the sine of the flight path angle are denoted by b E R and the same subscript notation as described above. The vectors
of lower bounds on the path constraints and terminal constraints are:
bir
bi
Re
=
Re
...
Re
Re
...
Vmin
Vf
qn
q"
Vmin
Vmin
biqc=
qmi
qm1 n ...
bia
0 0 ...
bio
=
0
biy
=
sin(yf)
67
0 0
(4.16)
where the complete vector of lower bounds,
BL,ptc
E R(4M+2), on the path con-
straints is
blr
BL,ptc
ble
blq
blQ
bla
bly
(4.17)
Likewise, the upper bounds on each path constraint are
bur
rmax
rmax
Vmax
Vmax
...
Vmax
V5
buq
qmax qmax
...
qmax
qmax
bua
amax
...
amax
buv
=
buQ =
buy
=
...
amax
rmax
rmax
(4.18)
amax
Qmax
sin(yf)
and the vector of upper bounds on all of the path constraints, BU,ptc
is
[bur
Bu,ptc
bu
bua
buq
buQ buy
E
p(4M+2)
(4.19)
Notice that the remaining terminal conditions pertaining to velocity are imposed
through the bounds on the final speed and the final flight path angle.
4.2.4
Discretization of the Performance Index
The performance index of Eq. (2.28) discussed in Chapter 2 is transcribed to a
summation which produces a scalar F where
F = t5 -k
-
k1
Nm
a
2
)2
+ k2
68
Ua,k
Ua,max/
2
+ k3
uo,m
uomax/
(4.20)
4.3
Common Aero Vehicle Nonlinear Programming Problem
Discretization of the CAV optimal mission design problem results in a nonlinear
progranmming problem. A nonlinear programming problem (NLP) is a problem
where it is desired to minimize or maximize a real-valued nonlinear function of
variables subject to real-valued nonlinear constraints. This section is devoted
to describing the NLP for the CAV mission design problem. First, it is important to understand the components of the NLP in terms of the breakdown of
the constraints. Thus, the NLP is summarized by recognizing both the number
and the type of constraints that comprise the NLP. Second, the structure of the
problem is important in terms of understanding the NLP as well as choosing an
optimization algorithm. Third, the NLP must be scaled properly in order to enhance the performance of the optimizer. In fact, in some cases it is necessary to
appropriately scale the problem in order to even obtain a solution.
4.3.1
Summary of the Common Aero Vehicle Nonlinear Programming Problem
To summarize the NLP resulting from discretization of the CAV optimal control
problem, the dynamic constraints, path constraints, and terminal constraints are
joined to form the following constraint vector C c(12M+2).
C
=
[
Cdc
Cptc
1
(4.21)
Similarly, the lower and upper bound vectors are also combined to form the
vector BL,C E
j(12M+2)
and the vector Bu,c E
69
p(12M+2)
respectively, where the
subscript C is used to denote bounds pertaining to the constraints:
BL,C
=
BL,dc
BL,ptc
(4.22)
Bu,c
=
B,d
BU,ptc
(4.23)
The resulting NLP is to minimize:
N
F
= Ft5
2
-
t
Z wi i
i=0
k1
L
uc,i
+ k2
max
\
amax
± ks
/
"'
\Uu,max
)
2]
(4.24)
over x 0 pt subject to
BL,xopt
s
BL,C
Xopt
s Cs
Bv,xopt
(4.25)
Bu,c
(4.26)
The breakdown of the NLP in terms of the number of optimization variables and
types of constraints is shown below.
# of Optimization Variables
= 10M + 1
# of Linear Equality Constraints
= 10
# of Nonlinear Equality Constraints
= 8M + 2
# of Linear Inequality Constraints
= 3M
# of Nonlinear Inequality Constraints
= 4M + 1
The optimization variables are split as follows: 8M variables correspond to components of the augmented states at the LGL points, 2M variables correspond
to components of the augmented controls at the LGL points, and the remaining
1 variable corresponds to the free terminal time. The 10 linear equality constraints correspond to each of the terminal boundary conditions with the exception of the speed and flight path angle. The nonlinear equality constraints are
comprised of 8M constraints corresponding to the eight discretized differential
equations at the LGL points and the remaining two correspond to the terminal
70
speed and flight path angle. The 3M linear inequality constraints correspond
to the path constraints placed on the angle of attack and the controls (ua, u,)
at the LGL points. The nonlinear constraints correspond to the remaining path
constraints where 4M of the constraints correspond to the radius, speed, sensed
acceleration, and dynamic pressure at the LGL points and the remaining 1 constraint corresponds to the total heat load at the terminal time.
4.3.2
Structure of the Common Aero Vehicle Nonlinear Programming Problem
After defining the NLP, it is useful to define the structure of the NLP. In general,
the more information the optimizer knows about the problem, the better the
optimizer will perform. Define the constraint Jacobian, [Cjac], as
[Cjac]
=
aC
aXopt
(4.27)
where [Cjac] is a (12M + 2) x (10M + 1) matrix. Each column corresponds to
each optimization variable at the LGL points and each row corresponds to each
constraint at the appropriate LGL points. (See Appendix B for a review of vector
differentiation rules used in this thesis.) The structure of the problem is best
described by indicating the dependence of the components of the constraint
Jacobian on the components of the optimization vector using a dependence matrix [Cdep]. If a component of [Cjac] is dependent upon a component of xopt, then
the corresponding element in [Cdep] is assigned the value of unity, otherwise it
is zero. The resulting matrix [Cdep] is a matrix of ones and zeros commonly
referred to as the dependency pattern. In the case of trajectory optimization
problems, the dependence matrix is a sparse matrix, i.e. a large percentage of
the individual derivatives of the nonlinear constraints with respect to the optimization variables are zero. The sparsity pattern for the CAV mission design
problem is shown in Fig. 4-1 where rows Cx-C, correspond to the dynamic con-
71
straints, rows labeled r-a correspond to the path constraints, and Q and y are
the terminal constraints. The sparsity pattern is partitioned into three main sections: the first partition corresponds to the discretized dynamic constraints, the
second partition corresponds to the discretized path constraints, and the third
partition corresponds to the terminal constraints. The dynamic constraints section can be partitioned even further into a block that depends only on the state
decision variables, a block that depends only on the control decision variables,
and a block that depends only on the final time. The block that depends only on
the state decision variables has blocks of size M x M along the main-diagonal
that result due to the dependence of the discretized differential equations on
the differentiation matrix. The off-diagonal blocks are either the M x M zero
matrix or the M x M identity matrix. The block that depends only on the control
decision variables also consists of either the M x M zero matrix or the M x M
identity matrix while the block that depends only on the final time is a column
of ones. The discretized path constraints have dependencies similar to the block
in the discretized dynamic constraint partition that depends only on the control
decision variables. Finally, the terminal constraints depend on their respective
state decision variables at the final LGL point and, in the case of the total heat
load constraint, the final time as well.
4.3.3
Scaling of the Conunon Aero Vehicle Nonlinear Programming Problem
In the extreme situation, a poorly scaled problem may prevent the optimizer
from even obtaining a solution and at the very least, it can negatively affect the
performance of the algorithm. In particular, scaling can change the convergence
rate, termination tests, and numerical conditioning [2]. A well-scaled problem
will be much better behaved numerically. One of the basic guidelines in determining appropriate scaling factors is to make every state and control variable
about the same order of magnitude and as close to unity as possible. One set of
72
x
y
z
V,,
v,
V,
Of
O-
u
u,,
t,
C
CA
CzI
C
CLIv
CUz
C,,
r
V
q
a
Q
'Y
Figure 4-1: Sparsity Pattern of the Common Aero Vehicle Nonlinear Programming
Problem
scale factors that leads to a well-scaled NLP for the CAV mission design problem
are as follows:
Units of Length:
Units of Time:
Units of Density:
Earth Radii
Period of a Spacecraft in Circular Orbit at One Earth Radii
Air Density at Sea Level
Since the flight of the vehicle is restricted to the Earth's atmosphere, scaling the
position by Earth radii makes the scaled position 0(1). The scale factor for time
is chosen such that the scaled velocity 0(1) where the velocity is scaled by the
term that results from dividing the units of length by the units of time. Similarly,
73
scaling the density of the atmosphere by the sea level density results in density
values close to unity. From these three base values, a canonical transformation
is used to convert all other values from one set of consistent units to another
set of consistent units. In particular, the canonical transformation used in this
thesis converts values from SI units to a set of nondimensional values with a
magnitude close to unity. The following useful nondimensionalizing constants
are derived in order to maintain a canonical transformation:
nlength
=
nime
=
ndensity
=
1e(4.28)
(4.29)
Po
(4.30)
(4.31)
-ength
nvelocity
ntime
nmass
nforce
ndensitynlength
4
fdensitynlength
2-
-
(4.33)
ntime
nenergy
nforcenlength
(4.34)
and nondimensional angles are represented in radians. In order to nondimensionalize a quantity, simply multiply it by the corresponding scaling factor. Conversely, in order to dimensionalize a nondimensional quantity, divide by the appropriate nondimensionalizing constant. While nondimensional quantities are
used in the optimization algorithm, the results are scaled to dimensional quantities for analysis purposes.
4.4
Numerical Optimization via SNOPT
There are many available software programs capable of solving the resulting
NLP; however, it is desirable to use a computationally efficient and robust method.
The current problem has both linear and nonlinear inequality and equality constraints. Sequential quadratic programming (SQP) methods are designed to han74
dle optimization problems with linear and nonlinear constraints [131. In addition, it is beneficial to use an optimization algorithm that takes advantage of the
sparsity of this problem.
Three well-known SQP numerical optimizers are NPSOL, SNOPT, and SPRNLP.
Both NPSOL and SNOPT were written by Gill, Murray, and Saunders [13, 14, 15]
while SPRNLP is a Boeing code developed by Betts and Frank [1]. NPSOL is very
similar to SNOPT; however, it does not take advantage of the sparsity of the
Jacobian and is not designed to solve large-scale NLPs. The study conducted
in Ref. [1] demonstrates that while SPNRLP solves larger problems in a shorter
period of time, SNOPT is faster for smaller problems. SPNRLP has the advantage of utilizing first and second derivative information versus SNOPT which
only uses first order information. Nonetheless, if given enough time, SNOPT will
solve large complex problems with the same accuracy as SPRNLP. SNOPT is a
dependable SQP method for solving sparse large-scale NLPs.
4.4.1
Description of SNOPT
SNOPT is a general purpose solver for optimization problems that have many
variables and constraints. It minimizes a linear or nonlinear function subject to
bounds on variables and linear or nonlinear constraints. Using a SQP algorithm,
SNOPT solves the NLP by solving a sequence of quadratic programming problems
(QP subproblems). The basic idea is to iteratively solve the problem, each time
working towards the optimal solution. In doing so, the task becomes to find a
direction in which the function approaches a minimum and to determine how
far to move in that particular direction. A series of major iterations and minor
iterationsare completed to determine a search direction while a merit function
is used to determine the step length.
While a complete analysis of SNOPT is not included in the scope of this thesis,
the following discussion is included to introduce the reader to the work required
to solve major and minor iterations. For a more complete explanation please re-
75
fer to Refs. [131 and [151. Initially, SNOPT converts inequality constraints into
equality constraints by introducing slack variables. Then SNOPT enters a major
iteration which generates an iterate of the optimization variables that satisfy the
linear constraints. The search direction for the next iterate is determined by
solving a QP subproblem. Minor iterations correspond to the iterative process
involved in solving the QP subproblem for each major iteration. In doing so,
the nonlinear constraints are linearized by a Taylor series expansion. The QP
subproblem is to minimize a quadratic approximation of a modified Lagrangian
subject to linear constraints and simple bounds on the variables. A reduced
Hessian algorithm is used to solve the QP subproblem where the Hessian is a
matrix of second derivative information used to create the quadratic approximation. A BFGS quasi-Newton approximation of the Hessian is used versus other
algorithms that utilize a full sparse Hessian. After the QP subproblem is solved,
a new estimate of the solution is obtained by completing a line search on an
augmented Lagrangian merit function. The merit function is used to determine
if and how much progress is being made by the algorithm. The line search determines the step length (how far to go in the search direction) in order to produce
the most significant decrease in the merit function. Eventually, this iterative
process will converge to a point that satisfies the first order conditions for optimality.
4.4.2
User Requirements and Options for SNOPT
User requirements in order to run SNOPT consist of creating two subroutines
and supplying an initial guess. One subroutine defines the objective function
and the other defines the constraints as well as the sparsity of the constraint Jacobian. Each must return their respective function values and, optionally, their
respective gradients. SNOPT provides the user with the option of coding as
many or few of the gradients as desired and the remaining derivatives are approximated with finite differences. In fact, SNOPT has the capability of verifying
76
the analytic gradients by comparing these gradients to finite difference approximations obtained via central differences.
Using this capability, the user can
correct any errors made in coding the analytic derivatives. While coding the ana-
lytic derivatives will enhance the performance and increase the reliability of the
optimization algorithm, analytic derivatives are often inconvenient to compute.
As mentioned earlier, an initial guess must be supplied to the optimizer, which
can be a daunting task depending on the problem at hand. Generally speaking,
a good start is to select any feasible point.
The user can control the performance of SNOPT by choosing various options.
Each option has a default setting chosen based off of the norm for most problems. These options include tolerance levels, derivative verification , level of desired output, and both major and minor iteration limits. More detailed options
include information about the QP subproblem, the SQP method, and the Hessian
approximation. For a complete list and description of the options please see the
SNOPT User's Guide [151.
4.5
Numerical Optimization Study
This study involves the setup for numerical optimization, which includes specifying the values used to describe the CAV mission design problem and the values
corresponding to the discretization of the mission design problem as well as the
inputs necessary for the implementation of SNOPT.
4.5.1
Specification of the Required Inputs
Table 4.1 includes all of the particular values used in the CAV mission design
study. Bounds on the optimization vector corresponding to Eqs. (4.2) and (4.4)
are listed in Table 4.2. The upper bounds on the position and velocity components correspond to 1.5 times their initial values respectively and the value of
the lower bounds are simply the negative of the upper bounds.
77
Table 4.1: Numerical Values Used for Numerical Optimization
Mass of the CAV (kg)
Aerodynamic Reference Area (M2 )
687
0.6
CL,
Zero-Lift Drag Coefficient
Drag Polar Parameter
Density at Sea Level (kg/m 3 )
Density Scale Height (m)
Angular Rotation of the Earth (s1)
Radius of the Earth (m)
Gravitational Parameter (m 3/s 2 )
Heating Rate Constant (W/m 2 )
0.043
1
1.225
6914
7.29x105
6378145
3.986x 1014
1.9987x108
The boundary conditions used in this analysis were taken from Ref. [261 and
are as follows:
to
=
0
x(to)
=
6415145 m (=37 km in altitude)
y(to)
=
0m
z(to)
=
0m
vx(to)
=
Om/s
vy (to)
=
7137.9 m/s
vz(to)
=
0m/s
x(tf)
5773486 m
y(tf)
2710645 m
z(tf)
=
Om
V(tf)
=
1219 m/s
y(tf)
=
-89.9 deg
a(tf)
=
0deg
These boundary conditions correspond to a terminal state approximately 2800
km downrange from the initial position in the initial Earth relative trajectory
plane. While the terminal velocity of the vehicle should actually be orthogonal
78
Table 4.2: Numerical Values for the Bounds on the Optimization Variables
Variable
Lower Bound
(m)
y (m)
z (M)
vx (m/s)
vy (m/s)
vz (m/s)
ot (deg)
6
-9.6227x10
-9.6227x10 6
-9.6227 x106
-10706.85
-10706.85
-10706.85
0
Upper Bound
9.6227x10 6
9.6227x10 6
9.6227x106
10706.85
10706.85
10706.85
25
o (rad)
-67T
6r
ua (deg/s)
u. (deg/s)
-10
-30
10
30
tc (s)
0
5000
x
to the plane tangent to the point of impact, (thus requiring a terminal flight path
angle of -90 deg), the unit lift direction of Eq. (2.18) chosen for this study is
undefined when y = -90 deg. Therefore, a terminal flight path angle of -89.9
deg is chosen in order to obtain results that are similar to those that would be
obtained for yf
=
-90 deg.
Bounds on the path constraints introduced in Eqs. (4.16) and (4.18) are listed
in Table 4.3 where go is the gravitational acceleration at sea level. At this point,
Table 4.3: Numerical Values for the Bounds on the Path Constraints
Path Constraint
r (m)
V (m/s)
Lower Bound
Re
10
Upper Bound
9.6227x10 6
10706.85
q (kPa)
11.97
0o
a (go)
Q (MJ/m 2 )
0
45
-00
00
sin(yf)
-1
-1
the total heat load the vehicle can sustain is unspecified and thus the heat load
is actually unconstrained. However, using the optimizer as a design tool, a parametric study presented in Chapter 5 will analyze the affects on the trajectory
79
and control from varying the maximum allowable heat load.
The choice of weighting factors used in the performance index is not obvious
prior to running the optimizer. Consequently, these values are varied and the
results are compared to determine the values that reflect the most desired characteristics of the trajectory and control. The corresponding study and results
are presented in Section 4.5.3.
In terms of the inputs required to implement SNOPT, the information provided in Section 4.2 along with the specific values listed in Tables 4.1-4.3 are
used to create the subroutines. The initial guess supplied to SNOPT varies depending on the specific case being run. A discussion of the choice of initial guess
is given with the description of each case. In addition to the required inputs, the
user must make three major decisions. First, the user must decide if the gain in
speed and accuracy is worth the work required to compute and input the analytic
constraint Jacobian and objective gradient. In this application, the analytic objective gradient and constraint Jacobian are computed analytically and are given
in Appendix C. The analytic derivatives are verified using SNOPTs derivative verifier (as described earlier). Second, the options must either be tailored to the
problem at hand or left at the default settings. For this study, the options corresponding to the limits on the total number of iterations, the number of major
iterations, and the number of minor iterations were changed from their default
values. Table 4.4 indicates the value to which each of these options were set as
well as the default setting (note that m is the number of constraints in the NLP).
Third, the number of nodes must be determined in order to produce accurate
Table 4.4: Options Set in SNOPT
OPTION
Iteration Limit
Major Iteration Limit
Minor Iteration Limit
SETTING
1000000
1000000
1000000
DEFAULT SETTING
max(10000,20m)
max(1000,m)
max(1000,5m)
results without sacrificing computational effort and time. This value is also un-
80
known prior to running the optimization algorithm and is problem dependent.
Section 4.5.2 includes the analysis used to determine the appropriate number of
nodes for this particular application.
4.5.2
Determination of an Adequate Number of Nodes
The number of nodes used to solve the NLP arising from the Legendre Pseu-
dospectral Method discretization directly affects the accuracy of the discrete approximation to the continuous time optimal control problem. An infinite number
of nodes will theoretically produce the most accurate solution. However, the efficiency of the optimizer decreases as the number of nodes increases. Therefore,
the process of determining an adequate number of nodes for a given problem involves a trade-off between the desired solution accuracy and the time required to
obtain a solution to the NLP. An adequate number of nodes for the CAV mission
design problem is determined by comparing results from using 25, 50, 75, and
100 nodes. In terms of solution accuracy, the smoothness of the control profile
is considered along with the accuracy of the controls. While the time it takes the
optimizer to solve each case is also considered, more emphasis is placed on the
accuracy of the results. Nonetheless, the resulting control profile is weighted
against the solution time in order to determine an appropriate number of nodes
to use for this study. Please see Appendix D for a description of the initial guess
used to obtain these solutions.
Effects of the Number of Nodes on the Control Profile
The smoothness of the control profile is assessed visually by observing the control profile along with the angle of attack and bank angle profiles. Even though
the angular rates are the control in the optimal control problem, the angle of
attack and bank angle are actually used to steer the vehicle. As a result, the
smoothness of the angles is more important than the smoothness of the angular
rates. Figures 4-2 and 4-3 clearly indicate that 25 nodes is not enough to obtain a
81
smooth control. Therefore, 25 nodes is no longer considered and a comparison
is made between 50, 75, and 100 nodes.
Accuracy of the controls is assessed by integrating the equations of motion
to Earth impact (altitude=0) and comparing the resulting error in the terminal
state. The same dynamic model used in the optimization algorithm is used in
the numerical integration and the controls are approximated by Lagrange interpolation. Integration of the equations of motion is carried out using a 4t4
order Runga-Kutta routine with a constant stepsize of h = 0.001s. The position
of the vehicle is completely described in a plot of altitude versus time and the
Earth relative crosstrack distance versus the Earth relative downtrack distance.
The Earth relative crosstrack and downtrack distance are defined in Appendix
E. Since the terminal condition in the integration eliminates the possibility of a
terminal error in altitude, the altitude profile is not shown. The crosstrack versus downtrack plots shown in Figures 4-4-4-6 may mislead the observer to think
that the integrated solution matches the Legendre pseudospectral solution. In
reality, there are differences, but they appear negligible in terms of the distance
the vehicle is traveling. The same holds true in the plot of speed versus time for
all three cases as shown in Figs. 4-7-4-9.
82
4-d
C-4
100
0
200
400
300
500
600
700
Time (s)
Figure 4-2: Angle of Attack vs. Time for M=(25, 50, 75, 100)
S)
to
'0
100
200
300
400
500
600
Time (s)
Figure 4-3: Bank Angle vs. Time for M=(25, 50, 75, 100)
83
700
-. Integrated soln, h=0.(X)1i
35 0 -
-.
--
-
-
- - - -. - -.-
-..
-.-.-.-.-.-.-
-
-.-.- -.-.--
-.-.-
.- .
250-
200 -
- -
---
--
---
U
aV
> 150-
;-4
50
--
--
-
0
500
1(0
1500
2000
2500
Earth Relative Downtrack Distance (km)
---
--
3000
Figure 4-4: Earth Relative Downtrack Distance vs. Earth Relative Crosstrack Distance for 50 Nodes
500
-8-
-.-.
-
400
LPS soin, N= 75
Integrated soln. h-0.001
----
- --
300
200-
100-
0
500
1000
1500
2000
2500
Earth Relative Downtrack Distance (km)
300
Figure 4-5: Earth Relative Downtrack Distance vs. Earth Relative Crosstrack Distance for 75 Nodes
84
CeI
;0
Cn
C
W-
-1001
1
0500
1
1
1
1000
1500
20'00
2500
3oo
Earth Relative Downtrack Distance (km)
Figure 4-6: Earth Relative Downtrack Distance vs. Earth Relative Crosstrack Dis-
tance for 100 Nodes
Ct
S
-0
a)
a)
C.fl
a)
Ce
a)
F-
Ce
0
100
200
30
400
50
60
Time (s)
Figure 4-7: Earth Relative Speed vs. Time for 50 Nodes
85
7(X)
0
100
200
300
40()
500
600
70()
Time (s)
Figure 4-8: Earth Relative Speed vs. Time for 75 Nodes
8000
6000
5000
~4000
3000
0
100
200
300
4(X)
500
600
Time (s)
Figure 4-9: Earth Relative Speed vs. Time for 100 Nodes
86
700
In order to properly assess the accuracy, the error in terminal position and
speed is calculated. The error in position and speed is calculated by taking the
square root of the sum of the squares of the differences between the integrated
solution and the Legendre pseudospectral solution. Let the subscript "LPS" denote the solution obtained from the optimizer at the last LGL point and the
subscript "INT" denote the results from integration to Earth impact. The terminal position error epos and the speed error espeed are then determined using the
following equations:
epos
espeed
=
(XLPS
XINT)
-
(Vx,LPS
-
2
± (YLPS -
Vx,INT)
2
+
y.NT)
2
+
(ZLPS -
(Vy,LPS - Vy,INT)
2
ZINT)2
+ (Vz,LPS
-
(4.35)
Vz,NT)2
(4.36)
Table 4.5 shows the results from integration using the control histories attained
for 50, 75, and 100 nodes. It is seen that the accuracy in position improves
significantly as the number of nodes increases. On the other hand, the speed
accuracy is virtually unaffected by the number of nodes used.
Table 4.5: Terminal errors produced by integration for M = (50, 75, 100)
No. of LGL Points (M)
Position Error (m)
Speed Error (m/s)
50
75
100
220.0763
40.2454
0.6722
9.53003
6.8386
5.9902
Effect of the Number of Nodes on the Solution Time
One last mode of comparison is the solution time, which is highly dependent
upon the type of machine used to run the optimization algorithm. The solution
time for the 50 node case is 768.22 seconds, the 75 node case took 2653.62
seconds, and the 100 node case took 3378.91 seconds. Using 50 nodes significantly reduces the computational time to solve the problem and the difference
in solution times between the 75 and 100 node cases is small in comparison.
87
Summary of the Results from Varying the Number of Nodes
In determining an adequate number of nodes for the CAV mission design problem, the results from using 25, 50, 75, and 100 nodes were compared. The accuracy of the control profile was compared along with the solution time. Looking
at the smoothness of the control profiles, it was immediately apparent that 25
nodes is not adequate for solving this problem. As to be expected, the 100 node
case produced the most accurate results; however, it also required the longest
amount of time to obtain a solution. Similarly, the use of 50 nodes significantly
reduced both the accuracy of the solution and the solution time. While the 75
node case fell in between the 50 node and 100 node cases, the results from this
case were closer to the 100 node case in terms of the control profile and solution time. Recall that the terminal conditions for HDBTs (described in Chapter 1)
require position accuracy to within several meters and speed accuracy to within
500 m/s. While the speed accuracy requirements were satisfied in all three cases,
the position accuracy requirements were only satisfied in the case of 100 nodes.
In this analysis, the solution accuracy was more important than the time it takes
to obtain a solution. Consequently, all the results presented in the remainder of
this chapter will be shown for M = 100 (i. e. 100 Nodes).
4.5.3
Choice of Weighting Factors Used in the Performance Index
It is crucial that the trajectory be robust to environmental perturbations, especially near the end of flight. A robust trajectory is one in which the terminal
conditions are met despite unpredictable conditions experienced during an actual flight. The CAV used in this thesis has limited control authority. Thus, it is
desirable to maintain control flexibility in order to compensate for unexpected
disturbances. For instance, by keeping the angle of attack away from its upper
and lower limits, it is possible to either increase or decrease the angle of attack
in the presence of an uncertainty (e.g. a windgust, a thicker than predicted atmo88
spheric density, or a thinner than predicted atmospheric density). Furthermore,
control flexibility is maintained by keeping the controls in the middle of their
respective corridors. Consequently, the performance index keeps the angle of
attack as close to the middle of its corridor (61) as possible. However, the angle
of attack reaches a maximum near the end of flight in order to meet the terminal
conditions on speed and flight path angle. This dramatic increase arises from
the need to deplete speed over a short period of time and obtain a large and
negative flight path angle. In order to decrease the speed of the vehicle, the drag
must increase and thus the angle of attack increases. The terminal flight path
angle requires that the vehicle approaches the target with negative lift. Since
the angle of attack cannot be negative, the only way to generate negative lift on
the vehicles is to rotate the vehicle 180 degrees. The amount of lift required to
execute this maneuver causes the angle of attack to increase as well. The an-
gle of attack then rapidly decreases to its prescribed terminal condition of zero
degrees. In order to maintain flexibility in the angle of attack near the end of
flight, the maximum angle of attack should be minimized. Redefining the control margin, the goal is to keep a near 61, to minimize 0(max, and to minimize uO
and u,.
Recall that ki corresponds to keeping a near 6z, k 2 corresponds to the angle
of attack rate, and k 3 corresponds to the bank angle rate. The weighting factors
in the performance index are selected according to the goal of maximizing the
control margin and minimizing the control rates. In determining the appropriate
values for k 1 , k 2 , and k 3 , each parameter is varied independently. The parameter
being varied takes on the values of 0.1, 1.0, 10, and 100 while the remaining two
parameters are set to 1.0. The initial guess used to obtain results is the solution
to the 100 node case described is Section 4.5.2.
Effects on the Control Margin due to Variations in ki
Increasing k 1 places more emphasis on keeping the angle of attack near 6(. However, as mentioned earlier the angle of attack increases near the end of flight. In
89
order for the angle of attack to increase while more emphasis is placed on keeping the angle of attack near &, the angle of attack remains near 6 for as long as
possible. By delaying the increase in angle of attack, the controls are forced to
decrease the angle of attack to zero over a shorter time interval. Consequently,
a remains closer to & for a longer period of time and the angle of attack rate
increases as ki increases. In order to generate an adequate amount of lift to rotate the vehicle in a shorter period of time, amax and u, must also increase near
the end of flight as ki increases. Figures 4-10 and 4-11 show that the maximum
angle of attack and the angle of attack rate increases slightly at ki = 10 and
noticeably for ki = 100. The same is true for the bank angle rate as indicated in
Fig. 4-12. Also, the angle of attack rate reaches its minimum value at the end of
flight for the values of ki = 1.0, 10, and 100.
25
20
CZ
15
0410
5
0'
0
100
200
400
300
500
600
700
Time (s)
Figure 4-10: Angle of Attack vs. Time for k1 = (0.1, 1.0, 10, 100), k 2 = k3 = 1.0
90
-2-
U
-4
0
-6-
kl=
kl=
-Ukl=
A8 kl=
1
-10
0
1 30
S-8 -+*
V
200
300
400
500
700
-600
Time (s)
Figure 4-11: Angle of Attack Rate vs. Time for ki = (0.1, 1.0, 10, 100), k 2 = k=
1.0
6
2
_2
-4
--BA-
-6
0
kl=
kl=
kl=
kl=
10 0
200
400
300
500
600
700
Time (s)
Figure 4-12: Bank Angle Rate vs. Time for ki = (0.1, 1.0, 10, 100),
91
k2
= k3
=
1.0
Effects on the Control Margin due to Variations in k 2
Increasing k2 increases the emphasis on minimizing the angle of attack rate. In
order to generate the same amount of lift using a slower angle of attack rate,
the deviation of of from 6 increases as shown in Fig. 4-13. In regards to the
maximum angle of attack, Fig. 4-13 also indicates that the smallest maximum
angle of attack is obtained for k2 = 10 while the largest maximum angle of attack
is obtained for k2 = 100. Figure 4-14 clearly shows that increasing k2 decreases
the magnitude of the maximum angle of attack rate and the angle of attack rate
reaches its minimum value at the end of flight for the values of k2 = 0.1 and
1.0. Fig. 4-15 shows that the bank angle rate appears to be unaffected by the
variation in k2 .
25
20
U
15
0t
4-j
10
100
200
400
300
500
600
700
Time (s)
Figure 4-13: Angle of Attack vs. Time for k2 = (0.1, 1.0, 10, 100), ki = k 3 = 1-0
92
Q-)
4-4
0
a-)
300
700
400
Time (s)
Figure 4-14: Angle of Attack Rate vs. Time k 2 = (0.1, 1.0, 10, 100), ki = k3 = 1.0
-4
a)
0
100
200
300
400
500
600
700
Time (s)
Figure 4-15: Bank Angle Rate vs. Time for k 2
93
=
(0.1, 1.0, 10, 100), ki = k3 = 1-0
Effects on the Control Margin due to Variations in k 3
Increasing k 3 places more emphasis on minimizing the bank angle rate. Looking
at Figure 4-16, the weighting on the bank angle rate appears to only affect the
angle of attack profile in the case where k3
scenario (k3
=
=
100. It is obvious that in this
100) the deviation of the angle of attack from
et
increases and
the angle of attack reaches its upper limit. The angle of attack rate reaches its
maximum value at the end of flight for every value of k3 , as indicated in Fig. 417. As to be expected, Fig. 4-18 confirms that as k3 increases, the bank angle
rate decreases throughout the trajectory.
25
-4-
y
k3= 0.1
k3= 1.0
-UA
k3= 10
k3= 100
20-
-
-
- -
to
0
100
200
300
500
400
600
700
800
Time (s)
Figure 4-16: Angle of Attack vs. Time for k 3
94
=
(0.1, 1.0, 10, 100), ki = k2 = 1-0
-2
-4
4'-
0
-6
-8
0
100
500
400
300
200
600
700
Time (s)
Figure 4-17: Angle of Attack Rate vs. Time for k3 = (0.1, 1.0, 10, 100), ki = k2
1.0
1.5
S 0.5
01
<-0.5
2
-1
-1.5
-2'0
100
200
300
400
500
600
700
800
Time (s)
Figure 4-18: Bank Angle Rate vs. Time for k 3 = (0.1, 1.0, 10, 100), k 1 = k2 = 1.0
95
Summary of the Results from Varying the Weighting Factors in the Performance Index
The performance index is constructed such that the optimal control and trajectory maximize the control margin. The control margin is measured by three
terms. The first term keeps the angle of attack in the middle of its corridor,
the second term minimizes the angle of attack rate, and the third term minimizes the bank angle rate. The desired trajectory and control is such that each
of these terms is minimized. Analyzing these results according to the desired
performance, each term in the cost functional is evaluated separately (without
the weighting factor) as depicted below
N
Term1
=2
i=0
N 1
Term2
=
Term3
=
_
2
Ofmax /2
2
(Ua,)
i=0 Uoa,max
2
N
i=0
20-,max)
(4.37)
Furthermore, the angle of attack profile is such that the angle of attack reaches
a maximum value near the end of flight. Near the end of flight it is crucial to
maximize the control margin which corresponds to minimizing the maximum
angle of attack. Thus, the maximum angle of attack is also considered. The
overall performance of the vehicle resulting from varying the weighting factors
is assessed by considering all three terms defining the control margin as well as
the maximum angle of attack. Consequently, the overall performance is assessed
by summing the values of each term in addition to the maximum angle of attack.
Table 4.6 summarizes the results from varying the parameters in terms of each
of these values for each of the cases and the last column is a summation of the
preceding values in each row. The only undesirable case is the last one where
ki = k2 = 1 and k 3 = 100 because the angle of attack reaches its upper limit.
However, the case where ki = k 2 = 1 and k 3 = 0.1 produces slightly more
96
desirable results. Consequently, these values are used for in the remainder of
this thesis.
Table 4.6: Results from Varying the Weighting Factors (ki, k 2 , k 3)
ki
0.1
k2
k3
Termi
1
1
10
1
10
0.1
10
1
0.0048
0.0025
0.0023
0.0023
0.0024
0.0025
0.0047
0.0146
0.0025
100
1
1
1
1
1
1
1
1
4.6
10
100
1
1
1
1
1
1
1
1
I 0.11
10
100
Term2
0.0014
0.0020
0.0023
0.0026
0.0022
0.0020
0.0014
0.0011
0.0020
Term3
4.3919 x 104
4.6944 x 104
5.894 x 104
0.0017
4.6589 x 104
'I
0.0025
I
0.002
0.0026
0.0084
0.0020
0.0031
4.6944 x 104
4.9231 x 104
5.3504 x 104
5.3961 x 104
4.6944 x 104
4.188 x 104
2.633 x 10-4
amax
Total
19.8101
19.8161
18.9726
20.2372
20.9097
18.9676
20.2320
20.9031
20.1295
18.9676
20.1346
18.9726
19.7476
19.7533
22.0621
22.0783
18.8591
18.8541
18.9676
19.3337
25
4
18.9726
19.3387
25.0118
Summary of the Numerical Optimization Study
Applying the Legendre Pseudospectral Method to the Common Aero Vehicle optimal mission design problem results in a nonlinear programming problem. It is
important to identify the sparsity pattern of the NLP and to scale the NLP properly in order to improve the performance of the optimizer. The CAV optimal
control problem has a sparse nonlinear constraint Jacobian. SNOPT is designed
to handle problems with sparse nonlinear constraint Jacobians and thus it is the
optimization algorithm used to solve the NLP. Once the optimal control problem
is discretized to form the NLP and the optimization algorithm was chosen, the
specific values pertaining to the NLP and the options in SNOPT used in the optimization study were listed. A study was then conducted in order to choose an
appropriate number of nodes in terms of the accuracy of the solution and the
time required to obtain a solution. Results were obtained using 25, 50, 75, and
97
100 nodes and it was shown that 100 nodes was the only case that produced
results which met the accuracy requirements set forth by the CAV mission de-
sign problem. Using 100 nodes, another study was conducted to determine the
choice of weighting factors in the performance index that produced the most desirable trajectory and control. In this case, it was desirable to design a trajectory
that is robust to environmental disturbances. A robust trajectory corresponds
to one in which the vehicle flies in the middle of its control capabilities. Thus,
the control margin was used as a metric for quantifying the desirability of the
trajectory and control. The control margin was assessed in terms of keeping the
angle of attack in the middle of the corridor, minimizing the maximum angle
of attack, and minimizing the control rates. One of the weighting factors was
varied while the remaining two factors were held constant. It was found that
setting ki = k2 = 1 and k 3 = 0.1 maximized the actual control margin and thus
these values are used in the remainder of this thesis. Furthermore, upon the
completion of this analysis it is evident that the value of the performance index
reflects the control margin. In particular, the goal is to minimize the performance index, which maximizes the control margin. Thus, the smaller the value
of the performance index, the larger the control margin and vice versa.
98
Chapter 5
Parametric Optimization Study of the
Common Aero Vehicle Problem
5.1
Overview
This chapter focuses on the general behavior of the Common Aero Vehicle as
well as the response of the solution from the optimization to changes in parameters. Since the CAV is a new concept, it is important to first understand the
motion of the vehicle during flight. Therefore, the first step is to identify the
key features of the trajectory and control. Utilizing the optimization setup as
a design tool, parameters are then varied and the differences in the trajectory
and control profiles are determined. As discussed in Chapter 4, the performance
index is a measure of the amount of control margin. Since the goal is to produce
a solution that maximizes the control margin, the value of the performance in-
dex is used as the metric to compare the quality of the solutions obtained for
different values of the parameters. In particular, the quality of the solution is
compared for different values of minimum allowable dynamic pressure, maximum allowable stagnation point heat load, and maximum lift-to-drag ratio of
the vehicle.
99
5.2
Key Features of the Trajectory and Control
The solution used to identify the key feature of the trajectory and control pertains to 100 nodes, ki = k2 = 1, k 3 = 0.1, qmin = 11.97 kPa, Qmax = oo, and
(L/D)max ~ 2.4. The first key feature of the optimal trajectory is the behavior
of the altitude as shown in Fig. 5-1. It is seen from Fig. 5-1 that the altitude increases twice during flight. Initially, the altitude increases to a region where the
atmospheric density is small, thus allowing the vehicle to achieve the required
range of 2800 km. The subsequent decrease in altitude increases the dynamic
pressure in order to produce enough lift to rotate the vehicle. The final decline
in altitude reduces the speed to meet the specified terminal speed of 1219 m/s
and satisfy the required range. In order to maintain control authority, a constraint is placed on the minimum allowable dynamic pressure. The points where
the altitude attains a local maximum correspond to points where the dynamic
pressure constraint is active as seen in Fig. 5-2.
0
2
4
6
8
10
The second key feature of the
12
14
16
Energy (GJ)
Figure 5-1: Altitude vs. Energy for M=100, ki = k2 = 1,k 3 = 0.1
100
18
10 0
-7
1 0 -Altitude
Attains
Local Maximum
4-0
Dynamic Pressure
Constraint Active
0
100
200
400
300
500
600
700
Time (s)
Figure 5-2: Altitude and Dynamic Pressure vs. Time for M=100, ki
=
1,k 3
0.1
optimal trajectory is the behavior of the in-plane out-of-plane motion. Figure 5-3
shows the Earth relative crosstrack distance versus the Earth relative downtrack
distance traveled by the vehicle (see Appendix E). It is seen that the vehicle steers
out of plane close to 410 km and actually approaches the target slightly from
behind. The third key feature of the results is the behavior of the optimal angle
of attack. It is seen from Fig. 5-4 that, because of the desire to minimize the
performance index, the angle of attack remains near 6 = 11.9 deg throughout a
large portion of the trajectory. The dramatic increase and decrease in angle of
attack at the end of flight arises from the need to meet the terminal conditions
on speed, flight path angle, and angle of attack. The increase in angle of attack
near the end of flight arises from the need to deplete speed over a short period
of time and obtain a large and negative flight path angle. In order to decrease
the speed of the vehicle, the drag must increase and thus the angle of attack
increases. Attaining the terminal flight path angle requires negative lift. Since
101
-~
-
INUU
k 4=
1.0k2=
1.0k3= 0.1
350
Q
300250
-
-
0 200
150-
-
100S500
0
500
1000
1500
2000
2500
3000
3500
Earth Relative Downtrack Distance (km)
Figure 5-3: Earth Relative Crosstrack Distance vs. Earth Relative Downtrack Distance for M=100, k 1 = k2 = 1,k3 = 0.1
the angle of attack must remain positive, the only possible way to generate a
sufficient amount of negative lift is to roll (bank) the vehicle -180 degrees and
increase the angle of attack. Once the vehicle is oriented properly, the angle
of attack decreases rapidly in order to meet the required terminal condition of
zero degrees. The last key feature of the optimal trajectory is the behavior of the
bank angle. Fig. 5-5 shows that the vehicle is banked to -90 deg for roughly 200
seconds of flight. When the bank angle is -90 deg, there is no vertical component
of the lift direction. The vehicle flies with this orientation in order to decrease
the altitude of the vehicle. It is also seen in Fig. 5-5 that the bank angle is -180
deg when the vehicle reaches the target. As mentioned earlier, the restrictions
on the angle of attack in combination with the terminal condition on the flight
path angle (-89.9 deg) require that the vehicle fly upside-down as it approaches
the target. Furthermore, lower in the atmosphere the forces acting on the vehicle are greater. Larger forces acting on the vehicle results in a high bank angle
102
18
161412108
4
20
100
200
300
400
500
700
600
Time (s)
Figure 5-4: Angle of Attack vs. Time for M=100, k 1 = k 2
1,k 3
=
0.1
rate as the vehicle rotates to -180 deg. Since the performance index minimizes
the bank angle rate, the vehicle ascends to a higher altitude before rotating. The
altitude is increased by decreasing the magnitude of the bank angle. In this case,
the bank angle decreases in magnitude to 50 deg from -90 deg before rotating
the vehicle over.
103
0
100
200
300
400
500
600
Time (s)
Figure 5-5: Bank Angle vs. Time for M=100, ki = k2 = 1,k 3 = 0.1
104
700
5.3
Effects of Dynamic Pressure on the Trajectory and
Control
A minimum dynamic pressure constraint is added in order to keep the vehicle
from exiting the Earth's atmosphere thereby maintaining aerodynamic control.
In order to assess the affect of the minimum allowable dynamic pressure on
the resulting trajectory and control, the minimum allowable dynamic pressure
is varied between 11.97 kPa (250 psf) and 47.88 kPa (1000 psf) while Qmax = co
2.4) are held constant. The initial guess is the solution for the
and (L/D)max
case where M
=
100, ki = k2
=
1, k 3 = 0.1, qmin = 11.97 kPa, Qmax = oo, and
(L/D)max ~ 2.4.
In terms of trajectory characteristics, it is known that the dynamic pressure
is a function of density, which is a function of the altitude, and speed. While
the initial and final altitude is specified in the boundary conditions, the altitude
is free to increase and decrease throughout flight. However, the local maximum
in altitude are constrained by the minimum allowable dynamic pressure. The
initial and final speed is specified as well, but contrary to the altitude, the speed
of the vehicle can only decrease during flight. The only way to increase the dynamic pressure without increasing the speed is to decrease the altitude of the
vehicle. However, the vehicle must meet the range requirements for the terminal conditions which forces the altitude to increase in the beginning of flight.
As depicted in Figs. 5-6 and 5-7, these conflicting trends result in the following
trade-off: as the minimum dynamic pressure increases, the initial increase in
altitude decreases and the speed depletes at a slower rate. Note that regardless
of the constraint on the dynamic pressure, the vehicle reaches 20 km in altitude
with nearly the same amount of energy and the remaining energy is dissipated
in the same manner for all values of qmin.
In regards to the lateral motion of
the vehicle, it is seen in Fig. 5-8 that the crosstrack distance varies as the minimum allowable dynamic pressure decreases. Fig. 5-8 also shows that the vehicle
approaches the target further from behind as the dynamic pressure constraint
105
60
50 -
40-
"030 -
20 -
.. . . . .
-..
.
- - qmin= 11.97kPa
...
10-
-UA
0
0
2
4
6
8
10
12
qmin= 23.94 kPa
qmin= 35.91 kPa
qmin= 47.88 kPa
14
16
18
Energy (GJ)
Figure 5-6: Altitude vs. Energy for qmin = (11.97, 23.94, 35.91, 47.88) kPa
is lowered.
In terms of the effects of the dynamic pressure on the control, the control
profile is considered along with the value of the performance index. Referring to
the angle of attack proffle in Figure 5-9, the maximum angle of attack increases
slightly and the deviation of the angle of attack from d&increases as the minimum
dynamic pressure increases. As to be expected, Fig. 5-10 shows that the value
of the performance index increases as the minimum allowable dynamic pressure
increases. Recall that the performance index is a measure of the control margin
and the smaller the value of the performance index, the larger the control margin. Thus, increasing the minimum allowable dynamic pressure decreases the
control margin.
To summarize, the purpose of including a constraint on the minimum allowable dynamic pressure is to prevent the vehicle from skipping out of the atmosphere and to maintain control authority. As to be expected, as the minimum
allowable dynamic pressure was increased, the maximum altitude reached de106
8000
7000
P
..
-
.
6000 -
-
5000 -
-
-..
4000 -..
44
2000 --
-
-
S3000 -
-4-
qmin= 11.97 kPa
qmin= 23.94 kPa
-UA
qmin= 35.91 kPa
qmin= 47.88 kPa
1000
0
100
200
300
-
500
400
600
700
Time (s)
Figure 5-7: Earth Relative Speed vs. Time for qi
kPa
=
(11.97,23.94,35.91,47.88)
creased. However, as the minimum allowable dynamic pressure was increased,
the control margin decreased. Since in each case the vehicle did not exit the
Earth's atmosphere, the case which maximized the control margin is desired.
Thus, it is beneficial to design a vehicle that can be controlled at higher altitudes
(i.e. at a lower minimum dynamic pressure constraint).
107
500
qmin= 11.97 kPa
-*-
V
450
A
400
V
qmin= 23.94 kPa
qmin= 35.91 kPa
qmin= 47.88 kPa
-U-
350-c-I 300
/
250
-
II
/
0
U 200
a)
150
-
.
.
-.
-
--
I
/
-
a)
-
100
...
.
--
50
.. . . .. . . . .. . .
500
1500
1000
2500
2000
- 3000
3500
Earth Relative Downtrack Distance (km)
Figure 5-8: Earth Relative Crosstrack Distance vs. Earth Relative Downtrack Distance for qmin = (11.97,23.94, 35.91,47.88) kPa
25
20(
-4y
-u-A
qmin= 11.97 kPa
qmin= 23.94 kPa
qmin= 35.91 kPa
qmin= 47.88 kPa
0
-.--
-
15
-.
-...
-.
10
-.-.-.-.-
-
-.
to
. .. .. ... .... .
. .. .
. . . ..
.
. ..
--
5
0 O0
I
I
I
100
200
300
I
400
I
I
500
600
700
Time (s)
Figure 5-9: Angle of Attack vs. Time for q
108
=
(11.97,23.94, 35.91,47.88) kPa
5.8
x 10-
5.6 -
5.4-
-
5.2 -
5
- ---.- -
0
.
4.8
Sqmin= 11.97 kPa
V qmin= 23.94 kPa
* qmin= 35.91 kPa
A qmin= 47.88 kPa
4.6 -
4.4 10
15
20
25
30
35
40
45
50
qmin (kPa)
Figure 5-10: Value of the Performance Index vs. Minimum Allowable Dynamic
Pressure for qmi = (11.97,23.94, 35.91,47.88) kPa
109
5.4
Effects of the Stagnation Point Heat Load on the
Trajectory and Control
The maximum allowable stagnation point heat load that the vehicle can withstand depends on the thermal protection system. The total stagnation point
heat load sustained by the vehicle in the unconstrained case is approximately
2300 MJ/m 2 . In this study, the maximum allowable heat load is varied between
1100 MJ/m 2 and 2300 MJ/m
2
while qmin = 11.97 kPa and (L/D)max ~ 2.4 are
held constant. The initial guess is the solution for the case where M = 100,
ki = k 2 = 1, k 3 = 0.1, qmin = 11.97 kPa, Qmax = oo, and (L /D)max ~ 2.4.
It is seen in Fig. 5-11 that this constraint is active in every case. Furthermore,
tightening the constraint on the maximum allowable heat load results in an increase in the maximum heating rate, as shown in Fig. 5-12. In reference to tra2500
Qmax=
Qmax=
Qmax=
Qmax=
Qmax=
2000
1500F
2300
1900
1600
1200
1 100
MJ/m2
MJ/m2
MJ/m2
MJ/m2
MJ/m2
- - -.-
C 1000 -...
500 F I
0
I
100
200
300
400
500
600
700
800
900
Time (s)
Load
Heat
Total
5-11:
Figure
(1100, 1300,1400,1700,2000,2300) MJ/m 2
vs.
Time
for
Qmax
jectory characteristics, it is known that the heat load is a function of density and
110
18
16 -
Qmax= 1600 MJ/m2
. .. ..
..
~
..-.. ...
.......
.
14 12
Qmax= 1200 MJ/m2
Qmax= I1100 M J/m2 -
-
A-,
-..
. .. .
. .. ..
104
-
4
8O
.. .. .
2 -
.
-
-
.
-
.. . .
..
-
-
---
4--.
0
0
100
200
300
400
500
600
700
800
900
Time (s)
Figure
5-12:
Heating
Rate
vs.
(1100, 1300, 1400, 1700,2000,2300) MJ/m 2
Time
for
Qmax
=
altitude (see Eq. (2.25)). Using the same argument as presented in Section 5.3, the
density is used to control the heat load. Consequently, the density is lowered to
decrease the heat load. In order to decrease the density, the vehicle increases in
altitude to a low density region until the dynamic pressure constraint becomes
active. To relieve the dynamic pressure constraint, the vehicle descends to a
lower altitude while depleting speed. Consequently, as the maximum allowable
heat load is lowered, the vehicle undulates through the atmosphere at a higher
frequency and, initially, the speed is depleted at a faster rate. In fact, the faster
the altitude decreases, the faster speed is depleted.The affects on altitude and
speed for different values of Qm ax are shown in Figs. 5-13 and 5-14. Similar to
the affects of loosening the constraint on dynamic pressure, as the maximum
total heat load is increased, the vehicle approaches the target from further behind. This trend is depicted in Fig. 5-15 which also shows that the vehicle takes
a more direct trajectory and approaches the target from the front in the more
111
constrained cases. The crosstrack distance traveled by the vehicle is a maximum
for Qmax = 1900 MJ/m
2
and is a minimum for Qmax = 1200 MJ/m 2 .
-oL
0
100
200
300
400
500
700
600
800
900
Time (s)
Figure 5-13: Altitude vs. Time for Q m ax = (1100, 1300, 1400, 1700,2000,2300)
MJ/m 2
The effects of the maximum allowable heat load on the control are evident
by looking at the angle of attack profile, the bank angle rate, and the value of
the performance index. The angle of attack is not specified in the initial conditions which allows the optimizer to choose the initial value for the angle of
attack. As shown in Fig. 5-16, the angle of attack reaches its upper limit in the
beginning of the trajectory for Qmax = 1200 MJ/m 2 and Qma = 1100 MJ/m 2 .
Furthermore, amax is a minimum for Qmax = 1100 MJ/m
2
and a maximum for
= 1900 MJ/m 2 . Also evident in Fig. 5-16 is that the deviations of a from
& increase in both magnitude and frequency as Qmax decreases. However, as
Qrnax
the vehicle nears the target, it looses the ability to make any necessary corrections. Hence, the primary concern is maintaining control authority near the end
of the trajectory. Figure 5-17 shows that the bank angle rate reaches its upper
112
8000
--- Qmax= 2300 MJ/m2
V
--A
7000
cj~
Qmax=
Qmax=
Qmax=
Qmax=
-*-
1900 MJ/m2
1600 MJ/m2
1200 MJ/m2
1 100 MJ/m2
S 6000
5000
4000
-
3000
.
..
2000|
1AA
I
0
100
200
300
400
600
500
800
700
900
Time (s)
Earth Relative Speed
Figure
5-14:
(1100, 1300,1400,1700,2000,2300) MJ/m 2
vs.
Time
for
Qmax
and lower limits both in the beginning and at the end of flight in the case where
Qmax = 1200 MJ/m 2 and Qma = 1100 MJ/m 2 . It is seen from Fig. 5-18 that the
value of the performance index increases as the maximum allowable heat load
decreases. Thus, the control margin increases as the maximum allowable heat
load increases.
In each case the total heat load experienced by the vehicle is exactly the value
of the maximum allowable heat load. To account for unexpected events encountered during flight, it is beneficial to add a buffer region between the amount of
heat the vehicle will sustain and the amount of heat the vehicle is capable of withstanding. In other words, the trajectory and control should be designed based on
a maximum allowable heat load which is less than what the thermal protection
system is designed to handle. In addition, the maximum rate at which the vehicle can be heated also depends on the thermal system. Thus, the heating rate
and the heat load experienced by the vehicle must both be taken into considera-
113
450
4 400
-I
Qmax= 2300 MJ/m2
Qmax= 1900 MJ/m2
-U-
MJI/m2
A - Qmax =1200 MJ/m2
-0- Qmax = 1100 MJ/m2
350
300
250
c 200
U
150
-
100
- -
-.....
-.....
....
. ...
...-..
-...
-..
. ..
-..
................
-
..
-.
-................
50
0
500
1000
1500
2000
2500
3000
3500
Earth Relative Downtrack Distance (km)
Figure 5-15: Earth Relative Crosstrack Distance vs. Earth Relative Downtrack
Distance for Qmax = (1100, 1300, 1400, 1700,2000,2300) MJ/m 2
tion when designing a trajectory and control. Regardless, it was shown that the
control margin increased as the maximum allowable heat load increased. Thus,
it is desirable to design a vehicle that can withstand as much heat as possible.
114
2
0-2
15,
4-)
0
1
-4-
Qmax=
Qmax=
-o- Qmax=
-A -Qmax=
-0- Qmax=
n
-5
0
100
2300
1900
1600
1200
I 100
|
200
MJ/m2
MJ/m2
MJ/m2
MJ/m2
MJ/m2
|
|
300
400
500
600
700
800
900
Time (s)
of
Attack
Angle
5-16:
Figure
(1100, 1300,1400, 1700, 2000,2300) MJ/m 2
vs.
Time
for
Qmax
r4
0'
100
200
300
400
500
600
800
700
900
Time (s)
Rate
Bank
Angle
5-17:
Figure
(1100, 1300,1400,1700,2000,2300) MJ/m 2
115
vs.
Time
for
Qmax
0.08
*
Qmax=
Qmax=
Qmax=
Qmax=
Qmax=
y
-..
0.07-
A
A
2300
1900
1600
1200
1100
MJ/m2
MJ/m2
MJ/m2 MJ/m2
MJ/m2
0.06a)
0.05a)
C-)
0.04C
I.
0.03-
a)
0.020.01
'
100
1200
1400
1600
1800
2000
4
2200
2400
Qm ax (MJ/m 2 )
Value of the Performance
Figure 5-18:
(1100, 1300,1400, 1700,2000,2300) MJ/m 2
116
Index vs.
Qmax for Qmax
5.5
Effects of the Lift-to-Drag Ratio on the Trajectory
and Control
The maximum lift-to-drag ratio, (LID)max, is determined by the specific design
of the vehicle. The vehicle used in this thesis has a maximum lift-to-drag ratio of
approximately 2.4. This study analyzes the effects on the trajectory and control
of varying (L/D)max between 2.0 and 2.5 while Qmax = oo and qmin = 11.97 kPa
are held constant. The initial guess fed into the optimizer is the solution for the
case where M = 100, ki
k2 = 1, k 3 = 0.1, qmin = 11.97 kPa, Qmax = oo, and
(L/D)max ~ 2.4. In order to vary (L/D)max, it is assumed that the lift coefficient
corresponding to the maximum lift-to-drag ratio is constant. The zero-lift drag
coefficient, CDO, and the drag polar parameter, K, are the parameters that are
The lift-to-drag ratio is computed as
altered as a result of varying (L/D)ma.
follows:
L
_
D
CL
(5.1)
CDo + KC2(
Let CL be the value of CL corresponding to (L/D)max.
Since (L/D)max is the
maximum value of LID, a necessary condition for LID to equal (L/D)max is
(L /D)= 0
aCL
(5.2)
Computing a(LID)/WCL using Eq. (5.1), we have that
CDO - KC
(CDO
and from which we have that CL
0
+ KCLZ)2
CDOIK.
(5.3)
Substituting CL into Eq. (5.1), CDO
and K are given in terms of (L/D)max as follows:
CDO
=
K
=
-(L
CL
2(L/D)max
1
2CL(L/D)max
117
(5.4)
(5.5)
In regards to the effect of the maximum lift-to-drag ratio on the trajectory, as
(LID) decreases the vehicle loses some of its maneuverability. In other words,
if the vehicle is constrained to fly in the downtrack direction during a glide
maneuver, the range of the vehicle will decrease as (LID) decreases. The only
notable distinctions in the trajectory occurs when (L/D)max is reduced to 2.0.
Thus, the comments on the results refer to the differences between a vehicle
with (L/D)max = 2.0 and one with (L/D)max > 2.0. Figure 5-19 shows that during
the initial increase in altitude an (L/D)max of 2.0 depletes more energy while
achieving a slightly lower altitude and the maximum altitude attained near the
end of flight is higher. Looking at the motion of the vehicle in the crosstrackdowntrack plane shown in Fig. 5-20, as the maximum lift-to-drag ratio increases,
the vehicle approaches the target further from behind. The crosstrack distance is
maximized at the lowest maximum lift-to-drag ratio and the vehicle approaches
the target perpendicular to the downtrack direction. Looking at Fig. 5-21, the
speed profile is noticeably different in the case where the maximum lift-to-drag
ratio is the smallest. In the beginning of the trajectory, a vehicle with (L/D)max =
2.0 decreases its speed at a faster rate before reaching a relatively constant speed
while, during the speed depletion phase, it depletes speed at a slower rate. It is
seen in Fig. 5-22 that as the maximum lift-to-drag ratio increases, the total heat
load increases as well.
The effect of the maximum lift-to-drag ratio on the control is minimal. Fig. 5-
23 shows that as (L/D)max is increased the maximum angle of attack decreases
slightly. In terms of the control margin, it is seen in Fig. 5-24 that the performance index increases as (L/D)max decreases. Notice that the increase in the
performance index is relatively constant with the exception of the difference between (L/D)max = 2.0 and (L/D)max = 2.1. In this case, there is a greater increase
in the performance index.
Overall, varying the maximum lift-to-drag ratio has little affect on the trajectory and control margin until (L/D)max is reduced to 2.0. At this point, there is
a clear distinction in the behavior of the vehicle even though the trends are sim118
60
50
0
-
20 -
V
-U--0.
0
0
2
4
6
8
12
10
2.0
2.1
L/Dmax= 2.2
L/Dmax= 2.3
L/Dmax= 2.4
L/Dmax= 2.5
L/Dmax=
L/Dmax=
14
16
18
Energy (GJ)
Figure 5-19: Altitude vs. Energy for (L/D)max
=
(2, 2.1, 2.2, 2.3, 2.4, 2.5)
ilar. In designing the vehicle, it is important to keep in mind that the maximum
lift-to-drag ratio effects the heat load that the vehicle endures throughout flight.
Furthermore, the higher the maximum lift-to-drag ratio, the larger the control
margin. Thus, in this particular application, the more maneuverable the vehicle
the better.
119
500
450
400
L/Dmax= 2.0
L
-O-
A
-0-
I
I
-I
-
=
/DT
L/Dmax=
L/Dmax=
L/Dmax=
L/Dmax=
21
2.2
2.3
2.4
2.5
350
C 300-
-10 x
-/m
-.
250-
- ---- - -U
...-.. ..... .
U 200-
4-0
150-
-......
.... ...
. .
-..
-.-.
. -..........
...... -..
........
10050\500
500
00
1000
1500
2000
3000
2500
3500
Earth Relative Downtrack Distance (km)
Figure 5-20: Earth Relative Crosstrack Distance vs. Earth Relative Downtrack
Distance for (L ID)max = (2, 2.1, 2.2, 2.3, 2.4, 2.5)
a)
w)
4-
L/Dmax=
- L/Dmax=
-U- L/Dmax=
A- L/Dmax=
-0- L/Dmax=
L/Dmax=
1000 "
0
100
Earth
Figure 5-21:
(2, 2.1, 2.2, 2.3, 2.4, 2.5)
2.0
2.1
2.2
2.3
2.4
2.5
200
300
400
600
500
700
Time (s)
Relative
Speed
120
vs.
Time
for
(L/D)max
2500
2000
C
1500
1000*
L/Dmax=
2.
. .....
. .............. .. V L/tb max = 2.
...
....
-U--L/Dmax= 2.
500
-o
A
200
100
0
400
300
L/Dmax=
L/Dmax=
L/Dmax=
2.
2.
2. 5
600
500
700
Time (s)
Stagnation
Figure 5-22:
(2, 2.1, 2.2, 2.3, 2.4, 2.5)
Point Heat
Load
vs.
Time
for
18- .
16
S)
4-d
-
14 121 __
10
. . . . . ......... .
............ .
..........
..
.. -..
-
4-4
..
...-..
8
. ..
-..
..
6
-44
2
V
-UA
--
0,
C
2.0
2.1
2.2
2.3
L/Dmax= 2.4
L/Dmax= 2.5
L/Dmax=
L/Dmax=
L/Dmax=
L/Dmax=
100
200
300
400
500
600
700
Time (s)
Figure 5-23: Angle of Attack vs. Time for (L /D)ma = (2, 2.1, 2.2, 2.3, 2.4, 2.5)
121
=
I17
1
201
(L/D)ma
-3
4.75
4.7
a)
4 .65|
a)
C-)
4.6
C
a) 4.55.
4.5
*
A
A
0
00
4.45'
2
L/Dmax=
L/Dmax=
L/Dmax=
L/Dmax=
L/Dmax=
L/Dmax=
2.05
2.1
2.0
2.1
2.2
2.3
2.4
2.5
2.15
.
2.2
2.25
2.3
2.35
2.4
2.45
2.5
(L/D)max
Figure 5-24: Value of the Performance Index vs. (L/D)max for (L/D)max
(2, 2.1, 2.2, 2.3, 2.4, 2.5)
122
5.6
Summary of the Parametric Study
The key features of the optimal trajectory were described to provide insight to
the behavior of the Common Aero Vehicle. It was seen that the vehicle initially
increased in altitude until the dynamic pressure constraint became active. This
was done to ensure the vehicle can travel the required downtrack distance. The
local maximum in altitude reached near the end of flight resulted from the need
to first increase the speed to generate lift and then the need to deplete speed,
both of which are done in order to meet the terminal conditions. In terms of
lateral motion, the vehicle steered 410 km out of the Earth Relative downtrack
plane. Another important feature was that while the performance index kept the
angle of attack near the middle of its corridor for a majority of the trajectory,
the angle of attack reached a maximum value near the end of flight. This trend
arose from the need to meet the terminal conditions imposed on the speed and
the flight path angle. Furthermore, the vehicle was banked at -90 deg for about
200 seconds of flight in order to decrease the altitude of the vehicle and the
terminal bank angle was -180 degrees, which resulted from the need to meet the
terminal flight path angle.
The minimum allowable dynamic pressure, maximum allowable heat load,
and maximum lift-to-drag ratio were varied in order to assess their effect on
the resulting trajectory and control profiles. The dynamic pressure constraint
maintains control authority by preventing the vehicle from exiting the Earth's
atmosphere. The minimum allowable dynamic pressure was varied and in each
case the constraint became active at each point where the vehicle attained a local
maximum in altitude. As the minimum allowable dynamic pressure increased,
the initial increase in altitude decreased and the speed depleted at a slower rate.
It was found that in terms of maintaining control authority, the lower the min-
imum dynamic pressure constraint the better. The maximum allowable stagnation point heat load that the vehicle can sustain is determined by the thermal
protection system of the vehicle. In each case of varying the maximum total
123
heat load, the optimizer designed a trajectory in which this constraint was active. This is an important characteristic to consider when designing the vehicle
and in determining an optimal trajectory. As the maximum allowable stagnation point heat load was decreased, the vehicle skipped through the atmosphere
more, initially depleted speed at a faster rate, and the vehicle took a more direct
line to the target by approaching it more from the front (versus further downtrack). It was also found that the more heat the vehicle can withstand, the larger
the control margin. Another design parameter considered was the maximum
lift-to-drag ratio which reflects the maneuverability of the vehicle. Of the three
parameters varied, the range of lift-to-drag ratios considered has the least effect
on the trajectory and controls. Even though the differences in the control margin
for each case was smaller, the higher the lift-to-drag ratio the larger the control
margin.
It is evident from this parametric study that the properties of the Common
Aero Vehicle in combination with the required terminal conditions result in four
important features of the trajectory and control. By varying parameters in the
problem, it was found that that the looser the constraints on dynamic pressure
and maximum heat load and the higher the maximum lift-to-drag ratio, the larger
the control margin.
124
Chapter 6
Preliminary Study of the Real-Time
Application of the Legendre
Pseudospectral Method
6.1
Overview
This chapter addresses the potential real-time application of the Legendre Pseudospectral Method. A useful method to consider in the context of real-time is
one which is capable of obtaining a solution in a sufficiently short period of time,
can solve a wide range of problems, and produces an accurate control. The ability to implement a method in real-time depends upon both the amount of computational resources that are available and the computational complexity of the
problem. In particular, the execution time required to solve an optimal control
problem using the Legendre Pseudospectral Method is highly dependent upon
the optimizer and the machine used. A complete assessment of the solution
time involves comparing the execution time of various optimization algorithms.
However, this thesis is concerned with the Legendre Pseudospectral Method, not
the optimization algorithm. Therefore, the execution time required to solve the
CAV optimal control problem using the Legendre Pseudospectral Method is not
125
considered in this preliminary analysis. Furthermore, it is shown from the diversity of problems solved in Refs. [6, 7, 9, 10, 11, 19, 20, 211 that the Legendre
Pseudospectral Method is indeed capable of solving a wide range of problems.
Thus, this preliminary study is restricted to the assessment of the accuracy of
the solution obtained using the Legendre Pseudospectral Method.
The accuracy of the solution obtained via the Legendre Pseudospectral Method
is assessed by simulating the flight of the Common Aero Vehicle. In the simulation, the control is updated periodically based on the current state of the vehicle. However, a point is reached where the control can no longer be updated.
At this point, the motion of the vehicle is simulated using the previous control
to fly the vehicle until Earth impact. The state of the vehicle at Earth impact
is considered to be the actual performance of the CAV while the most recent
solution obtained via the Legendre Pseudospectral Method is considered to be
the predicted performance of the CAV. Thus, the accuracy of the solution obtained using the Legendre Pseudospectral Method is assessed by comparing the
predicted solution to the actual solution. Furthermore, realistic vehicle and environmental dispersions are added to the simulation. The perturbed model is
created with the intention of assessing the accuracy of the solution subject to
"real life" uncertainties in the vehicle performance.
6.2
Common Aero Vehicle Flight Simulation
Fig. 6-1 depicts a typical simulation for the flight of a vehicle, where N, G, and
C are the navigation, guidance, and control systems, respectively, that comprise
the flight software. The navigation system estimates the current state of the
vehicle and provides this information to the guidance system. The guidance system uses the navigation information to determine the commands that steer the
vehicle to the prescribed terminal state. The control system then implements
these control commands and provides the control to the environment model.
The environment model uses this information to simulate the flight of the vehi126
cle given a particular environment model and a vehicle model. The state of the
vehicle at the end of the cycle is predicted by using the control from the flight
software to integrate the equations of motion. The state calculated by the environment model is then fed into the navigation system and the steps described
above are repeated for the duration of the flight.
For simplicity, it is assumed in this simulation that the state is known perfectly and that there is no error associated with implementing the controls. As
a result, the navigation and control systems are not modeled. Thus the simulation consists of the guidance system and the environment model. Furthermore,
the simulation operates under the assumption that the optimizer can instantaneously produce a new set of control commands which are then updated every
10 seconds.
Target Information
Flight Software
N
G
C
1_
_Model
Figure 6-1: Flight Simulation Block Diagram
Guidance System
In this study, the "guidance algorithm" is the iterative procedure that arises
from using SNOPT to solve the NLP that arises from the Legendre Pseudospectral
Method. The guidance law is the steering command that arises from solving the
NLP. The NLP is solved periodically at time intervals called guidance cycles: thus,
each time the NLP is solved, the vehicle is closer to the target and the time of
flight decreases. Since the number of nodes used throughout the simulation is
127
fixed and the duration of flight decreases with each guidance cycle, the absolute
spacing between the LGL points decreases. This creates the effect of increasing
the number of nodes, which increases the accuracy. As a result, instead of using
100 nodes (which was used to compute the trajectories in the previous chapters),
the number of nodes is reduced to 50 for the guidance simulation. Furthermore,
the guidance algorithm requires an initial guess. Prior to flight, an optimal solution to the CAV optimal control problem is generated. This predetermined
optimal solution is supplied to the guidance system as an initial guess. In this
study, the simulation begins with a converged optimal solution that corresponds
to using 50 nodes and the numerical values previously stated in Chapter 4. After
the completion of the first guidance cycle, the most recent solution generated by
SNOPT is used as the initial guess for the current guidance cycle.
Environment Model
The environment model predicts the state of the vehicle after flying with the
current control for one guidance cycle (10 seconds). The state of the vehicle
is predicted by using the current control to integrate the equations of motion.
The equations of motion include models of the vehicle and the environment. In
particular, a 4 th order Runga-Kutta integration scheme with a constant stepsize
of h = 1 s is used to integrate the equations of motion. Furthermore, Lagrange
interpolation is used to compute the controls during the numerical integration.
The integration is carried out in SI units because the information readily avail-
able from the guidance system is usually in dimensional quantities. The state is
then fed into the guidance system which solves the optimal control problem to
determine a new set of control commands based on the most recent estimation
of the state of the vehicle. In order to maintain a continuous control profile from
one cycle to the next, the controls are included in the specification of the initial
state. This process is repeated ideally until the time remaining in the flight is
less than ten seconds. However, there comes a point at which the optimizer is
unable to find an optimal solution. From the point where the optimizer is no
longer able to find a solution, the flight of the vehicle is simulated using the
128
control from the last converged solution. For the remainder of this chapter, the
portion of the simulation in which the control is updated is referred to as the
closed-loop simulation while the portion of the simulation in which the control
can no longer be updated is referred to as the open-loop simulation.
6.3
Assessment of the Accuracy of the Legendre Pseudospectral Method
The simulation described in the Section 6.2 is implemented both with and without perturbations in the environment model. In the perturbed cases, dispersions
in the value of the mass, lift-to-drag ratio, and the density are each added to the
simulation separately. Simulation perturbations are implemented by altering the
vehicle and environmental models used in the environment block. In all cases
the predicted results obtained via the Legendre Pseudospectral Method are compared to the actual results generated by the environment model. In particular,
the terminal error in position and speed is used to assess the accuracy of the
Legendre Pseudospectral Method in terms of the potential for real-time application. These terminal errors are calculated in the same manner described in
the node analysis. Please refer to Section 4.5.2 for a description of the specific
equation used. Since an optimal solution from the optimizer by definition satisfies the terminal conditions, any error in the terminal state results from the
open-loop simulation. Since the integration terminates at a zero altitude, the
integration time may exceed the final time from the solution generated by the
Legendre Pseudospectral Method. When this occurs, the integrator runs out of
control. If the integration time is greater than the predicted final time, the control used in the integration is set equal to the control from the optimal solution
corresponding to the final time and held constant. Thus both the time at which
the open-loop guidance begins and ends along with the final time corresponding
to the optimal solution are considered.
129
CASE I: Accuracy of Simulation Results Without Perturbations
For this study, the model used in the environment model is the same as
the model used in the optimization algorithm. The terminal position error is
14.0 meters and the terminal speed error is 2.77 m/s. While the position error
violates the requirement of position accuracy to within several meters associated
with striking HDBTs, the error is small when considering the distance traveled
by the vehicle. The speed accuracy is well within the prescribed accuracy range
of ±500 m/s. In this case the closed-loop simulation terminates at 610 seconds
into the flight and at this point, the vehicle is flown open-loop for an additional
41 seconds. The integrated solution guides the vehicle to a zero altitude with
a final time of 651 seconds while the last optimal solution from the Legendre
Pseudospectral Method begins at 610 seconds and terminates at 647 seconds.
Thus the actual solution terminates 4 seconds after the predicted final time.
In order to improve the accuracy of the simulation results, the number of
nodes can be increased. However, this will increase the solution time, which is
undesirable when considering real-time. In terms of the open-loop simulation,
there are two factors that effect the accuracy of the solution: the accuracy of
the integration scheme and the length of time for which the vehicle is flown
using the open-loop simulation. To improve the accuracy of the integration
scheme it would be beneficial to conduct an analysis to determine which integrator produces the most accurate results given the control from the Legendre
Pseudospectral Method. To shorten the duration of flight flown in an open-loop
simulation, the point at which the closed-loop simulation terminates may be delayed by loosening the terminal constraints on the speed, flight path angle, and
angle of attack. For example, since the terminal speed must lie within ± 500 m/s
of vf, set the lower bound on speed at the final LGL point equal to (vf - 500)
m/s and the upper bound at the final LGL point equal to (vf + 500) m/s.
CASE II: Accuracy of Simulation Results with Perturbations in the Mass of the
Vehicle
In this case the value of the mass used in the environment model is per130
turbed by ±1%. This accounts for a 6.87 kg difference between the assumed
and actual values of the mass of the vehicle. With a positive perturbation in
mass, the closed-loop simulation terminates at 620 seconds into flight and the
vehicle flies using the open-loop simulation for 84 seconds. The integrated solution terminates 3.04 seconds after the predicted final time from the last optimal
solution. The terminal error in position is 290 m while the speed error is 36.7
m/s. A negative deviation in mass results in a 660 second closed-loop simulation and a 52 second open-loop simulation where the actual flight terminates
2.23 seconds after the predicted flight. The corresponding trajectory hits the
ground 210 m away from the target with a 7.47 m/s error in speed. Comparing
both cases, a negative perturbation in the mass results in a longer closed-loop
simulation, shorter open-loop simulation, and better accuracy in regards to the
terminal error in position and speed than a positive perturbation in mass.
CASE III: Accuracy of Simulation Results with Perturbations in the Lift-to-Drag
Ratio of the Vehicle
In this case the lift-to-drag ratio (LID) is perturbed by ±1%. This is done
by perturbing the lift coefficient and the drag polar parameter. Consider the
situation where oc =
ec.
A ±1% (LID) perturbation corresponds to perturbing
the angle of attack by ±0.119 deg while the drag polar parameter is perturbed
by T0.1. With a positive deviation in the lift-to-drag ratio, the vehicle is flown
using a closed-loop simulation for 570 seconds before switching to an openloop simulation for the remaining 82 seconds of flight. In this case the actual
flight is 3.56 seconds longer than the predicted flight. The resulting terminal
error in position is 287 m and the terminal speed error is 7.92 m/s. A negative
deviation in (LID) has a much greater affect on the accuracy of the solution. In
this case the closed-loop simulation only lasts for 500 seconds while the openloop simulation lasts for 153 seconds. The actual terminal time is 2.38 seconds
greater than the predicted final time. The terminal error in position is 2580 m
and the speed error is 71.4 m/s. A negative deviation in LID leads to a much
shorter closed-loop simulation and a much longer open-loop simulation than a
131
positive deviation. Despite the fact that the difference in final times between the
actual and predicted solutions is shorter, the terminal errors are significantly
larger for the case with a negative perturbation versus a positive perturbation in
the lift-to-drag ratio.
CASE IV: Accuracy of Simulation Results with Perturbations in the Atmospheric Density
In this case the density is perturbed by ±5%. Perturbing the density this
amount results in a deviation of ±0.06125 kg/m 3 , respectively, from the assumed sea level density of 1.225 kg/m 3 . Take the situation where the vehicle
is at a zero altitude. Using the strictly exponential density model of Eq. (2.17)
and taking the vehicle to be at sea level, a ±5%deviation in the density results
in a altitude deviation of roughly ± 345 m. This is an extremely large difference
which will decrease as the density decreases. When the density is perturbed by
+5%, the closed-loop simulation flies the vehicle for 550 seconds and from this
point, the open-loop simulation flies the vehicle for 102 seconds. The final time
from the open-loop simulation is 2.33 seconds longer than the final time predicted by the closed-loop simulation. The resulting terminal error in position is
1990 m and the terminal speed error is 54.2 m/s. Similar results are obtained
from perturbing the density by -5%. The closed-loop simulation terminates at
570 seconds and the duration of the open-loop simulation is 98 seconds. This
leads to a difference of 4.33 seconds between the actual and predicted final time
of flight, where the actual final time is the greater of the two. The terminal error
associated with the negative deviation in density is 1560 m for position and 113
m/s for speed. The closed-loop simulation is 20 seconds longer and the openloop simulation is 2 seconds shorter in the case with a negative perturbation
versus the case with a positive perturbation in density. However, the difference
in the actual final time from the predicted final time is much larger in the case
with a negative perturbation. While the position error is better in the case with a
negative deviation in density, the speed error is roughly double that of the case
with a positive density deviation.
132
Summary of the Results from the Perturbed Simulations (CASE II-IV)
Table 6.1 and 6.2 summarize the results from perturbing the environment
model in terms of the terminal errors and computational performance. Table
6.1 lists the terminal position and speed errors corresponding to the value that
was perturbed. It is evident from these values that while the vehicle does not
hit the specified target, in every case the vehicle impacts the Earth with the required kinetic energy associated with striking HDBTs. Table 6.2 compares the
computational performance of the simulation with perturbations where TCL is
the duration of the closed-loop simulation, TOL is the duration of the open-loop
simulation, and ATj is the difference between the actual final time and the predicted final time. It is seen that a negative perturbation in the mass of the vehicle
results in the longest closed-loop simulation while a negative perturbation in the
lift-to-drag ratio of the vehicle has the shortest closed-loop simulation.' The duration of the open-loop simulation directly relates to the terminal position error
in the fact that the shorter the open-loop simulation, the smaller the position
error. Thus, the case with a negative deviation in mass has the shortest openloop simulation and the case with a negative deviation in the lift-to-drag ratio
has the longest open-loop simulation. The difference in final times between the
actual and predicted solutions are pretty close for all of the cases considered.
The case with a negative perturbation in the density model produces the largest
difference while the case with a negative mass deviation produces the smallest
difference in final times.
Table 6.1: Terminal Errors from the Simulation with Perturbations
Perturbation Position Error (m) Speed Error (m/s)
+1% m
290
36.7
-1%m
+1%L/D
-1%L/D
210
287
2580
1990
1560
7.47
7.92
71.4
54.2
113
+5% p
-5%p
133
Table 6.2: Computational Performance of the Simulation with Perturbations
6.4
Perturbation
TCL (S)
+1%m
-1%m
+1%L/D
-1%L/D
+5%p
-5%p
620
660
570
500
550
570
(s)
84
52
82
153
102
98
TOL
ATf (S)
3.04
2.23
3.56
2.38
2.33
4.33
Summary
The Legendre pseudospectral method was assessed in the context of real-time
application in terms of the solution accuracy. A flight simulation of the Common
Aero Vehicle was constructed where the state of the vehicle was updated periodically throughout flight. A
4 th
order Runga-Kutta integration scheme was used
in the environment model to update the state of the vehicle. The updated state
along with the last control were used as an initial condition while the previous
converged solution was used as the initial guess for the optimization algorithm.
Since the optimization process was repeated with current information regarding
the state of the vehicle, the number of nodes was reduced to 50. It was found
that without perturbations in the environment model, the terminal position error was roughly 14 meters. With this position error, the vehicle will miss the
target; however, it is insignificant in comparison to the distance traveled by the
vehicle. The speed error was well within the allowable bounds and thus, the
vehicle will penetrate the surface of Earth with the required kinetic energy.
The robustness of the solution was then addressed in terms of accuracy by
adding perturbations to the mass, lift-to-drag ratio, and density values used in
the environment model. A deviation in the mass of the vehicle was modeled in
the environment and it was found that in terms of the terminal error in both
position and speed, it is better to over estimate than to under estimate the mass
of the vehicle.
In terms of the lift-to-drag ratio, there was a large difference
134
in accuracy between a positive and negative deviation in the lift-to-drag ratio.
In this case it is much more beneficial to under estimate the lift-to-drag ratio.
With a density perturbation, the vehicle came closer to hitting the target with
a negative deviation versus a positive deviation and the speed error was larger
for the case with a negative deviation in density versus a positive deviation in
density. Overall, the solution was the most sensitive to deviations in the density
and the least sensitive to deviations in the mass. However, a negative deviation
in LID resulted in the largest terminal position error.
The results from the unperturbed simulation indicate that the Legendre Pseudospectral Method shows promise for use in real-time. In terms of the perturbed
cases, the solution is the most sensitive to a negative perturbation in LID as well
as any perturbation in the density model. The resulting position errors directly
correspond to the duration of the flight flown using the open-loop simulation.
In order to improve the accuracy of the solution subject to perturbations in the
model, the duration of the open-loop simulation should be decreased. In order
to improve the robustness of the solution to perturbations, a more accurate vehi-
cle model should be developed and a more accurate atmospheric model should
be used. However, these improvements may increase the solution time. Thus,
in order to continue this analysis on the real-time application of the Legendre
Pseudospectral Method to the Common Aero Vehicle a detailed comparison of
both the execution time and the solution accuracy should be conducted.
135
[This page intentionally left blank.]
Chapter 7
Conclusions
7.1
Summary
The United States desires space-based global strike capabilities. Global strike
refers to the ability to project power anywhere on the globe from the continental United States in short notice. This desire leads to the development of new
vehicles which involve space launch and Earth re-entry. This thesis considered
the use of the Common Aero Vehicle (CAV) as the Earth re-entry vehicle. Furthermore, the Common Aero Vehicle (CAV) considered is an unpowered bank-to-turn
high lift-to-drag ratio Earth penetrating re-entry vehicle. The natural behavior of
the vehicle conflicted with the behavior required to satisfy the terminal conditions for striking HDBTs. As the vehicle neared the target, the demands on the
guidance and control systems increased greatly. In order to maintain control
flexibility, it was desirable to determine a trajectory and control in which the
control margin was maximized. The CAV mission design problem was to steer
the CAV from a completely specified initial condition to a partially specified
terminal state on the surface of the Earth such that a performance index is minimized and the constraints imposed on the vehicle are satisfied. This resulted in
an optimal control problem.
A solution to the optimal control problem was obtained using a direct Legendre Pseudospectral Method. The Legendre Pseudospectral Method discretizes
137
the optimal control problem at the Legendre-Gauss-Lobatto points and the resulting NLP is solved using one of the many available software programs. The
resulting Common Aero Vehicle NLP has both linear and nonlinear inequality
and equality constraints and a sparse Jacobian.
SNOPT is a general purpose
solver that takes advantage of the sparsity of the problem and thus, it was used
to obtain a solution to the NLP. The steps required to obtain a solution to the
Common Aero Vehicle optimal control problem via the Legendre Pseudospectral
Method were explained in detail. Also included was an analysis to determine the
number of nodes to use in order to obtain an accurate solution. Another analysis involved in setting up the numerical optimization problem was the choice of
weighting factors in the performance index that maximize the control margin.
Thus, the generation of a trajectory and control was discussed in terms of the
desired vehicle performance as well as the accuracy of the solution obtained.
Once the optimization setup was completely defined, the key features of the
trajectory and control were noted to better understand the Common Aero Vehicle. A parametric optimization study was then conducted to demonstrate the
application of the Legendre Pseudospectral Method to vehicle design. The minimum allowable dynamic pressure, maximum allowable stagnation point heat
load, and maximum lift-to-drag ratio were varied independently to determine
their effects on the trajectory and control.
Finally, a preliminary study assessed the real-time application of the Legen-
dre Pseudospectral method to the Common Aero Vehicle optimal control problem in terms of the accuracy of the solution. This was done by simulating the
actual flight of a Common Aero Vehicle. An environment model was used to
update the state of the vehicle and the state was then used to update the control
history. This process repeated until the optimizer could no longer find an optimal solution. At this point an open-loop simulation was used to fly the vehicle.
The open-loop simulation integrated the equations of motion using the most recent control until the vehicle impacted the Earth. Included in this last analysis
was the effect of model uncertainties on the ability of the control to steer the
138
vehicle to a specified target on the surface of the Earth. In both cases with and
without uncertainties, the accuracy of the solution was assessed by calculating
the resulting terminal error in position and speed.
7.2
Conclusions
The Legendre Pseudospectral Method is capable of solving a complex optimal
control problem. The trajectory and control generated using 100 nodes satisfied the strict accuracy requirements associated with the Common Aero Vehicle.
Furthermore, the performance of the Common Aero Vehicle was optimized by
maximizing the control margin. Maximizing the control margin refers to keeping
the angle of attack near the middle of its corridor, minimizing the maximum angle of attack, and keeping the control rates small. The value of the performance
index were used as a direct measure of the control margin corresponding to the
optimal solution.
The terminal constraints are the driving force behind the characteristics of
the trajectory and control for the Common Aero Vehicle. The vehicle initially
increased in altitude which resulted from the need to satisfy the range requirements. At each local maximum in altitude the density constraint became active.
A minimum allowable density constraint was imposed to prevent the vehicle
from escaping the Earth's atmosphere and to maintain control authority. Furthermore, the vehicle steered out of plane 410 km and traveled farther downtrack than the target. Thus, it actually approached the target from behind. While
the performance index kept the angle of attack near the middle of its corridor
throughout most of the trajectory, the angle of attack reached a maximum value
near the end of flight. This resulted from the need to meet the terminal conditions on the speed and flight path angle. The terminal flight path angle also
drove the bank angle to -180 deg. In order to obtain a flight path angle of -89.9
deg, the vehicle must approach the target with negative lift. Negative lift was
generated by rotating the vehicle upside-down, which corresponds to a bank
139
angle of -180 deg.
In order to better understand the behavior of the Common Aero Vehicle, the
minimum allowable dynamic pressure, maximum allowable heat load, and maximum lift-to-drag ratio were varied. In each case where the dynamic pressure
constraint was varied the local maxima in altitude corresponded to the points
where the dynamic pressure constraint was active. Since the dynamic pressure
constraint is a function of altitude and speed, as the minimum allowable dynamic pressure increased, the initial increase in altitude decreased and the speed
decreased at a slower rate. The stagnation point heat load is also a function of
altitude and speed. In each case where the maximum allowable heat load constraint was imposed, the optimizer yielded a solution in which the vehicle hit its
upper limit on heat load. As the maximum allowable heat load decreased, the
vehicle skipped through the atmosphere more and approached the vehicle on a
more direct path. The looser the constraint on dynamic pressure and heat load,
the larger the control margin. Thus it is desirable to design a vehicle that can not
only fly in a low density region while still maintaining control authority, but can
withstand a large amount of heat load. The lift-to-drag ratio corresponds to the
maneuverability of the vehicle and had an insignificant effect on the trajectory
and control. Nonetheless, the value of the performance index indicated that the
higher the lift-to-drag ratio (the more maneuverable the vehicle) the larger the
control margin.
In terms of the real-time application of the Legendre Pseudospectral Method
to the Common Aero Vehicle optimal control problem, a preliminary study was
conducted. Without perturbations in the simulation environment, the position
error was roughly 14 m and the speed error was well within the range of the
required accuracy. These results are impressive considering the distance that the
vehicle is traveling and the stressing terminal conditions imposed. This indicates
that the Legendre Pseudospectral Method shows promise for the use in real-time
and that a more detailed analysis involving the optimizer used in the closed-loop
simulation and the integration scheme used in the open-loop simulation must
140
be conducted for a more conclusive assessment. The robustness of the solution
was also considered in terms of the application of the Legendre Pseudospectral
Method to the Common Aero Vehicle optimal control problem. The mass, liftto-drag ratio, and density in the environment model were perturbed and the
resulting accuracy was considered along with the computational performance.
The solution was the most sensitive to a negative perturbation in the lift-to-
drag ratio followed closely behind with any deviation in the density. It was also
seen that the shorter the open-loop simulation, the smaller the terminal error in
position.
141
[This page intentionally left blank.]
Appendix A
Notation
1. If a e R"Y and b E R, the following notation represents term-by-term multiplication on vectors
ab=
ai
a1 bi
a2
a2 b2
a3
a3 b 3
an
anbn
and the same holds true for division.
2. A diagonal matrix is denoted by (-) as shown below.
(1)=
1
0
0
...
0
0
1
0
---
0
0
0
1
-..
0
- - --
0
--- 1
3. Square brackets are used to indicate a matrix. For example, [0] is a matrix
of zeros. However, in some instances, square brackets may represent a
row or column "matrix". The matrix dimensions should be obvious based
on the context.
143
[This page intentionally left blank.]
Appendix B
Matrix Derivatives
The rules listed in this appendix pertain to the matrix manipulations used to
calculate derivatives. These rules are used to calculate the analytic Jacobian and
objective gradient. First, the matrix which results from taking the derivative of
one vector with respect to another vector is defined. Second, the chain rule is
used to generalize the rule for differentiating the multiplication of two vectors
with respect to a common vector. The term common vector refers to the fact
that both vectors in the multiplication are a function of the same vector.
Consider the following two vectors with different lengths: y E Rm and x C R".
Taking the derivative of y with respect to x results in the following m x n matrix:
dy =
dx
dy
1
dy 1
dy 1
dy
dx
1
dx
dx
3
dxn
2
1
dy 2
dy 2
dy 2
dy
dx
dx2
dX 3
dxn
dym
dym
dym
dym
dx
dx
dx
dxn
1
1
2
3
2
(B.1)
Now consider two vectors with the same length, a c R" and b c Rm. Define
the vector y c Rm to be the term-by-term multiplication of a and b. Using the
notation defined in Appendix A, y can be written as:
y = ab
145
(B.2)
Furthermore, assume that a and b are both functions of x E Rn. In order to
determine the derivative of y with respect to x, the chain rule must be used.
da1
db 1
bi+ aid
da2
db 2
b + a2
dx1 2
dx 1
dy
dx
da1 b +
db1
dx2
dx 2
da2
dx
2
dbm dam
dxi dx 2
dam
dx 1
da 1
dxn
da2
dxn
b2 + a 2 db2
dx
2
dam b
dxn
dbm
+am dx
db 1
dxn
db2
dxn
2
+mdbm
+am~~
dxn(I
(B.3)
Rewriting the matrix above as the sum of two matrices, the following is obtained:
-
b2
dy
dx
da1
da1
dx1
dx
b2
dx 1
dxn
b 2 da2
2
dx2
dxn
dam
bmdam
bmdx1
dx
dbi
dx1
-
bi dal
a2
+±
db 2
a 2 dx
dx 1
dbm
bdam
mdx
_
2
_ adx
db1 ~
dxn
db
ai dx 1
dx2
db 2
1
aid
db2
azd
dxn,
2
dbm
adn
dbm
am dx 2
(B.4)
dy
Splitting each matrix in B.4 into the multiplication of two matrices d, can be
dx
rewritten as:
bi
0
dy
0
0
---
0
0
b2
0--
0
0
0
a1
0
0
0
0
a2
0
0
0
0
0
da1
dx 1
da2
dx1
da1
da1
dx
2
dx 3
2
dx
da 2
dx
dx
+±
---
-
bm
am
da 2
3
dam dam dam
dx1 dx2
dx 3
dbi
db1 db 1
dx 1 dx2
dx 3
db 2 db 2
db2
dx 1 dx 2 dx 3
dbm dbm
dbm
dx 1
dx
dx
2
3
da 1
dxn
da 2
dxn
dam
dxn
dbi
dxn
db 2
dxn
(B.5)
dbm
dxn
Using the notation defined in Appendix A, the matrix equation in B.5 can be
146
condensed to the following form:
dx
(b) d + (a)
dx
dx
147
(B.6)
[This page intentionally left blank.]
Appendix C
Constraint Jacobian and Objective
Gradient Derivation
This appendix defines the constraint Jacobian and objective gradient of the CAV
mission design problem used in this thesis. The notation defined in Appendix A
and the rules set forth in Appendix B are used.
149
C.1
Constraint Jacobian
acx
acx
ay
ax
aC
ax
ax
ac,
ax
acx
acx
acx
az
avx
v
acy acy
y
acy
ay
az
vx- avy
acz acz acz acz acz
ay
az avx avy
acx
ac,
acx
acvx
ay
acx
avx
az
acey
ay
acoz
ay
aca
az
acovz
aC
ac
ac, ac,
acr
ax
ax
acV
ay
ay
aCq
ax
aCa
ax
acQ
acoz
avy
aca
Vx
az
ay
acv
aCq
acoz
az
az
acQ
ac,
aCy
acy
acy
ae
acx
ae
aca
avy
acQ
acQ
aca
aCa
avz
ae
aca
aT a0
acr
acr
avz
avz
aca
aCa
acQ
ax
ay
az
acy
acy
acy
acy
avz
acy
ax
ay
z
vx
av-y
avz
v
aCq
v
av
ua
a-
aa
aCa
acr
Toac,
av
aCq
aua
au
acuz
aue
aca
au
ac, ac(7
aua
acr
aua
au,
acr
u,
acv
acV
at5
acz
atj
acvx
atj
at5
acoz
ataca
atf
acJ
atf
acr
atf
ac,
a,
ana uT,
aCq
aCq
atf
aCq
aCa
aca
aCa
aco
ao- aua
aCa
tf
Cvy c acvy acy
aca
Oac( ac,
avz
v
c y
ao-
aa
v
ac,
au
acz
au
ac.x
au
aua
aca
acy
v v
aua
ac x
a
acvz
aCq
ac,
Jo-
ac,,
a0-
avz
acoz
acvy
aU(T
ana
acz
0
acz
acvy
a- aua
acvz acoz
v
aCq
avx
avx
acy
ac
acz
v
ae
avy
ac,
acv
acv
aCq
ac
acx
avz
avx
az
ay
acr
avx
acv
aCq
acQ
avx
acr
az
acv
ay
avy
vx
acr
acx
Dua
ac,
j0-
c
avz
acz
avz
avy
aCa
az
acr
ax
az
aca
ay
avy
acx
a
ac,,y acvy acuz acvy
avx
ax
actz
ax
aca
ax
aJac
ac
acx
T,
DVz
au7
aae
acQ
aacQ
aua
acQ
acQ
au
atj
atf
a-
7-
aua
-u-l
atf
acy
acy
acy
acy
acy
7a
-
auT,
atj
o
ua
_
(C.1)
Partial Derivatives of Cx Constraint
Cx = DNX -
aCx
ax
a_
aCx
acc
= DN
[01
=
ay
acx
az
(C.2)
tfto)vx
ao-
[0]
=
[0]
=
[0]
x
[0]
0acx
[0]
___=
aCx
avx
acx
av,
aCx
___=
-
(tf,-to
au,,
(C)
au,
ac
[0]
atf
[0]
avz
150
_
1
2
= -- [vx
(C.3)
Partial Derivatives of C, Constraint
CY = DNY-
acy
ax
acy
ay
acy
az
acy
avx
=
[0]
=
DN
=
[0]
=
[0]
acy
t2to)V
ac,
aua
acy
=
[0]
=
[0]
=
[0]
(C.5)
au"
acy
0]
-
auf
(t5 - to ()
-
(C.4)
vy
1
2
at5
acy
avz
[0]
=
Partial Derivatives of C, Constraint
Cz = DNZ
acz
acz
az
[0]
-
=
2
t
DN
(C.6)
vz
acz
ac(
acz
ao
acz
= [0]
ax
ay
acz
-
[0]
=
[0]
-
= [0]
(C.7)
acz
avz
-
[0]
=
[0]
au"
acz
atf
0]
-
-I
2[vz]
(t5 -to)()
=
Partial Derivatives of Cx Constraint
Cvx = DNVx
tf
-
( to
Dx + Lx + gx + 2wvy + w2]
151
(C.8)
acvx
ax
acvx
ay
acox
az
acvx
acvx
(t5
_
-
to) (aDx
/\ax
2
(tf
/aDx
2 to
2)
)x
ax
/
C Lx+
agx
ay ay/
a Lx
agx
)z az/
J\az
2
+aLx
_
_
/
( 5 - to
\ 2 /
\ 2
Dx
av
a Dxav
t5-t0
/\avz
_ _ tf-toDx
acvx
am o
)\aa
2
(t5-to)
acvx
a o
2
-
au"
=
atf
agx
\ay
(t5 - to) /aDx
acvx
av
avz
acvx
acvx
aLx
aDx
(\ao
+
agx
av,
v
agx + 2w
)Lx
vy
avy
Lx
agx
z
avz
Lx
Nvz
(C.9)
agx
Of
aa/
aLx
agx
af/
av
[0]
[0]
=-[Dx+Lx+gx+2wvy
+ ozX]
Partial Derivatives of Coy Constraint
Coy = DNVy -
to
D
152
+L + gy
-
2wvx + w2y]
(C.10)
acvy
ax
/
- to
S(t
K
\2
acVY
(tj -to)
ay
acvy
+ a L,
ax
x
aDy
a L,
(tjt~)KaDy
az
2
acvy
avx
\t2 t
acvy
avy
acvy
DN -
az/
agy
} avy
avy
avy
)\aD
a
x
ag, - 2w
avx
L,
ag,
avz
avz/
a L,
agy
z Of
aa/
L,
ar
agy
[D+Ly +gy--2wvx+W
2
aD,
aa
acv
(t-t
atj
ag,
)z
aLy
2
act5
aL,
av
f -
t5 -to)
-
ay
)
aa
acVy
+gy
y
av
- to )D
2
aua
ax/
Kvx
\2
(t f
S
avz
au,
i
ay
2
az
ag,
)K
aDy
or
(c. 11)
au/
[0]
acoy - [0]
=
2y
yY
Partial Derivatives of Coz Constraint
Coz =
DNVz
tf 2 to) [Dz + Lz + gz]
153
(C.12)
acvz
(t5-to)K/aDz
ax
) \ax
\2
acvz
ay
\ 2
acvz
_
2(
}
acz
Jvx
\ 2
)
avy
acvz
_~~~ \ _2( 5- t
}
acvz
au
acvz
_~
\ 2
=
(
ac t
2
ay/
/ Dz
+ agz
\az + aLz
az
az/
/aDz
\avx
+ aLz + agz\
avx
avx
+ aLz + agz\
Dz
avy
avy
avy/
aDz +L
i
t5-t
\2
= DN -
ay
ay
}
( t5- to);
\
avzc
K aDz + aLz + agz
( tf- to);
_
+ agz
+ aLz
ax ax/
) \ iJvz
Jvz
agz
avz/
(C.13)
DzLz + agz
~~ ( 5-oa +
\aa
)
±a
ax/
/aDz + aLz + agz
\ao
Jo
a
Jo/
[0]
=
=
[0 ]
-
[Dz + Lz + gz]
Partial Derivatives of C, Constraint
C,
acx
ax
= DNa -
S [0]
acta S [0]
ay
acaf = [0]
az
acta [0]
avx
ac
actx
av7
S(0]
=
ac
t
-
2
=
ac
a tj
[0]
154
[0]
tf - to
2
aca
au"
aua
(C.14)
DN
=
ac
aci
to u
(C.15)
=
[0]
1 [
2
Partial Derivatives of C, Constraint
t
Ccr= DNO
- to
2
= [0]
[0]
x
acc
a
[0]
=
DN
-
[0]
=[0]
az
3Co,
10]
aucT
[0]
ac
tj - to
2
avx
aCo-
avy
(C.16)
U
(C.17)
1
atf
2
-
[0]
avz
Don
Partial Derivatives of Position Constraint
Cr =
X 2 +y
2
+z
2
(C.18)
aCr
=KX) a~r
aCr
ay
[0]
aCr
[0]
-
aCr
[0]
az
r au"
aCr
aCr
av,
aCr
av2
aCr
=
Cr
ac
at5
-101
[0]
= [0]
=
(C.19)
=
[0]
=
[0]
[0]
Partial Derivatives of Speed Constraint
Co= v2, + v2 + v
155
(C.20)
ax
a f --
[0]
ac
-[0]
C
[0]
acV
0
3C,
ay
azV
u
= [0]
az
/vx
aCV
ac.
(C.21)
aCV
\
avx
[0]
_
av,
\v
3CV
avz
|vz\
au,
-
[0]
a
-
[0]
atf
\v/
Partial Derivatives of Dynamic Pressure Constraint
Cq
(C.22)
= 1pv2
= v2
a
=
V2
=
[0]
_
=
1V)v2
=
[0]
a_
aC4
avx
-
aCq
(pv)
av,
_C
avz
(pv)
=
av
avx
aC4qC.3
au, = [0]
av
at5
av,
0
(pv) av
avz
Partial Derivatives of Sensed Acceleration Constraint
Ca =
L 2 +1D2
156
(C.24)
aL
aCa
ax
aCa
ay
a
az
KD\ 3D
KD)
aK~
a
\a/
aCa
\a
az
aCa
aca
aca
aua
aCa
aua
ay
azL
\a
av,
\a/ avz
avz
aKD)
av,
+
+
aCa
'a
atf
()a
[0]
= [0]
(C.25)
aD
aCa
av,
aca
aL
Ca
[0]
= [0]
aD
D 3D
/D\
\ai/av
Partial Derivatives of Total Heat Load Constraint
Ca =
to) N
k =O
LK
157
()
Pk
Po
3.s51 Wk
/2( Vk ) 3
/2
Ve
I5
J
g
(C.26)
1/2
acQ
ax
aCQ
2
P
Po
12
W
ap
ax
1/2
ay
K
2
aCQ
az
aCQ
Po
p-
2/1
3
1/2
( -1
2
~2
p 1 12
po)
tK - to12
1
)3 is
v
\PO)
v
)
ap
.is
az
v 2.1 5 W
/ v
aCQ
-tv12.poiw
t
t - to
avy
aCQ
aCQ
aua
aCQ
aur
aCQ
1
2.
av
(C.27)
[0]
[0]
[0]
=
=
-
[0]
0]
1/2
a tf
P
2
=
Vk )3.15
7)
(,~L\PO)
jI
Partial Derivatives of Terminal Flight Path Angle Constraint
Cy
rf -Vf
(C.28)
rfvf
XfVx,f + Yf Vy,f + ZfVz,f
rfvf
158
(C.29)
acy
axf
vx,frfvf - (xfvx,f + yfvy,f + zfv z ,f)( rf /axf)vf
(rfvf )2
acy
Vy,5rfvf - (xJvx,f + yfvy,f + zJvz,5)( r/
ayf
yf )vf
(rfvf ) 2
acy
vz,frfvf - (Xfvx,f + Yfvy,f + zfvz,f)(arf /azf)vf
azj
(rfvf )2
acy
avx,5
xfrfv5 - (xJVx,f + Yfvy,f + zfvz,f )rf (arf/avx,f)
(rfvf) 2
acy
yfrfvf - (xfVx,f + YJVy,f + ZfVz,f)rf (aVf/avy,5)
(rfvf )2
avy,f
ZfrfVf
acy
avz,5
acxj
=
[0]
acy
=
[0]
ac
a tf
)rf(aVf5/aVz,f)
(rfvf)z
a5f
acy
aaU
a,f
o-f
(XfVx,f + YfVy,5 + ZfVZ,f
-
(C.30)
= [0]
=
[0]
=
[0]
Partial Derivatives of r
r =
ar
x+
rx
ax
z
y+
ar
o
2
(C.31)
-
(0]
-
[0]
ar
ay
\r /
a0
ar
[0]
r aO
r
ar
[0]
avx
ar
[0]
ar
avy
avz
aua
=
atf
[0]
159
(C.32)
=
[0]
=
[0]
Partial Derivatives of p
p = po exp-
ap
ax
ap
_
ar/ax
-
H
ap
aaf
(p)
ar/ay ()
_
ay
ap
H
ap
avz
(C.34)
ap
= [0]
au
=
[0]
=01at5
at
ap
avy
= [0]
aua
ap
av,
10
air
= ar/az
az
[0]
=
ap
H
ap
(C.33)
(r-Re)IH
= [0]
ap
=[0]
0
0
_
[0]
=_
Partial Derivatives of v
v =
vV+v2,+v
±
S (0]
av
ay
ay
av
=
av
av
30-
= [0]
=
[0]
av
az
aua
av
av
avz
[0]
0]
-
=0]
[
(C.36)
av
Ivx\
v
\v /
av
(C.35)
-
[0]
av
_
Partial Derivatives of CL
CL
CL,O(a
160
(C.37)
aCL
-01
a
aCL
aCL
= [0]
ay
(f
aCL
[0]
acr
aCL
aCL
az
aCL
aua
avx,
-(CL,
a
[0]
[
(C.38)
[0]
=[]
aCL
au,
[0]
[0]
aCL
[0]
aCL
avy
aCL
tx)
at
avz,
Partial Derivatives of CD
aCD
aCD
[0]
ax
[0]
ay
aCL
aCL
[0]
a0-
aCL
aCL
az
aCL=
[0
[0]
aua
=
[0]
a
0
[0]
[
acL
au
avx
a L
(2KCL)
aca
aCL
avy
aCL
(C.39)
CDO + KC
CD
atf
(C.4)
= [0
avz
D
=
12
2
pv2SCD
161
(C.41)
Partial Derivatives of Drag Magnitude
1
D
ax
V2
K1v2SCD9
2
1V2SCD)
ap
K
az
aD
(PSCD)
aD
av,
___
=
(PSCD)
/
1
~
a
az
2
avvx
D
_p
ax
ap
ay
2
aD
ay
\
vSCD)
aD
aor
[0]
aD
[0]
Ju,,
av
aD
a
au,
av
aD
av,
t
2\
pv
aCD
o
(C.42)
[0]
Partial Derivatives of Lift Magnitude
LL=
I
al,
1
2
PV 2 SCL
ap
K
/ax
2
KLI
ap
K2Iv2SCLav/az
ax
2SCL
aL
\2
al,
AL
av,
av
-
avy
aL
av,
a
/
ap
az
/
1
2S
aCL
\ 2 PaV
[0]
-
al
au,
[0]
-[0
(C.44)
aL
au,
(PSCL) av,
(PSCL)
_
ac,
aL
i2SCL
ay
(C.43)
av
aL
av,
atif
av
at
(pSCLav
=v (pSCL)av
[0]
Partial Derivatives of Gravity
g = gxex + gyey + gzez
(C.45)
Partial Derivatives of gx
gx= -
162
x
(C.46)
agx
agx
ax
_ \r3/
agx
ay
agx
_
agx
avy
agx
-
agx
/ay
3px \ ar
Kr4 / az
ao
agx
[0]
agx
aua
au,
_
-
[0]
-
[0]
av2
agf
<3px\ ar
r4
az
ag,
avx
agx
+p( rpx\
ar
4
ax
ax
at5
-
[0]
=
[0]
-
[0]
-
[0]
(C.47)
[0]
Partial Derivatives of gy
gy =
K3piy /ax\ r
3Ay)
K-b
K3y ar r4
ag,
r4
ax
agy
ay
agy
-~~
r4
az
agy
agy
agy
=
[0]
=
[0]
=
[0]
/az
(C.48)
y
agy
a; ag
agy
[0]
[0]
auo
ag
au"
[0]
agy
[0]
atj
[0]
(C.49)
Partial Derivative of gz
gz = -z
163
(C.50)
(yz\
agz
agz
ar
/ ax
/3pz\
ar
4
r lay
ax
r
agz
ay
ag
4
az
K-r3)\rH/az
p + (3pz ar
aa
agz
aug
agz
[0]
agz
avx
agz
avy
au
=
agz
agz
[0]
atf
=
[0]
=
[0]
=
[0]
(C.51)
[0]
-
=
[0]
[0]
avz
Partial Derivatives of Drag
D = Dxex + Dyey + Dzez
(C.52)
Dx = D vx
(C.53)
Partial Derivatives of Dx
vx
aDx
V
aDv
ax
aDx
aDx
avx
aDx
avy
aDx
avz
=
aDx
V - ~~Kvx~a!
azV
au"
vx
D +D
v avx
vx aD
V avy
(vx) D
V avz
[0]
avr
D
vx
_Dvx
av
v
V2
ovx
Dvx)
-- --av--v2
avy
aDx
aua
aDx
a t5
D
\V aa(
aia
aDx
ax
aDx
ay
aDx
az
/vx\
[0]
(C.54)
-
[0]
-
[0]
Dvx) av
v2
avz
Partial De rivatives of Dy
Dy = D
164
Y
(C.55)
aDy
ax
aD,
ay
aD
az
aDy
ax
\v/
Kv
aD,
vy aD
aDv\a
(v y a
\V/ az
aD
vy\ avy
\v/
aDy
avy
aD,
v /avy
avz
V
_
Dv
\v2
avx
/
v
\
-vy- +
/D)
avz
V2 / av
KDvy\
av
\v/
/vy) a
~\v /B
aa
_Dvy
v2
aDc
[0]
aua
aDy
au,
[0]
(C.56)
=
[0]
=
[0]
aD
av
a tf
/avy
Partial Derivatives of Dz
(C.57)
Dz = D V
V
aDz
ax
aDz
ay
aDz
az
aDz
avx
aDz
avy
vz
v
aD
aDz
ax
Ba
aDz
vz aD
a- vz aD-V
az
aD
vz
V avx
- (vz- aDV avy
aa
vz
=
[0]
=
[0]
=
[0]
aDz
aua
Dvz
avx
Dvz
av
avy
v2
aDz
au,
av--
v2
aDz
at
aD
(C.58)
[0]
aDz
avz
vvz
aD
/ avz
D
V
Dvz\
av
v2
avz
Partial Derivatives of Lift
L = Lxex + Lye, + Lzez
Partial Der ivatives
(C.59)
of Lx
Lx = L (sinOw 2 ,x
165
+ cos uw 3 ,x)
(C.60)
Lx
ax
aL (sin
o-w2,x
ALx
ay
aL
Lx
az
Lx
-
ax
ay
-
-
+ COS
avy
aL (sin o-w2x + COS
'Nvz
avz,
aL
aL
(sincow
W3,x ( +
(sin 0-)
'
(L (cos -w 2,x - sinow 3 ,x))
=
[0]
=
[0]
=
[0]
aL
±
aw2,
aw ,x
((sin 0-) 2
(L)
/
(cos o-)
(Cos-)
±(COS
'
a
a'
aw3v )
awsx)
7)
+ (cos o-) a y,
(sin (-)
(L)
,x + cos w 3,x) + (L)
2
+ (cos)
(y
-Ws,x) + (L)
=
aLx
atf
az (sin o-w2,x
aL (sin-w ,x + cos o-w3,x) +
2
-
aua
aua
(sin o) a
(sin o-w2,x + Cos o-w3,x) + (L)
(sinow 2 ,x + cos o-w3,x) + (L) ((sin
av2
aL~
aL
aL,
aL
(sin -) a ' + (cos a)
+ COS -W3,x) + (L)
aw2 ,x + (cos) ac
(sin-)a
(C.61)
Partial Derivatives of Ly
Ly = L (sin Uw 2,y + cos -w3, y)
166
(C.62)
+ cos Uw 3,y) + ( L) ((sin a) a'
=(sin-w2,y
aLy
aL
a-ay
ay
aLy
(sinow2,y + cos o-w 3 ,y) + ( L)
aL (sinow 2 ,y + cos o-w 3,y)
az
=
aL
(sin-w
2 ,y
aLy
± (L)
aLy
((sin
-)
av,
a
+(cos
)
-) ay
+ (cos 0-)
azf
+ (cosa) a
+
(cos 0-)
ay)
+ (cos-) aw 3 y )
+ (cos)
a
)
3 ,y))
[0]
au"
aLy
[0]
au,
aLy
atf
( (sin g-) avy
+ (L) (sin o)
(sinaw 2 ,y + cos o-w 3,y)
(L (cos O-w2 ,y - sin-w
ao-
(L) ((sin a-) av
+ cos w 3,y) + (L)
(sin0-w2 ,y + cos w 3 ,)
A
y
+ ( L) (sin -) az
aw 2y
(sin-w2,y + cosow 3 ,y) +
avy
(sin -)
+ (cos-) a'
=
[0]
(C.63)
Partial Derivatives of Lz
Lz = L (sinUw2,z + cos -w3, z)
167
(C.64)
aLz
aL
ax
aLz
ay
ax
aLz
aL
ay
(sin o-w2 ,2 + cos 0-w3 ,z) +
az
- (cos a-)
axz
+ (cos a) aw3 z)
(sin-w
2 ,2
+ (L) (sin -) aw 2z
+ cos O-w3,z)
(sino-w 2 ,z + cos O-w3,z)
az
(L) (sin-
+ (L)
+-(cos a) az
(sin a-) aw
aLz
az + (cos 0-) awz
-
avx
aLz
aL2
L
-
aLz
aaz
aLz
au
aLz
aL
-
(sin o-)
(sin0-w2,z + cos w 3,z) + (L)
(sin o-)
[0]
au,
=
[0]
at1
=
[0]
avy
+
(cos 0-)
2,z -
sinw
a)z
+ (cos a) aw 3,z
+ Waw 2 z + (cos0-)
(sinow 2 ,z + cos ow3 ,z) +±(L)(sina a-a'
= (L (cos-w
=
aL
(sin-w2,z + cos a-w 3 ,z) + (L)
V
a'
3 ,z))
(C.65)
Partial Derivatives of the Unit Direction wi
v
V
(C.66)
Partial Derivatives of wi,x
wi,x
=
168
vx
v
(C.67)
awlx
ax
[0]
-
"awi,x
ay
awi,x
az
[0]
=
= [0]
awix
awx_
[0]
ao
[0]
aua
vx
=
[0]
av
aw,x
[0
v2
avx
V
Kx)
v
av
au,
awl,x
[0]
V2
av,
atf
Kx)
av
avy
aw,x
aa
awi,x
1
avx
awix
-
avz
(C.68)
\V2 / avz
Partial Derivatives of wi,y
wyy
aiy=
ax
(C.69)
{0]aw~
aa
awi,y = 0
=__
= [0]
wi,y=[0
[0]
=
'w~ = [0]
'w~ = [0]
az
aw)y
Jvx
V2/
awi,y
_1
avy
V
[0]
av
aua
awi,y
vx
au,
vy) av
V2
awi,y
vyy
vz
\ V2/
avy
awi,y
atj
[0]
(C.70)
- [0]
av
avz
Partial Derivatives of wiz
wi,z = Vz
169
(C.71)
aw,
ax
aw,
ay
awl,z
az
aw,
=
[0]
=
[0]
=
[0]
awl,z
acx
awi,z
ao
awl,z
vz
avx
v
vz
avy
v
2
2
aua
av
avx
aw,z
av
avy
awi,z
auf.
at;
=
[0]
=
[0]
=
[0]
(C.72)
[0]
[0]
1\I - vz\ av
Partial Derivatives of Unit Direction w 2
r
x
v
(C.73)
||r x vil
=
(C.74)
w2,xex + w2,yey + w2,zez
Yvz
W2,x =
-
(C.75)
v z )2 + (xvy - yVx)
(ZV
S(Yvz - ZV)
zvy
2
zvx - xvz
w
2 ,y
=
+ (zvx
-
XVz)
2
+ (XVy - Yvx) 2
V(yvz - zvy)2 + (zvx
-
XVz)
2
+ (XvY - yvx) 2
j(Yvz
- zvY)
2
W2,z =
170
(C.76)
(C.77)
Partial derivative Of W2,x
aW2,X,
zvY) [vY (xv
[ (yvz -zV Y)2 + (ZVX
yVZ
_
ax
aW2 ,x
[(yvz
-
zVY) 2+ (ZVX XVZ) 2 ± (XVY - yvx) 2 ]
)2]1/ 2/
VZ) 2 + (XVY -yVx
- zvY) 2 + (ZVX
Vz [(YVz
ay
- xvI)]I
Vz
XV) 2 + (XVy -yVx )2]31/2/
YVx)
-
-
-
-
K(YVz
-zVY)
[(yvz
-VY
[VZ (yvz - ZVY) - VX (xvY - yVx)]
2 3 2
2
2± (ZVX - XVZ) + (XVY - YVx ) ] /
az~
ZVY)
-
2
aW2,x
avx
___x
\
+-(ZVX
-V)
+
-
(ZVX
-
zVY) [z(zvx
~- (
~(YVz - zvY) 2 + (ZVX-
2
XVZ)
-
zVY)[Ivx(zvx
[(yvz
av
2
K(yv
(yv
=
2±
Z _XVZ)
- Y2+
KVy[ (Yvz
aW2,
xvz)
XVZ)
xVz)
-
-
2
-
x
(XVY i
+ (XV y
3
YVx ) ] /
2
-
Vy (YVz
+ (XVY
Y(XV
XIVz) ± (XVY
2
zvY)]I
-
2 3 2
] /
-yVX)
yYvx)]I
2
yVx)
2
]3
1
yVx) 2 ]
XV) 2 ± (XVY
-Z[(YV -zzV Y)2 ± (ZVX
2
2
yVx) 2 ] 31
XVz) + (XVY
[(yv z -ZVY) 2 + (ZV,
/
(C.78)
-( yVz-zv)[X(XVy-YVxVZ(YVz-ZVy)l
2
+ (XVY - yx
zV)2+ (ZVX - XVZ)
-
KY[(vz-zV
avX
-
(YVz
[ (YVz
-
+
(ZVX
-
XVZ)
-ZVy)[Y(YVz
±V)
=
[0]
aW2,x
-
[0]
2
-zvy)
(ZVXXVZ)
aua
aw 2,x
2 ±+(XVY
Y) 2±+(ZVX -XZ)
zvY) 2
K(
[(yvz
au,
/
/
/
[ (yvz
aw,
/
a tf
171
+ (XVY
-x(zv
2 + (XV
-
_x) 2
yVx)
-
v
2
]
3
2
/
xv)]
X) V2]312/
Partial derivative Of w 2 ,y
____y
_
ax
Vz [(yVz -ZVY) +(ZVX -XVZ) 2 +(XV-yVx)2
~ [(yvz - ZV y)2 +±(ZVx - wz) 2 ±+(XVYy-yVX)2]I2/
/(zvx
-
xvz)[Ivy (xv
\(yvZ -zvY)
aW2,y
_
ay
/
az
[(yVz - ZV y)
f\zvx
\(yvz
2
-ZVY)
-
-
2
-
± (zv,
xvz) [V (yvz
[(y~z - ZV y)I +(ZVX
Kvx [(yVz
aW2 ,y
2
-
yvx)
xVi) ]
XV) 2± (XVY - yVx) 2 P/3
avx
-
/
zvy) - vx (xvY - yvx)]
VZ) 2 ±+(xvy
-yv )2]3/2/
+ (ZV
2
XV)
2
+±(ZVx -XVZ)
xvz) IV,,(zvx
±V) (ZVX
+ (XVi - yVx) 2 ]
±+(XV
yv)2]32/
xvz) - Vy (Yvz
-XV) 2 ± (XVi
z [ (YvZ - zvY) 2 + (ZVX ~(yv 2 - ZV )2 ± (ZVX -XVZ)
aW,<y
V Z
-
±V)
2
K(zv
(XVY
±+ (XV
-
-
ZVO)I
YVx
)2 ] 3 /2
/
yVx) 2 ]
/
y yVx) 2] 3 /2
-xVY 2 [z(ZVX XVZ) 2Y+(XV yYVx) 2]3/
-
(V
K
avy(yvz
K
avz
aW2 ,y
aW2,y
±V) (ZVX
-
/(zvx
ZVy ) 2
-
2
XVZ) 2 + (XVy -yVx) 2]
+ (ZVX
+ (ZVX
-
xVz) [Y(YVz
\(YV z- ZVY) 2 + (ZVX
,
[0]
-W
,
[0]
iTou
20]-
10
au"
-W,
[0]
-W,
[0]
au,
at
YX
-z
xII(yvz - zVY)
-W
-
xVz)Ix (xvY - YVx) - z (yv zV
I
- ZV y)2 ± (ZVX - XVZ) 2 + (XV
~yVX) 2 ] 3 /2
I(yv -
act
± (XVY
XVZ)
172
XV)
2
+ (XVY -
zvY)
-XVZ)
-
2
yvx )2]3/2/
x (ZVX - XVI) ]
± (XVY~ - yV
]/
/
(C.79)
(.9
Partial derivative of w2,,z
DW2,z
ax
~[(yvz - zv~) 2 + (ZVX -
K(xvi
[(yVz
aW2,
XV) 2
-
- zVY) + (ZVX -XVZ)
2 ±
XV)
(ZVX -
K(xvY
~~(YVz -ZVX)
3
2 +
az
(I (YvK
2 ±
-zvY)
-
(ZVX
±VY
2
Y I(YV -zVy)
2 +
[(YVz - zvY)
____z
avx
K
aW2z
____
-
XVZ) 2 + (XVi
-
XVZ) 2 + (xviy
XVZ)
(ZVX
+
(ZV,
2 ±
- XVZ)
yx)]/
YVx )2]3/ 2
+ (XVY
(XVX
-
yVx)
2
]
yvx )2]3/2
)/2
-XVZ)
(YVz
K
K
avz
K
aW2,
2
zV ) 2±+(ZVxXVZ)
yv) Iy(xvz
(xvi-Y
(Yvz
zvY)
-
-
2
±
-
yv)
(ZVX - xV)
101(~z
2
z~
a[0]
aor
aux
aW 2,z
2
-
yYVx)I
-
xVz) -y (XVy YVx) Y -Z)2+(V XZ2+(V y)]3/
(xvi - yvx) Iz(zvx
2 + (XVY _ yvx)2 ] 1 /
±VY (ZVX
[(YVz 2
(C.80)
(XVy YVX)l
zV y 2 (ZV -XVZ)
[(YVz
aW 2 ,z
(ZVX
(XVy - yvx) 2 ] 3/ 2/
- xvx
-yvx)[Vz (yvz - zvY)
[(Yvz
yv )2]3/2/
(XVY -
±
XVz)]
YVx)[Vy (Xvy -YVx) -v2 (zvx
2 31 /
2 + (ZVX - XVZ) 2 ± (XV,
]
yVx)
ZVY)
2+ (xv , -yVx) 2]
2
VxK -vI(yvz
ay
2]
Kv[(yVz - ZVY) (~XV,) 2 ±+(XVYyX)
[ 0]
au,
a tj
173
+ (XVY YVx)
- x(z
+
- xzv)]
(XVY - yx
~z~-x)
2 3 2
]
Partial Derivatives of Unit Direction w 3
W3
=
(C.81)
Wi X W 2
(wl,yw2,z - wi,zw 2 ,y)ex
+
(wl,zw2,x -
wi,xw 2 ,z)ey +
(w,xw
2
,y
-
y
Partial Derivatives of W3,x
(C.83)
W3,x = Wl,yW2,z - Wl,zW 2 ,y
aw3,x
ax
-
Ow3,x
-
ay
aw3,x
az
ax
wi,y
W2,z
Wly
ayWi,y
-
-
ax
w2,y W,
-
ay
avx
avy
Ovx
-
aW3,
Ovz
au,
aw3,x
a t;
vx
w2,z + aw 2 ,zvywi,y
w2,z +
w2 ,
vy
aw-
Wl,y ~
avx
avx
W2,y
vx
-
vy
w2,y -
aw
awl
wl,y -
vy
vx
vy
wl,z
wl,z
_w2,y
w
2 ,y
vy
Wl,z
(C.84)
[0]
auo
aw,x
aw3,x
au"
aw3,x
W2,z ±
-wl2y
aws,x
w l ,z
wl__z
___wly
-
wl,z
[0]
=
[0]
[0]
[0]
Partial Derivatives of w 3 ,y
W3,y = Wl,zW2,x - W1,xW2,z
174
(C.85)
aW 2 ,x
aw3,y
aw2,2
ax
ax
ax
aw 2,2
_wzx
ay , 1,z
ay
aW3,y
~ ay
ay
ay
az
W1z-
W2,x
aW3,y
avx
avx
aW3,y
av,
avx
=
W1,x
+
W2,x
aw
2 ,x
1, aW
+ [W1,z
z
'W2,x
+
W2,x
vy
aw,
-
-W1,z
vx
v,
WWx±
aw1
avx
avz
aW2,z
W2,x
-~
aW3,'y
W1,X
vx
_____
NN2,z
-
a
vy
W1,
~
w1,-W2,z
vy
vx
W,
w1,x
W2,z
W1,x
W2,z~
-
w1,x
W,
v,
-
W2,z
vy
w1,x
(C.86)
aW3,y
aua
aw3,y
a o-
aw3,y
au,
=
[0]
= [0]
= [0]
aW3,y
au,
a3,y
= [0]
at5
Partial Derivatives of
W3,z
W3,z
= Wl,xW2,y - W1,yW2,x
175
(C.87)
aW2,y Wx
aW3,z
ax
aW3,z
W2,.x
-
ay
-w1,x
ay
aW3,z
aW 2 ,y
-
az
3
ax
- W2,y Wx
ay
aW
- W2,x W~
ax
-
,z
avx
-aw
=
=
[0]
=
[0]
=
[0]
=
[0]
aw3,
ay
+
1,~~
avxW,
w~
avx
W2,xwy
-
ay
aW1 ,x
W
vx
v
+
w2,y +
w1,x
W2,x
-
-
i~
vi;
WJ~
vx
vx
wi,x
W ,
-
vy
vy
w2,x
-
w 2,x
-
Wi,y
vY
aW2,x W~
(C.88)
vi; wiy
[0]
aty
Objective Gradient
Fgrad
C.2
aF
[
aF
aF
aF
aF
aF
aF
aF
aF
aF
ay aZ avx avy
Ovz
o au" au- atff
aof a&
1 (C.89)
Objective Gradient
N
F = tf 2-to
-
Wk'k
2 k=O
Fk1 0max /2!
/2
2
+
k2 (
Ua,k
(Uamax)
176
±
k3
Uo-,k
\Uo-,max
2
(C.90)
aF
aF
=
[0]
=
[0]
=
[0]
aF
aF
[0]
aF
aF
av,
aF
aua
aF
aua,
aF
=
[0]
=
[0]
=
(tf - to)ki
=
[0]
=
(t5 - to)k
(C.91)
w
Oax&
Uxl
w
2
L
aor
=
(t5 - to)k 3
x,max
w
o-,max
L
1
2 10
Wk
k1
\
a
-
Umax /2!
2]
2
2
+k2(Umk)
177
+
k3
( Uo-,max)
U-,k
[This page intentionally left blank.]
Appendix D
Initial Guess
In order to obtain a solution it is necessary to provide the optimization algorithm
with an initial guess. The closer the guess is to the optimal solution, the less time
it takes the optimizer to find a solution. However, if a poor initial guess is given,
the optimizer may not even converge to an optimal solution. It is often difficult
to generate an initial guess for a problem that is being considered for the first
time, especially for complex problems.
A different formulation of the CAV mission design problem studied in this
thesis was actually solved prior to the work completed in this thesis [201. The
differences between the problem statements stem from the fact that Ref. [20]
formulates the equations of motion in spherical coordinates. The initial guess
was taken to be a converged solution computed by applying the Legendre pseudospectral method described in this thesis and SNOPT. The solution uses the
same constants stated in Chapter 4 and Table D.1 lists the values of the weighting factors used.
Before the converged solution from using spherical coordinates is fed into
ki
k2
k3
.047597
19.828991
17.846091
Table D.1: Values Used to Generate an Initial Guess
179
the optimizer, the solution must be transformed to ECEF Cartesian coordinates.
Coordinate Transformations
Let r = (x, y, z) and v = (vx, vy, vz) denote the ECEF Cartesian position and
velocity, respectively, of a vehicle. In spherical coordinates the position vector
is defined by the geocentric radius r, the Earth relative longitude 0 measured
East from the Prime Meridian, and the geocentric latitude <p measured positively
North from the equatorial plane. Using Fig. D-1 it is evident that the position is
e2e
0
ex
Figure D-1: Spherical Representation of Position with Respect to a Cartesian ECEF
Coordinate System
transformed as
x
= rcos<cos0
(D.1)
y
=
r cos <p sin 0
(D.2)
z
=
r sin <6
(D.3)
The velocity vector is defined by the Earth relative speed v, the Earth relative
flight path angle y, and the heading angle (p. The flight path angle is the angle
between the plane passing through the vehicle that is perpendicular to the position vector (local horizontal) and the velocity vector. When the velocity vector is
above the horizontal, y is positive. The heading angle is positive in the eastward
180
direction. In order to transform the velocity vector, three additional unit vectors
er, eo, and ep are defined as
r
er
r
eo
ee
(D.4)
ez x r
Ilez x r112
(D.5)
x eo
(D.6)
=er
As shown in Fig. D-2 , the velocity vector can then be written as
Y
e,
Figure D-2: Spherical Representation of Velocity with Respect to a Set of Axes
Defined in the Cartesian ECEF Coordinate System
v = v sin yer + v cos y cos peo + v cos y sin qieo
(D.7)
The resulting transformation matrix from spherical coordinates to Cartesian co-
ordinates is denoted by Ts2c where
Ts2c =
[ er eo e+ ]
(D.8)
The velocity is transformed as
L
v sin y
vx
vy
vz
=
J
Ts2c
v cos y cos (P
v cos y sin (p
181
(D.9)
[This page intentionally left blank.]
Appendix E
Earth Relative Downtrack and
Crosstrack
This appendix defines the Earth relative downtrack and crosstrack distances. Let
ro and vo be the initial position and velocity of a vehicle expressed in Cartesian
Earth-centered Earth-fixed (ECEF) coordinates. The three orthogonal unit vectors,
u 1 , u 2 , and u 3 , comprise the downtrack-crosstrack coordinate system and are
defined as:
The u1 -u
2
U1
=
U3
=
U2
=
ro
(E. 1)
ro x vo
(E.2)
Ilrollz
||ro X Vo0ll2
(E.3)
u3 x u 1
plane is the Earth relativedowntrack plane and the u 1 -u
3
is the Earth
relative crosstrack plane. As shown in Fig. E-1, the downtrack angle is denoted
by a while the crosstrack angle is denoted by b. Let r 12 be the projection of the
position vector in the Earth relative downtrack plane and r 3 be the component
of the position vector in the u3 direction. The downtrack and crosstrack angles
183
U3
U2
U1
Figure E-1: Earth Relative Downtrack Plane and Earth Relative Crosstrack Plane
are computed as follows:
(b
riz
a
= arccos
b
=arctan( 11 r31 1 1)
-
rr 2rIIU
2
(E.4)
(E.5)
From these angle, the Earth relative downtrack distance d and Earth relative
crosstrack distance c are then given as
d
=
Rea
(E.6)
c
=
Reb
(E.7)
where Re is the radius of the Earth.
184
Bibliography
[1] Betts, J. T. and Gablonsky, J. M., "A Comparison of Interior Point and
SQP Methods on Optimal Control Problems", M&CT-TECH-02-004, Phantom Works Mathematics and Computing Technology, The Boeing Company, Seattle, WA, March 2002.
[21 Betts, J. PracticalMethods for Optimal Control Using NonlinearProgramming Society for Industrial and Applied Mathematics, Philadelphia, PA,
2001.
[3] Bryson, A., and Ho, Y., Applied Optimal Control, Hemisphere Publishing
Corporation, New York, 1975.
[41 Conway, B. A., and Larson, K. M., "Collocation Versus Differential Inclusion in Direct Optimization," Journal of Guidance, Control, and Dynamics,
Vol. 21, No. 5, 1998, pp. 780-785.
[5] Detra, R. W., Kemp, N. H., and Riddell, F. R., "Addendum to Heat Transfer to
Satellite Vehicles Re-entering the Atmosphere," Journalof Jet Propulsion,
December, 1957.
[61 Dyckman, T. R., Benchmark Characterizationfor Reusable Launch Vehi-
cle Onboard Trajectory Generation Using A Legendre PseudospectralOptimization Method S. M. Thesis, Department of Aeronautics and Astronautics, MIT, June 2001.
185
[71 Elnagar, J., Kazemi, M. A., and Razzaghi, M., "The Pseudospectral Legendre
Method for Discretizing Optimal Control Problems," IEEE Transactionson
Automatic Control,Vol. 40, No. 10, 1995, pp. 1793-1796.
[81 Erbland, P. J., Duffield, C., and Gillard, Homan W., "Technology Development Roadmap for the Space Operations Vehicle," Proceedingsof the AIAA
Space 2001 Conference and Exposition, AIAA-2001-4604, Albuquerque,
NM, 28-30 August 2001.
[91 Fahroo, F., and Ross, I. M., "A Spectral Patching Method for Direct Trajectory Optimization," Journalof AstronauticalSciences,Vol. 48, Nos. 2 and 3,
2000, pp. 269-286.
[10] Fahroo, F., and Ross, I. M., "Computational Optimal Control by Spectral
Collocation with Differential Inclusion," Proceedingsof the 1999 Flight Mechanics Symposium, NASA CP-1999-209235, 1999, pp. 185-200.
[11] Fahroo, F., and Ross, . M., "Costate Estimation by a Legendre Pseudospectral Method," Journalof Guidance, Control, and Dynamics, Vol. 24, No. 2,
2001, pp.270-277.
[12] Fornberg, B., A PracticalGuide to PseudospectralMethods, Cambridge University Press, New York, 1996.
[13] Gill, P. E., Murray, W., and Saunders, M. A., "SNOPT: An SQP Algorithm
for Large-Scale Constrained Optimization," SIAMJournal on Optimization,
Vol.12, No.4, 2002, pp. 979-1006
[14] Gill, P. E., Saunders, M. A., and Wright, M. H., "User's Guide for NPSOL 5.0:
A Fortran Package for Nonlinear Programming", Technical Report SOL 862, Department of Mathematics, University of California, San Diego, 30 July
1998.
[151 Gill, P. E., Murray, W., and Saunders, M. A., "User's guide for SNOPT 5.3: A
Fortran package for large-scale nonlinear programming". Technical Report
186
97-5, Department of Mathematics, University of California, San Diego, La
Jolla, CA, February 1999.
[16] Gottlieg, David and Orszag, Steven A., NumericalAnalysis of Spectral Methods: Theory and Applications Society for Industrial and Applied Mathematics, Philadelphia, PA, 1997.
[171 Kirk, Donald E., Optimal Control Theory: An Introduction, Prentice-Hall
Inc., New Jersey, 1970.
[18] O'Reilly, Oliver M. , Engineering Dynamics A primer, Springer, New York,
2001.
[19] Rao, Anil V., "Extension of a Pseudospectral Legendre Method to Multiple-
Phase Optimal Control Problems" Proceedingsof the AIAA Guidance, Navigation, and Control Conference and Exhibit, AIAA-2003-5634, Austin, TX,
11-14 August 2003.
[20] Rao, Anil V., Clarke, K., "Performance Optimization of a Maneuvering Reentry Vehicle Using a Legendre Pseudospectral Method", Proceedings of
the AIAA Guidance, Navigation, and Control Conference and Exhibit, AIAA
2002-4885, Monterey, CA, 5-8 August 2002.
[211 Rea, J. R., A Legendre PseudospectralMethod for Rapid Optimization of
Launch Vehicle Trajectories.S. M. Thesis, Department of Aeronautics and
Astronautics, MIT, June 2001.
[22] Regan, F. J., and Anandakrishnan, S. M., Dynamics ofAtmospheric Re-Entry,
American Institute of Aeronautics and Astronautics, Washington, D. C.,
1993.
[231 Richie, G, "The Common Aero Vehicle: Space Delivery System Of The Future", Proceedingof the AIAA Space Technology Conference and Exposition,
AIAA-99-42026, Albuquerque, NM, 28-30 September 1999.
187
[241 Scott, W. B., "Rapid Response", Aviation Week & Space Technology, 7 April,
2003, pp. 66-70.
[25] Sills, L. G., "Spaced-Based Global Strike: Understanding Strategic and Military Implications", Occasional Paper No. 24, Center for Strategy and Technology Air War College, August 2001.
[26] Slivinsky, S., Galloway, R., Breitling, Sl, and Wilborn, C., "Missile Technology Demonstration 3 (MTD-3) Demonstrating Global Position System
(GPS) Technology in an application to Range Safety Tracking and Precision
Guidance, Navigation and Control", Proceedings of the AIAA Space Technology Conference and Exposition, AIAA-99-4432, Albuquerque, NM, 28-30
September 1999.
[271 Trefethen, L., Spectral Methods in MATLAB, Society for Industrial and Applied Mathematics, Philadelphia, Pennsylvania, 2000.
[281 Vinh, N. X., Busemann, A., and Culp, R. D., Hypersonic and PlanetaryEntry
FlightMechanics, University of Michigan Press, Ann Arbor, MI, 1980.
[29] Vinh, N. X., Optimal Trajectories In Atmospheric Flight, Elsevier Scientific
Publishing Company, New York, 1981.
188