Robust and Reconfigurable Flight Control by Neural Networks. Silvia Ferrari and Mark Jensenius, Department of Mechanical Engineering, Duke University. Crystal City, VA, September 28, 2005.

Presentation transcript:

Silvia Ferrari and Mark Jensenius Department of Mechanical Engineering Duke University Crystal City, VA, September 28, 2005 Robust and Reconfigurable Flight Control by Neural Networks

A Multiphase Learning Approach for Automated Reasoning
Applications: control, identification, planning, routing, scheduling, ...
Diagram labels: "Supervised Learning", "Reinforcement Learning", "On-line".
The same performance metric is optimized during both phases.

Introduction
Stringent operational requirements introduce:
- Complexity
- Nonlinearity
- Uncertainty
Classical/neural synthesis of control systems:
- A-priori control knowledge
- Adaptive neural networks
Dual heuristic programming adaptive critic architecture:
- The action network takes immediate control action
- The critic network evaluates the action network performance

Motivation
Sigmoidal neural networks for control: coping with complexity
- Applicability to nonlinear systems
- Applicability to multivariable systems
- Batch and incremental training
- Closed-loop stability and robustness by IQCs
- Constrained training for robust on-line adaptation

Design Approach
[Flowchart labels: Modeling, Linearizations, Linear Control, Initialization, On-line Training, Full Envelope Control.]

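Nonlinear Dynamical System
Full-scale aircraft simulation. The slide defines the state vector, control vector, vector of parameters, and output vector (equations not preserved in the transcript).
[Figure: aircraft body axes X_B, Y_B, Z_B with thrust, drag, lift, weight mg, velocity V, and angular rates p, q, r.]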

Classical Control Design
Linearizations (equations not preserved in the transcript).
Classical linear designs:
- Multivariable control (PI)
- Multi-objective synthesis (LMI)
[Figure: flight envelope and design points k, altitude (m) versus velocity (m/s), with three angles set to zero (symbols lost in extraction).]

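One-Hidden-Layer Sigmoidal Neural Network
Input: p, with elements p_1, p_2, ..., p_q
Output: z = NN(p)
Hidden nodes: s
Input-to-node variables: n_1, n_2, ..., n_s, with $n = Wp + d$
Adjustable parameters: W, d, v
Output equation: $z = v^T \sigma[Wp + d]$
Gradient equations: $\partial z / \partial p_j = \sum_{i=1}^{s} v_i \sigma'(n_i) w_{ij}$, $j = 1, \ldots, q$
[Figure: network diagram with input weights w_11, ..., w_sq, input biases d_1, ..., d_s, sigmoids σ_1, ..., σ_s, output weights v_1, ..., v_s, and output bias b.]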
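The output and gradient equations on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation. The slide does not state the sigmoid, so the form (e^n - 1)/(e^n + 1) is an assumption, and the function names are hypothetical. The output bias b shown in the diagram is omitted because it does not appear in the slide's output equation.

```python
import math

def sigmoid(n):
    # Assumed sigmoid (not given on the slide): (e^n - 1) / (e^n + 1) = tanh(n / 2).
    return math.tanh(n / 2.0)

def sigmoid_prime(n):
    # Derivative of tanh(n / 2) with respect to n.
    return 0.5 * (1.0 - math.tanh(n / 2.0) ** 2)

def input_to_node(W, d, p):
    # n = W p + d, one entry per hidden node.
    return [sum(W[i][j] * p[j] for j in range(len(p))) + d[i]
            for i in range(len(d))]

def nn_output(W, d, v, p):
    # z = v^T sigma[W p + d] for a scalar-output network.
    n = input_to_node(W, d, p)
    return sum(v[i] * sigmoid(n[i]) for i in range(len(v)))

def nn_gradient(W, d, v, p):
    # dz/dp_j = sum_i v_i sigma'(n_i) w_ij, for j = 1, ..., q.
    n = input_to_node(W, d, p)
    return [sum(v[i] * sigmoid_prime(n[i]) * W[i][j] for i in range(len(v)))
            for j in range(len(p))]
```

Because the gradient is available in closed form, both the network output and its slope can be matched to prescribed values, which is what the algebraic training slides that follow rely on.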

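General Algebraic Training Approach
Training set and requirements: equations not preserved in the transcript.
[Figure labels: known neural network, input, output, gradient.]
Output and gradient equations at training point k:
$u^k = v^T \sigma[W x^k + d]$
$(c^k)^T = W^T \{ v \odot \sigma'[W x^k + d] \}$
Initialization equations:
$u = S v$
$c^k = B^k W$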
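The output initialization equation u = Sv is linear in v once the input weights W and biases d are fixed. The sketch below illustrates that single step under stated assumptions: the number of hidden nodes equals the number of training pairs so that S is square, the sigmoid is tanh(n/2), and all function names and numerical values are hypothetical. It does not reproduce the gradient equations or the authors' full procedure.

```python
import math

def sigmoid(n):
    # Assumed sigmoid (not given on the slide): tanh(n / 2).
    return math.tanh(n / 2.0)

def solve_linear(A, b):
    # Gaussian elimination with partial pivoting for a square system A x = b.
    size = len(b)
    M = [list(A[i]) + [b[i]] for i in range(size)]
    for col in range(size):
        pivot = max(range(col, size), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("S is singular; choose different W and d")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, size):
            f = M[r][col] / M[col][col]
            for c in range(col, size + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * size
    for i in range(size - 1, -1, -1):
        tail = sum(M[i][c] * x[c] for c in range(i + 1, size))
        x[i] = (M[i][size] - tail) / M[i][i]
    return x

def sigmoid_matrix(W, d, X):
    # S[k][i] = sigma(n_i^k), with n^k = W x^k + d.
    return [[sigmoid(sum(W[i][j] * x[j] for j in range(len(x))) + d[i])
             for i in range(len(d))] for x in X]

def train_output_weights(W, d, X, u):
    # Solve u = S v for the output weights v.
    return solve_linear(sigmoid_matrix(W, d, X), u)

def nn_output(W, d, v, x):
    n = [sum(W[i][j] * x[j] for j in range(len(x))) + d[i] for i in range(len(d))]
    return sum(v[i] * sigmoid(n[i]) for i in range(len(v)))

# Hypothetical data: three scalar inputs, three hidden nodes.
W = [[1.0], [2.0], [-1.0]]
d = [0.0, -1.0, 0.5]
X = [[-1.0], [0.0], [1.0]]
u = [0.2, -0.1, 0.4]
v = train_output_weights(W, d, X, u)
```

With v computed this way, the network reproduces each target u^k exactly at its training input x^k, up to floating-point error.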

Gradient-based Algebraic Training
Notation:
- n: vector of all input-to-node constants, $n_i^k$
- c: vector of feedback gains
- b: output bias vector
Assume each input-to-node variable $n_i^k$ is a known constant. Then n is known, and the initialization equations can be written as three linear algebraic systems (equations not preserved in the transcript), to be solved for $w_a$, $w_x$, and $v$, respectively. The definitions use the Vec operator.

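Initialization Matrices
- A: sparse matrix of scheduling variables (dimensions garbled in the transcript as "p 2 3s")
- S: (p × s) matrix of sigmoidal functions of n
- X: (np × ns) sparse matrix evaluated from v and n
where k = 1, 2, ..., p (matrix definitions not preserved in the transcript).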

Comparison of Initialized PI NN and Linear Controllers
[Figure: aircraft response to a climb-angle command input at interpolating conditions (H_0, V_0) = (2 km, 95 m/s). Panels: velocity (m/s) and climb angle (deg) versus time (sec), for a large-angle maneuver and a small-angle maneuver. Legend: linear control, initialized neural network control.]

Stability Analysis via Integral Quadratic Constraints (IQCs)
Standard feedback interconnection between a transfer matrix G(s), or LTI system, and a causal bounded operator Δ, with interconnection signals w and v.
IQC Stability Theorem: if there exists ε > 0 such that the theorem's inequality holds (inequality not preserved in the transcript), then the interconnection is stable.
Equivalent LMI feasibility problem with positive, real parameters p_i and symmetric matrix P (LMI not preserved in the transcript).

Closed-loop Stability of Neural Network Controller
The closed-loop system, comprised of the NN controller and an LTI model, is a Lure-type system.
Applying the IQC Stability Theorem: Δ is a bounded, causal, diagonal operator with repeated nonlinearities that are monotonically non-decreasing, slope-restricted, and belong to [0, 0.5].
Thus, the stability of the NN-controlled system is guaranteed if there exist constant symmetric matrices $M, P \in \mathbb{R}^{s \times s}$ that satisfy the slide's LMIs (not preserved in the transcript) for i = 1, ..., s, with $B_N = BV$ and $C_N = WC_a$.
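The [0, 0.5] bound on the nonlinearity can be checked numerically for a specific sigmoid. The sketch below assumes the sigmoid (e^n - 1)/(e^n + 1) = tanh(n/2), which the slide does not state; for that choice both the slope and the ratio sigma(n)/n stay within [0, 0.5]. The function names are hypothetical, and the LMI feasibility test itself is not shown.

```python
import math

def sigmoid(n):
    # Assumed sigmoid (not given on the slide): tanh(n / 2).
    return math.tanh(n / 2.0)

def sigmoid_slope(n):
    # Analytic derivative of tanh(n / 2).
    return 0.5 * (1.0 - math.tanh(n / 2.0) ** 2)

def slope_and_sector_bounds(n_min=-10.0, n_max=10.0, samples=2001):
    # Sample the slope sigma'(n) and the sector ratio sigma(n) / n on a grid.
    slopes, ratios = [], []
    for i in range(samples):
        n = n_min + (n_max - n_min) * i / (samples - 1)
        slopes.append(sigmoid_slope(n))
        if n != 0.0:
            ratios.append(sigmoid(n) / n)
    return (min(slopes), max(slopes)), (min(ratios), max(ratios))

(slope_lo, slope_hi), (ratio_lo, ratio_hi) = slope_and_sector_bounds()
```

A logistic sigmoid 1/(1 + e^-n) would instead have a maximum slope of 0.25, so the bound used in the LMIs depends on which sigmoid the controller actually uses.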

Adaptive Critic On-line Adaptation
[Block diagram: blocks NN_F, NN_A, NN_C, SVG, and CSG. Signals: y_c, x(t), u(t), u_c, x_c, y_s(t), e, a, and the critic output ∂V/∂x_a at time t.]

Dynamic Programming Approach
By the Principle of Optimality, the minimization of J can be imbedded in the minimization of V(t).
For a path through points a, b, and c: $V^*_{abc} = V_{ab} + V^*_{bc}$
[Figures: J* over the interval t_0 to t_f with terminal cost; V* from time t to t_f with terminal cost.]
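The relation V*_abc = V_ab + V*_bc can be illustrated with a backward recursion over a small staged decision problem. This is a generic dynamic programming sketch, not the flight-control formulation from the talk; the cost tables and function name are hypothetical.

```python
def cost_to_go(stage_costs, terminal_cost):
    # stage_costs[t][i][j]: cost of moving from state i at stage t
    # to state j at stage t + 1. Returns the optimal cost-to-go at
    # stage 0 and the optimal next-state choice for every stage and state.
    V = list(terminal_cost)
    policy = []
    for costs in reversed(stage_costs):
        new_V, choice = [], []
        for row in costs:
            # Principle of Optimality: first-step cost plus optimal remainder.
            totals = [row[j] + V[j] for j in range(len(V))]
            best = min(range(len(totals)), key=totals.__getitem__)
            new_V.append(totals[best])
            choice.append(best)
        V = new_V
        policy.insert(0, choice)
    return V, policy

# Hypothetical two-stage, two-state example.
stage_costs = [
    [[1, 4], [2, 1]],  # stage 0 -> stage 1
    [[3, 1], [1, 5]],  # stage 1 -> stage 2
]
terminal_cost = [0, 1]
V0, policy = cost_to_go(stage_costs, terminal_cost)
```

Each stage reuses the optimal cost-to-go of the stage after it, so the search never re-evaluates complete paths.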

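Dual Heuristic Programming
The slide states three relations (equations not preserved in the transcript):
- Recurrence relation [Howard, 1960]
- Action network criterion (optimality condition), which sets the NN_A target at t
- Critic network criterion, which sets the NN_C target at t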
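The slide's own equations did not survive extraction, so the following sketch uses a standard discrete-time form of the dual heuristic programming relations on a scalar example. Everything specific is an assumption: the linear plant x(t+1) = a x + b u, the quadratic cost L = (q x^2 + r u^2)/2, the linear feedback u = -k x, and all function names. The critic approximates lambda = dV/dx, and its target follows from differentiating the recurrence V(x(t)) = L + V(x(t+1)).

```python
def closed_loop_cost_weight(a, b, q, r, k, iters=2000):
    # For u = -k x, V(x) = p x^2 / 2 with p = q + r k^2 + (a - b k)^2 p.
    p = 0.0
    for _ in range(iters):
        p = q + r * k * k + (a - b * k) ** 2 * p
    return p

def riccati_gain(a, b, q, r, iters=2000):
    # Scalar discrete-time Riccati iteration and the resulting optimal gain.
    P = q
    for _ in range(iters):
        P = q + a * a * P - (a * b * P) ** 2 / (r + b * b * P)
    return P, a * b * P / (r + b * b * P)

def critic_target(x, a, b, q, r, k, critic):
    # Target for lambda(t): total derivative of L + V(x(t+1)) w.r.t. x(t).
    u = -k * x
    x_next = a * x + b * u
    du_dx = -k
    return q * x + du_dx * (r * u) + (a + b * du_dx) * critic(x_next)

def action_residual(x, u, a, b, r, critic):
    # Optimality condition: dL/du + (dx(t+1)/du) * lambda(t+1) = 0.
    return r * u + b * critic(a * x + b * u)

a, b, q, r = 0.9, 0.5, 1.0, 1.0
p = closed_loop_cost_weight(a, b, q, r, k=0.4)
P_opt, k_opt = riccati_gain(a, b, q, r)
```

In this example a critic that already equals the true gradient p x reproduces itself through `critic_target`, and the optimal gain drives `action_residual` to zero. Those are the fixed points the two network criteria aim for.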

Action/Critic Network On-line Learning, at Time t
The (action/critic) network must meet its target.
- E: network performance
- Network error (symbol lost in extraction)
- w: network weights
Modified Resilient Backpropagation (RProp) minimizes E w.r.t. w. Starting from $w(t) = w_0$, the weights are updated as $w_{l+1} = w_l + \Delta w_l$ to produce $w(t + 1)$.
[Figure: target generation loop around the network NN.]
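The update w_{l+1} = w_l + Δw_l can be illustrated with a basic resilient backpropagation loop. The slide refers to a modified RProp without giving its details, so this sketch uses a common textbook variant in which the update is skipped after a gradient sign change. The parameter values, function name, and test function are assumptions.

```python
def rprop_minimize(grad_fn, w0, steps=200, eta_plus=1.2, eta_minus=0.5,
                   step_init=0.1, step_min=1e-6, step_max=1.0):
    # Per-weight step sizes adapt to the sign of the gradient only;
    # the gradient magnitude is never used.
    w = list(w0)
    step = [step_init] * len(w)
    prev_grad = [0.0] * len(w)
    for _ in range(steps):
        grad = list(grad_fn(w))
        for i in range(len(w)):
            product = grad[i] * prev_grad[i]
            if product > 0:
                # Same sign as last time: grow the step.
                step[i] = min(step[i] * eta_plus, step_max)
            elif product < 0:
                # Sign flipped: shrink the step and skip this update.
                step[i] = max(step[i] * eta_minus, step_min)
                grad[i] = 0.0
            if grad[i] > 0:
                w[i] -= step[i]
            elif grad[i] < 0:
                w[i] += step[i]
            prev_grad[i] = grad[i]
    return w

def quadratic_grad(w):
    # Gradient of the hypothetical error E = (w0 - 3)^2 + 10 (w1 + 1)^2.
    return [2.0 * (w[0] - 3.0), 20.0 * (w[1] + 1.0)]

w_final = rprop_minimize(quadratic_grad, [0.0, 0.0])
```

Because only gradient signs are used, the poorly scaled second coordinate converges about as quickly as the first, which is one reason sign-based updates are attractive for on-line training.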

Adaptive vs. Fixed NN Controllers During a Coupled Maneuver
[Figure: aircraft response at (H_0, V_0) = (2 km, 95 m/s). Panels: velocity (m/s), climb angle (deg), roll angle (deg), and sideslip angle (deg) versus time (sec). Legend: adaptive critic neural control, fixed neural control, command input.]

Adaptive vs. Fixed NN Controllers During a Large-Angle Maneuver
[Figure: aircraft response at (H_0, V_0) = (7 km, 160 m/s). Panels: velocity (m/s), climb angle (deg), roll angle (deg), and sideslip angle (deg) versus time (sec). Legend: adaptive critic neural control, fixed neural control, command input.]

Adaptive vs. Fixed NN Controllers During a Large-Angle Maneuver
[Figure: control history at (H_0, V_0) = (7 km, 160 m/s). Panels: δT (%), δS (deg), δA (deg), and δR (deg) versus time (sec). A second plot shows the trajectory in altitude (m), north (m), and east (m). Legend: adaptive critic neural control, fixed neural control.]

Fixed Neural Controller Performance in the Presence of Control Failures
[Figure: aircraft response and control history at (H_0, V_0) = (3 km, 100 m/s). Response panels: V (m/s) and four angles (deg; symbols lost in extraction) versus time (sec). Control panels: δT (%), δS (deg), δA (deg), and δR (deg) versus time (sec). Legend: fixed neural control, command input.]
Control failures:
- δT = 0, 0 ≤ t ≤ 10 sec
- δS = 0, 5 ≤ t ≤ 10 sec
- δR = -34 deg, t [relation lost in extraction] 5 sec
- δR = 0, 5 ≤ t ≤ 10 sec

Adaptive vs. Fixed NN Controllers in the Presence of Control Failures
[Figure: control history at (H_0, V_0) = (7 km, 160 m/s). Panels: δT (%), δS (deg), δA (deg), and δR (deg) versus time (sec). Legend: adaptive critic neural control, fixed neural control.]
Control failures (10 ≤ t ≤ 15 sec):
- δT max = 50%
- δR = -15 deg

Adaptive vs. Fixed NN Controllers in the Presence of Control Failures
[Figure: aircraft response after t = 10 sec at (H_0, V_0) = (3 km, 100 m/s). Panels: velocity (m/s), climb angle (deg), roll angle (deg), sideslip angle (deg), yaw angle (deg), and angle of attack (deg) versus time (sec). Legend: adaptive critic neural control, fixed neural control, command input.]

Robust Adaptation: Constrained Algebraic Training
[Network diagram with labels M_1, M_2, ã_1, ã_2, a, W_A, W_R, V, x̃_a, ũ, and b.]

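Neural Network Weights Partitioning
[Diagram labels: constrained weights and unconstrained weights, covering b, A, W_A, and one further weight set whose symbol was lost in extraction; zero and randomized initialization; design points; hyperspherical initialization; construction functions.]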

Interpolation Point

Controller Performance at Interpolation Point
[Two slides of plots. Legend: linear, non-adapting neural, adapting neural.]

On-line Cost Optimization through Adaptation
[Plot. Legend: linear, non-adapting neural, adapting neural.]

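Adaptive NN Controller Performance at Design Points
Action neural network mean-squared error (MSE) at t = 0 sec, t = 5 sec, and t = 10 sec. Most numerical values were lost in extraction; "?" marks a lost entry.
Constrained output MSE:      ?, ?, ? x 10^-7
Unconstrained output MSE:    ?, ?, ? x 10^11
Constrained gradient MSE:    ?, ?, ?
Unconstrained gradient MSE:  ?, ?, ? x 10^-4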

Extrapolation Point

Controller Performance at Extrapolation Point
[Two slides of plots. Legend: linear, non-adapting neural, adapting neural.]

Summary of Results
Properties of the learning control system:
- Improves global performance
- Lends itself to stability and robustness analysis via IQCs
- Preserves prior knowledge through constrained training
- Suspends and resumes adaptation, as appropriate
Future work:
- Computational complexity
- Aircraft system identification by neural networks
- Stochastic effects
- Optimal estimation
Acknowledgment: This research is funded by the National Science Foundation.

Silvia Ferrari Department of Mechanical Engineering Duke University Many Thanks to: Mark Jensenius Robust and Reconfigurable Flight Control by Neural Networks

Backup Slides

Proportional-Integral Neural Network Controller
[Block diagram labels: C_B, P, C_I, C_F, f[] = 0; networks NN_F, NN_B, NN_I, NN_C; blocks SVG and CSG. Signals: y_c(t), x(t), u(t), u_c(t), u_B(t), u_I(t), x_c(t), y_s(t), e(t), a(t), and one time signal whose symbol was lost in extraction. Legend: algebraic initialization, on-line training.]

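Feedback Neural Network Initialization
Linear optimal control law: equation not preserved in the transcript.
Initialization requirements: at each design point k, requirements (R1) and (R2) must hold (equations not preserved in the transcript).
[Figure: feedback network NN_B with output z_B(t) and input a.]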

Development of Feedback Initialization Equations
Feedback neural network initialization equations (not preserved in the transcript):
- Network output, for j = 1, 2, ..., q
- Network gradient, for l = 1, 2, ..., m, defined using the l-th row of a matrix whose symbol was lost in extraction