Training an Adaptive Critic Flight Controller

Slides:



Advertisements
Similar presentations
1 Backpropagation Neural Network for Soil Moisture Retrieval Using NAFE05 Data : A Comparison Of Different Training Algorithms Soo See Chai Department.
Advertisements


Neural Network Control of a Hypersonic Inlet Joint University Program Meeting April 05, 2001 Nilesh V. Kulkarni Advisors Prof. Minh Q. Phan Dartmouth College.
Optimal Control of the Magneto-hydrodynamic Generator at the Scramjet Inlet Joint University Program Meeting June 12, 2002 Nilesh V. Kulkarni Advisors.
Particle Swarm Optimization (PSO)  Kennedy, J., Eberhart, R. C. (1995). Particle swarm optimization. Proc. IEEE International Conference.
Lecture 11: Recursive Parameter Estimation
280 SYSTEM IDENTIFICATION The System Identification Problem is to estimate a model of a system based on input-output data. Basic Configuration continuous.
February 24, Final Presentation AAE Final Presentation Backstepping Based Flight Control Asif Hossain.
Arizona State University DMML Kernel Methods – Gaussian Processes Presented by Shankar Bhargav.
CH 1 Introduction Prof. Ming-Shaung Ju Dept. of Mechanical Engineering NCKU.
Control of a Ball & Plate System Andrew Percy Academic Supervisor – James Welsh.
CL A Coordinated Flight of Uninhabited Air Vehicles Olivier Laplace Princeton University FAA/NASA Joint University Program Quarterly Review - April, 2001.
Neural Networks. Background - Neural Networks can be : Biological - Biological models Artificial - Artificial models - Desire to produce artificial systems.
Silvia Ferrari Princeton University
Performance Optimization of the Magneto-hydrodynamic Generator at the Scramjet Inlet Nilesh V. Kulkarni Advisors: Prof. Minh Q. Phan Dartmouth College.
Introduction to estimation theory Seoul Nat’l Univ.
„Bandwidth Extension of Speech Signals“ 2nd Workshop on Wideband Speech Quality in Terminals and Networks: Assessment and Prediction 22nd and 23rd June.
Seminar at Monash University, Sunway Campus, 14 Dec 2009 Mobile Robot Navigation – some issues in controller design and implementation L. Huang School.
A Shaft Sensorless Control for PMSM Using Direct Neural Network Adaptive Observer Authors: Guo Qingding Luo Ruifu Wang Limei IEEE IECON 22 nd International.
Adaptive Critic Design for Aircraft Control Silvia Ferrari Advisor: Prof. Robert F. Stengel Princeton University FAA/NASA Joint University Program on Air.
20/10/2009 IVR Herrmann IVR: Introduction to Control OVERVIEW Control systems Transformations Simple control algorithms.
Optimal Nonlinear Neural Network Controllers for Aircraft Joint University Program Meeting October 10, 2001 Nilesh V. Kulkarni Advisors Prof. Minh Q. Phan.
Artificial Neural Networks An Overview and Analysis.
© Copyright 2004 ECE, UM-Rolla. All rights reserved A Brief Overview of Neural Networks By Rohit Dua, Samuel A. Mulder, Steve E. Watkins, and Donald C.
11 CSE 4705 Artificial Intelligence Jinbo Bi Department of Computer Science & Engineering
Neural Network Based Online Optimal Control of Unknown MIMO Nonaffine Systems with Application to HCCI Engines OBJECTIVES  Develop an optimal control.
1 Adaptive, Optimal and Reconfigurable Nonlinear Control Design for Futuristic Flight Vehicles Radhakant Padhi Assistant Professor Dept. of Aerospace Engineering.
Using Neural Networks to Predict Claim Duration in the Presence of Right Censoring and Covariates David Speights Senior Research Statistician HNC Insurance.
CSDA Conference, Limassol, 2005 University of Medicine and Pharmacy “Gr. T. Popa” Iasi Department of Mathematics and Informatics Gabriel Dimitriu University.
1 RECENT DEVELOPMENTS IN MULTILAYER PERCEPTRON NEURAL NETWORKS Walter H. Delashmit Lockheed Martin Missiles and Fire Control Dallas, TX 75265
Optimal Feedback Control of the Magneto-hydrodynamic Generator for a Hypersonic Vehicle Nilesh V. Kulkarni Advisors: Prof. Minh Q. Phan Dartmouth College.
Johann Schumann and Pramod Gupta NASA Ames Research Center Bayesian Verification & Validation tools.
Time-Varying Angular Rate Sensing for a MEMS Z-Axis Gyroscope Mohammad Salah †, Michael McIntyre †, Darren Dawson †, and John Wagner ‡ Mohammad Salah †,
1 Adaptive Control Neural Networks 13(2000): Neural net based MRAC for a class of nonlinear plants M.S. Ahmed.
INVENTORY CONTROL AS IDENTIFICATION PROBLEM BASED ON FUZZY LOGIC ALEXANDER ROTSHTEIN Dept. of Industrial Engineering and Management, Jerusalem College.
HQ U.S. Air Force Academy I n t e g r i t y - S e r v i c e - E x c e l l e n c e Improving the Performance of Out-of-Order Sigma-Point Kalman Filters.
CHAPTER 5 S TOCHASTIC G RADIENT F ORM OF S TOCHASTIC A PROXIMATION Organization of chapter in ISSO –Stochastic gradient Core algorithm Basic principles.
Advanced Control of Marine Power System
S ystems Analysis Laboratory Helsinki University of Technology Automated Solution of Realistic Near-Optimal Aircraft Trajectories Using Computational Optimal.
Akram Bitar and Larry Manevitz Department of Computer Science
Smart Icing Systems Review, June 19-20, Aircraft Autopilot Studies Petros Voulgaris Vikrant Sharma University of Illinois.
1 Moscow aviation institute Pilot – aircraft system design provided the necessary level of flight safety prof. Efremov A.V., Ph.D Dean of aeronautical.
Smart Icing System Review, September 30 – October 1, 2002 Autopilot Analysis and EP Scheme for the Twin Otter under Iced Conditions. Vikrant Sharma University.
7/6/99 MITE1 Fully Parallel Learning Neural Network Chip for Real-time Control Students: (Dr. Jin Liu), Borte Terlemez Advisor: Dr. Martin Brooke.

Adaptive Optimal Control of Nonlinear Parametric Strict Feedback Systems with application to Helicopter Attitude Control OBJECTIVES  Optimal adaptive.
Professor : Ming – Shyan Wang Department of Electrical Engineering Southern Taiwan University Thesis progress report Sensorless Operation of PMSM Using.
Effects of System Uncertainty on Adaptive-Critic Flight Control Silvia Ferrari Advisor: Prof. Robert F. Stengel Princeton University FAA/NASA Joint University.
Cameron Rowe.  Introduction  Purpose  Implementation  Simple Example Problem  Extended Kalman Filters  Conclusion  Real World Examples.
NEURAL NETWORKS LECTURE 1 dr Zoran Ševarac FON, 2015.
Dynamic Neural Network Control (DNNC): A Non-Conventional Neural Network Model Masoud Nikravesh EECS Department, CS Division BISC Program University of.
The Unscented Kalman Filter for Nonlinear Estimation Young Ki Baik.
Neural Networks. Background - Neural Networks can be : Biological - Biological models Artificial - Artificial models - Desire to produce artificial systems.
Towards Adaptive Optimal Control of the Scramjet Inlet Nilesh V. Kulkarni Advisors: Prof. Minh Q. Phan Dartmouth College Prof. Robert F. Stengel Princeton.
Silvia Ferrari and Mark Jensenius Department of Mechanical Engineering Duke University Crystal City, VA, September 28, 2005 Robust and.
© Crown copyright Met Office Wind and turbulence measurements on the BAe146 Phil Brown, OBR Conference, Dec 2012.
A PID Neural Network Controller
Unscented Kalman Filter for a coal run-of-mine bin
Velocity Estimation from noisy Measurements
Policy Compression for MDPs
Student: Hao Xu, ECE Department
Camera Calibration Using Neural Network for Image-Based Soil Deformation Measurement Systems Zhao, Honghua Ge, Louis Civil, Architectural, and Environmental.
Christoph J. Backi and Sigurd Skogestad
The use of Neural Networks to schedule flow-shop with dynamic job arrival ‘A Multi-Neural Network Learning for lot Sizing and Sequencing on a Flow-Shop’
Masoud Nikravesh EECS Department, CS Division BISC Program
NONLINEAR AND ADAPTIVE SIGNAL ESTIMATION
NONLINEAR AND ADAPTIVE SIGNAL ESTIMATION
Motion Cueing Standards for Commercial Flight Simulation
Akram Bitar and Larry Manevitz Department of Computer Science
Presentation transcript:

Training an Adaptive Critic Flight Controller Silvia Ferrari Advisor: Prof. Robert F. Stengel Princeton University FAA/NASA Joint University Program on Air Transportation, Princeton University, Princeton, NJ April 5-6, 2001

Introduction Classical/neural synthesis of control systems Prior knowledge Adaptive control and artificial neural networks Adaptive critics Learn in real time Cope with noise Cope with many variables Plan over time in a complex way ... Discuss as in the GNC paper how gain-sched. controllers have been used to control nonlinear systems, why (i.e. changing op. conditions) what they consist of (briefly) and where do they fail. *Mention aircrafts Explain what we propose and what are the objectives of the (general approach) -> one particular feature of NNs is that they can learn in an on-line fashion (therefore allowing for...), one architecture that allows for this is adaptive critics (brain-like capabilities) which have the following characteristics.. Begin to introduce the notion of why we'll begin with dynamic programming to move on to adaptive critics.. critic networks remove the learning process one step from the control network KEYWORDS: reinforcement learning, dynamic programming Adaptation takes place during every time interval: Action network takes immediate control action Critic network estimates projected cost

Motivation Provide full envelope control Multiphase learning: Pre-training phase, motivated by corresponding linear controller On-line training phase, during simulations or testing On-line training accounts for: Differences between actual and assumed dynamic models Nonlinear effects not captured in linearizations Potential applications: Incorporate pilot's knowledge into controller a-priori Uninhabited air vehicles control Aerobatic flight control INTRODUCE PI STRUCTURE AND IDEA FOR PINN MOTIVATION… 1- .. by replacing the gains with neural networks 2- Explain well what is covered here and what is instead the ultimate goal - this is why the training is called "pre-training" - and that performance baselines for this phase are established by an equivalent linear model.. 3- The overall procedure provides improved global control w.r.t. gain scheduling, because ... 4- This phase already defines a global nonlinear neural network controller with less effort than.. (also compare to previous such methods) and is demonstrated here 5- On-line training improves control response for large, coupled motions

Table of Contents Aircraft control design approach Initialization, or pre-training phase Adaptive critic neural network controller On-line training Resilient backpropagation

Aircraft Control Design Approach Modeling Linearizations Linear Control Initialization On-line Training Full Envelope Control!

Linear control design: Linearizations: Aircraft Flight Envelope Altitude (m) Linear control design: Longitudinal Lateral-directional Velocity (m/s)

Linear Proportional-Integral Controller Closed-loop stability: ys(t) + + + x(t) CI Hu Hx - - u(t) yc x(t) CF AIRCRAFT + - Delta’s omitted for simplicity. *Study the formulas and notation from the paper in case of questions.. CB Omitting D's, for simplicity: yc = desired output, (xc,uc) = set point.

Proportional-Integral Neural Network Controller yc x(t) + - u(t) uc uB(t) uI(t) xc ys(t) e a NNI NNF NNB SVG CSG AIRCRAFT NNC l = dV/dDxa Orange lines indicate MAIN training lines to show the flow of the main information at EACH TIME STEP training occurs along these lines. Other flows are also possible, like the one of the control from the action TO the critic for an AD design, but are not shown for simplicity (see on-line training slides..) Note that the action network is the sum of NNB and NNI Note the total control is the sum of uc and the NNs contributions.. Where:

Algebraic Neural Network Pre-training Phase Feedback: Integral Error: Critic: NNI|L CI |L CI |LD NNI|LD NNC|L P |L P |LD NNC|LD NNB|L CB |L CB |LD NNB|LD Combine longitudinal and lateral-directional networks: NNB|L NNB|LD NNB , etc. ... Obtain action network: NNB NNI NNA

Roll and Sideslip Angle Comparison of Neural Network and Linear Controllers Between Training Points Flight condition: (14,000 m; 220 m/s) Velocity (m/s) Velocity and Climb Angle Command Climb Angle (deg) Roll Angle (deg) Roll and Sideslip Angle Command Sideslip Angle (deg) NN-Control Linear Control Time (sec)

Adaptive Critic Implementation: Action Network On-line Training Train action network, at time t, holding the critic parameters fixed x(t) Aicraft Model x(t+1) a NNA Utility Function NNA Target Optimality Condition NNC Aircraft model block indicates that a discretized model of the airplane is used at that location, but in reality other info can be used as well, for instance an integrator to produce xi(t), similarly for other blocks.. they are only indicative of what is being used NNB + + NNI [Balakrishnan and Biega, 1996]

Adaptive Critic Implementation: Critic Network On-line Training Train critic network, at time t, holding the action parameters fixed x(t) Aicraft Model x(t+1) a NNA Utility Function NNC NNC Target NNC + [Balakrishnan and Biega, 1996]

On-line Neural Network Training Goal Given a target, t(p), for the network output, z(p): with network parameters, w, provided by the initialization phase. Scaling effect: v w E v w z p s

Comparison of Neural Network Training Algorithms Technique Speed Implement. Complexity Memory Requirement Main Drawbacks Backpropagation Poor Low Small Scaling Levenberg- Marquardt Excellent Medium Large Extended Kalman Filter (Highest) High Resilient Medium-High Medium- Local convergence

Resilient Backpropagation NN Architecture and w (initialization) * * * Store w, and D

Resilient Backpropagation Algorithm Performance Adaptive critics neural network controller test case: Action Network Epochs Mean-squared error performance

Summary and Conclusions Adaptive critic flight controller: Improve aircraft control performance under extreme conditions Systematic approach for designing nonlinear control systems, innovative neural network training techniques Adaptive critic neural network controller implementation Algebraic pre-training based on a-priori knowledge On-line training during simulations (severe conditions) The aircraft nonlinear model is in terms of the given state and control vectors. In this and the following slides, the body frame, the inertial frame and all of the significant aircraft angle are introduced to illustrate the complexity of the problem at hand. Future Work: Testing: acrobatic maneuvers, severe operating conditions, coupling and nonlinear effects!