POLI di MI tecnicolano ADAPTIVE AUGMENTED CONTROL OF UNMANNED ROTORCRAFT VEHICLES C.L. Bottasso, R. Nicastro, L. Riviello, B. Savini Politecnico di Milano.

Slides:



Advertisements
Similar presentations
Introductory Control Theory I400/B659: Intelligent robotics Kris Hauser.
Advertisements

David Rosen Goals  Overview of some of the big ideas in autonomous systems  Theme: Dynamical and stochastic systems lie at the intersection of mathematics.

Neural Network Control of a Hypersonic Inlet Joint University Program Meeting April 05, 2001 Nilesh V. Kulkarni Advisors Prof. Minh Q. Phan Dartmouth College.
POLI di MI tecnicolano Carlo L. BottassoGiorgio Maisano Francesco Scorcelletti TRAJECTORY OPTIMIZATION PROCEDURES FOR ROTORCRAFT VEHICLES, THEIR SOFTWARE.
POLI di MI tecnicolano NAVIGATION AND CONTROL OF AUTONOMOUS VEHICLES WITH INTEGRATED FLIGHT ENVELOPE PROTECTION C.L. Bottasso Politecnico di Milano Workshop.
Robust control Saba Rezvanian Fall-Winter 88.
Automotive Research Center Robotics and Mechatronics A Nonlinear Tracking Controller for a Haptic Interface Steer-by-Wire Systems A Nonlinear Tracking.
POLI di MI tecnicolano TRAJECTORY PLANNING FOR UAVs BY SMOOTHING WITH MOTION PRIMITIVES C.L. Bottasso, D. Leonello, B. Savini Politecnico di Milano AHS.
POLI di MI tecnicolano Carlo L. BottassoLuca Riviello ROTORCRAFT TRIM BY A NEURAL MODEL-PREDICTIVE AUTO-PILOT Carlo L. Bottasso and Luca Riviello Politecnico.
I welcome you all to this presentation On: Neural Network Applications Systems Engineering Dept. KFUPM Imran Nadeem & Naveed R. Butt &
Chapter 5 NEURAL NETWORKS
POLI di MI tecnicolano PROCEDURES FOR ENABLING THE SIMULATION OF MANEUVERS WITH COMPREHENSIVE CODES Carlo L. Bottasso, Alessandro Croce, Domenico Leonello.
Neural Networks Marco Loog.
POLI di MI tecnicolano Numerical Simulation of Aero-Servo-Elastic Problems, with Application to Wind Turbines and Rotary Wing Vehicles Carlo L. Bottasso.
PERFORMANCE COMPARISON OF CONTROL SCHEMES FOR VARIABLE-SPEED WIND TURBINES C.L. Bottasso, A. Croce, B. Savini Politecnico di Milano Milano, Italy.
Discriminative Training of Kalman Filters P. Abbeel, A. Coates, M
Georgia ofTech Institutenology OPTIMIZATION OF CRITICAL TRAJECTORIES FOR ROTORCRAFT VEHICLES Carlo L. Bottasso Georgia Institute of Technology Alessandro.
POLI di MI tecnicolano VISION-AUGMENTED INERTIAL NAVIGATION BY SENSOR FUSION FOR AN AUTONOMOUS ROTORCRAFT VEHICLE C.L. Bottasso, D. Leonello Politecnico.
Neural Networks. Background - Neural Networks can be : Biological - Biological models Artificial - Artificial models - Desire to produce artificial systems.
Overview and Mathematics Bjoern Griesbach
Radial Basis Function Networks
Unit 3a Industrial Control Systems
Helsinki University of Technology Adaptive Informatics Research Centre Finland Variational Bayesian Approach for Nonlinear Identification and Control Matti.
Dr. Hala Moushir Ebied Faculty of Computers & Information Sciences
AUTOMATIC CONTROL THEORY II Slovak University of Technology Faculty of Material Science and Technology in Trnava.
Kalman filtering techniques for parameter estimation Jared Barber Department of Mathematics, University of Pittsburgh Work with Ivan Yotov and Mark Tronzo.
A Shaft Sensorless Control for PMSM Using Direct Neural Network Adaptive Observer Authors: Guo Qingding Luo Ruifu Wang Limei IEEE IECON 22 nd International.
Adaptive Critic Design for Aircraft Control Silvia Ferrari Advisor: Prof. Robert F. Stengel Princeton University FAA/NASA Joint University Program on Air.
A Framework for Distributed Model Predictive Control
Artificial Neural Networks
Optimal Nonlinear Neural Network Controllers for Aircraft Joint University Program Meeting October 10, 2001 Nilesh V. Kulkarni Advisors Prof. Minh Q. Phan.
Neural Network Based Online Optimal Control of Unknown MIMO Nonaffine Systems with Application to HCCI Engines OBJECTIVES  Develop an optimal control.
CS 478 – Tools for Machine Learning and Data Mining Backpropagation.
To clarify the statements, we present the following simple, closed-loop system where x(t) is a tracking error signal, is an unknown nonlinear function,
MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.
CHAPTER 5 S TOCHASTIC G RADIENT F ORM OF S TOCHASTIC A PROXIMATION Organization of chapter in ISSO –Stochastic gradient Core algorithm Basic principles.
Model Reference Adaptive Control (MRAC). MRAS The Model-Reference Adaptive system (MRAS) was originally proposed to solve a problem in which the performance.
September Bound Computation for Adaptive Systems V&V Giampiero Campa September 2008 West Virginia University.
Low Level Control. Control System Components The main components of a control system are The plant, or the process that is being controlled The controller,
Order of Magnitude Scaling of Complex Engineering Problems Patricio F. Mendez Thomas W. Eagar May 14 th, 1999.
S ystems Analysis Laboratory Helsinki University of Technology Automated Solution of Realistic Near-Optimal Aircraft Trajectories Using Computational Optimal.
Simulation and Research Services Division Patuxent River, Maryland Roger Burton Program Manager MEETING No. 99 AEROSPACE.
Chapter 20 1 Overall Objectives of Model Predictive Control 1.Prevent violations of input and output constraints. 2.Drive some output variables to their.
Review: Neural Network Control of Robot Manipulators; Frank L. Lewis; 1996.
Lecture 25: Implementation Complicating factors Control design without a model Implementation of control algorithms ME 431, Lecture 25.
CHAPTER 10 Widrow-Hoff Learning Ming-Feng Yeh.
7/6/99 MITE1 Fully Parallel Learning Neural Network Chip for Real-time Control Students: (Dr. Jin Liu), Borte Terlemez Advisor: Dr. Martin Brooke.
EEE502 Pattern Recognition
Chapter 8: Adaptive Networks

Adaptive Optimal Control of Nonlinear Parametric Strict Feedback Systems with application to Helicopter Attitude Control OBJECTIVES  Optimal adaptive.
Chapter 2-OPTIMIZATION G.Anuradha. Contents Derivative-based Optimization –Descent Methods –The Method of Steepest Descent –Classical Newton’s Method.
دانشگاه صنعتي اميركبير دانشكده مهندسي پزشكي استاد درس دكتر فرزاد توحيدخواه بهمن 1389 کنترل پيش بين-دکتر توحيدخواه MPC Stability-2.
September 28, 2000 Improved Simultaneous Data Reconciliation, Bias Detection and Identification Using Mixed Integer Optimization Methods Presented by:
CHEE825 Fall 2005J. McLellan1 Nonlinear Empirical Models.
Chapter 6 Neural Network.
Dynamic Neural Network Control (DNNC): A Non-Conventional Neural Network Model Masoud Nikravesh EECS Department, CS Division BISC Program University of.
1 Lu LIU and Jie HUANG Department of Mechanics & Automation Engineering The Chinese University of Hong Kong 9 December, Systems Workshop on Autonomous.
Bump Hunting The objective PRIM algorithm Beam search References: Feelders, A.J. (2002). Rule induction by bump hunting. In J. Meij (Ed.), Dealing with.
Control Theory Lachlan Blackhall and Tyler Summers.
A Presentation on Adaptive Neuro-Fuzzy Inference System using Particle Swarm Optimization and it’s Application By Sumanta Kundu (En.R.No.
OVERVIEW Impact of Modelling and simulation in Mechatronics system
CS b659: Intelligent Robotics
Student: Hao Xu, ECE Department
Linear Control Systems
Instructor :Dr. Aamer Iqbal Bhatti
Masoud Nikravesh EECS Department, CS Division BISC Program
NONLINEAR AND ADAPTIVE SIGNAL ESTIMATION
Jetson-Enabled Autonomous Vehicle
Presentation transcript:

POLI di MI tecnicolano ADAPTIVE AUGMENTED CONTROL OF UNMANNED ROTORCRAFT VEHICLES C.L. Bottasso, R. Nicastro, L. Riviello, B. Savini Politecnico di Milano AHS International Specialists' Meeting on Unmanned Rotorcraft Chandler, AZ, January 23-25, 2007

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Rotorcraft UAVs at PoliMI navigation control Low-cost platform for development and testing of navigation and control strategies (including vision, flight envelope protection, etc.); Vehicles: off-the-shelf hobby helicopters; On-board control hardware based on PC-104 standard; everything is in-house developed Bottom-up approach: everything is in-house developed (Inertial Navigation System, Guidance and Control algorithms, Linux-based real-time OS, flight simulators, etc. etc.)

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Outline Non-linear model predictive control; Reference Augmented Predictive Control (RAPC): motivations; Reference Augmented Model Identification; Reference Augmented Neural Control; Results; Conclusions and outlook.

Reference Augmented Predictive Control POLITECNICO di MILANO DIA UAV Control Architecture Target Obstacles Hierarchical three-layer control architecture Hierarchical three-layer control architecture (Gat 1998): Vision/sensor range Strategic layer: assign mission objectives (typically relegated to a human operator); Tactical layer: generate vehicle guidance information, based on input from strategic layer and sensor information; Reflexive layer: track trajectory generated by tactical layer, control, stabilize and regulate vehicle. Adaptive Non-linear Model Predictive Control In this paper: Adaptive Non-linear Model Predictive Control.

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Non-Linear Model Predictive Control Non-linear Model Predictive Control Non-linear Model Predictive Control (NMPC): non-linear reduced model Find the control action which minimizes an index of performance, by predicting the future behavior of the plant using a non-linear reduced model. - Reduced model: - Initial conditions: - Output definition: Cost: with desired goal outputs and controls. Stability results Stability results: Findeisen et al. 2003, Grimm et al L ( y ; u ) = ( y ¡ y ¤ ) T Q ( y ¡ y ¤ ) + ( u ¡ u ¤ ) T R ( u ¡ u ¤ ) m i n u ; x ; y J = Z t 0 + T p t 0 L ( y ; u ) d t s. t. : f ( _ x ; x ; u ) = 0 t 2 [ t 0 ; t 0 + T p ] x ( t 0 ) = x 0 y = g ( x ) t 2 [ t 0 ; t 0 + T p ] ( ¢ ) ¤

Reference Augmented Predictive Control POLITECNICO di MILANO DIA F u t ure P as t S t eer i ngw i n d ow P re d i c t i onw i n d ow x 0 P re d i c t i onerror S t a t e t rac k i ngerror C on t ro l t rac k i ngerror t 0 t 0 t 0 + T p t 0 + T s C ompu t e d con t ro l u ( t ) P l an t response e x ( t ) P re d i c t e d response x ( t ) G oa l response x ¤ ( t ) G oa l con t ro l u ¤ ( t ) Non-Linear Model Predictive Control

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Model-Adaptive Predictive Control 1. Tracking problem Plant response 3. Reduced model update Predictive solutions 2. Steering problem Prediction window Steering window Tracking cost Prediction error Prediction window Tracking cost Steering window Prediction error Tracking cost Prediction window Steering window Prediction error Goal trajectory Receding horizon control Receding horizon control:

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Motivation knowledgelegacy For any given problem: wealth of knowledge and legacy methods which perform reasonably well; undesirable wasteful Quest for better performance/improved capabilities: undesirable and wasteful to neglect valuable existing knowledge; Reference Augmented Predictive ControlRAPC Reference Augmented Predictive Control (RAPC): exploit available legacy methods, embedding them in a non-linear model predictive control framework. Specifically: Model Model: augment flight mechanics rotorcraft models (BEM+inflow theories) to account for unresolved or unmodeled physics; Control Control: design a non-linear controller augmenting linear ones (LQR) which are known to provide a minimum level of performance about certain linearized operating conditions.

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Reference Augmented Predictive Control Approach: reference model / reference control law; - Choose a reference model / reference control law; adaptiveparametric function; - Augment the reference using an adaptive parametric function; Adjustgood approximation of the actual system / optimal control law - Adjust the function parameters to ensure good approximation of the actual system / optimal control law (parameter identification). Reasons for using a reference model / control Reasons for using a reference model / control: even before any learning - Reasonable predictions / controls even before any learning has taken place (otherwise would need extensive pre-training); -small quantity - Easier and faster adaption: the defect is typically a small quantity, if the reference solution is well chosen.

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Reference model Reference model: Euler’s eqs.+BEM+inflow model Plant

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Outline Non-linear model predictive control; Reference Augmented Predictive Control (RAPC): motivations; Reference Augmented Model Identification; Reference Augmented Neural Control; Results; Conclusions and outlook.

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Goal Goal: reduced modelpredicting the behavior of the plant Develop reduced model capable of predicting the behavior of the plant with minimum error (same outputs when subjected to same inputs); self-adaptive Reduced model must be self-adaptive (capable of learning) to adjust to varying operating conditions. Predictive solutions Prediction (tracking) window Steering window Prediction error to be minimized Reference Augmented Model Identification

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Neural augmented reference model Neural augmented reference model: reference (problem dependent) analytical model, Remarknotadequate Remark: reference model will not, in general, ensure adequate predictions, i.e. when = system states/controls, = model states/controls. Augmented reference model: unknowndefect where is the unknown reference model defect that ensures when if we knewperfect prediction Hence, if we knew, we would have perfect prediction capabilities. d d e u = u. e u = u ; f re f ( _ x ; x ; u ) = 0 : e x 6 = x x ; u e x ; e u f re f ( _ x ; x ; u ) = d ( x ; u ) ; e x = x Reference Augmented Model Identification

Reference Augmented Predictive Control POLITECNICO di MILANO DIA single-hidden-layer neural networks Approximate with single-hidden-layer neural networks: where and = functional reconstruction error; = matrices of synaptic weights and biases; = sigmoid activation functions; = network input. reduced model parameters The reduced model parameters Kalman filtering are identified on-line using Kalman filtering. d ¾ ( Á ) = ( ¾ ( Á 1 ) ;:::; ¾ ( Á N n )) T d ( y ; u ) = d p ( x ; u ; p m ) + " ; d p ( x ; u ; p m ) = W m T ¾ ( V m T i + a m ) + b m ; " W m ; V m ; a m ; b m i = ( x T ; u T ) T p m = ( :::; W m i k ; V m i k ; a m i ; b m i ;::: ) T Reference Augmented Model Identification

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Model Augmentation Results Pitch rate for plant, reference, and neural-augmented reference with same prescribed inputs. Short transient = fast adaption Black: plant Red: reference model Blue: reference model +neural network

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Outline Non-linear model predictive control; Reference Augmented Predictive Control (RAPC): motivations; Reference Augmented Model Identification; Reference Augmented Neural Control; Results; Conclusions and outlook.

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Prediction problem: Enforcing optimality Enforcing optimality, we get: Non-Linear Model Predictive Control m i n u ; x ; y J = Z t 0 + T p t 0 L ( y ; u ) d t s. t. : f ( _ x ; x ; u ) = 0 t 2 [ t 0 ; t 0 + T p ] x ( t 0 ) = x 0 y = g ( x ) t 2 [ t 0 ; t 0 + T p ] Model equations: Adjoint equations: Transversality conditions: State initial conditions: Co-state final conditions:

Reference Augmented Predictive Control POLITECNICO di MILANO DIA F u t ure P as t P re d i c t i onw i n d ow x 0 t 0 t 0 t 0 + T p G oa l response x ¤ ( t ) G oa l con t ro l u ¤ ( t ) Non-Linear Model Predictive Control u ( t ) O p t i ma l con t ro l u ( t ) x ( t ) ; t < t 0 u ( t ) ; t < t 0 Â ( ¢ ; ¢ ; ¢ ; ¢ ) minimizing control It can be shown that minimizing control is See paper for details. u ( t ) = Â ¡ x 0 ; y ¤ ( t ) ; u ¤ ( t ) ; t ¢

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Reference augmented form: where is the unknown control defect. Remark Remark: if one knew, the optimal control would be available without having to solve the open-loop optimal control problem. Idea Idea: -Approximate - Approximate using an adaptive parametric element: -Identify - Identify on-line, i.e. find the parameters which minimize the reconstruction error. p c " Reference Augmented Predictive Control u ( t ) = u re f ( t ) + À ¡ x 0 ; y ¤ ( t ) ; u ¤ ( t ) ; t ¢ À ( ¢ ; ¢ ; ¢ ; ¢ ) À ( ¢ ; ¢ ; ¢ ; ¢ ) À ( ¢ ; ¢ ; ¢ ; ¢ ) À ¡ x 0 ; y ¤ ( t ) ; u ¤ ( t ) ; t ¢ = À p ¡ x 0 ; y ¤ ( t ) ; u ¤ ( t ) ; t ; p c ¢ + " c À p ( ¢ ; ¢ ; ¢ ; ¢ )

POLITECNICO di MILANO DIA Iterative procedure Iterative procedure to solve the problem in real-time: forward (state prediction): Integrate reduced model equations forward in time over the prediction window, using and the latest available parameters (state prediction): backward (co-state prediction): Integrate adjoint equations backward in time (co-state prediction): Correct Correct control law parameters, e.g. using steepest descent: p c u re f p c _ p c = ¡ ´ ^ J ; p c ! p new c = p o ld c ¡ ´ ^ J ; p c ¡ d ( f T ; _ x ¸ ) d t + ( f ; x + u T ; x f ; u ) T ¸ + y T ; x L ; y + u T ; x L ; u = 0 t 2 [ t 0 ; t 0 + T p ] ¸ ( t 0 + T p ) = 0 f ( _ x ; x ; u ; p m ) = 0 t 2 [ t 0 ; t 0 + T p ] x ( t 0 ) = x 0 On-line Identification of Control Parameters

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Remark Remark: the parameter correction step transversality condition seeks to enforce the transversality condition optimal satisfied Once this is satisfied, the control is optimal, since the state and co- state equations and the boundary conditions are satisfied. _ p c = ¡ ´ ^ J ; p c Z t 0 + T p t 0 À T ; p c ( L ; u + f T ; u ¸ ) d t = 0 On-line Identification of Control Parameters

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Tracking cost Future Target Predict Predict state forward Predict Predict co-state backwards Predict Predict control action Update Update estimate of control action, based on transversality violation _ p c = ¡ ´ ^ J ; p c Advance Advance plant Update Update model, based on prediction error Past Optimal control Prediction error Repeat Repeat Future Past Prediction horizon Steering window State Control On-line Identification of Control Parameters x ( t ) ¸ ( t ) u ( t )

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Neural-Network-Based Implementation - - Drop dependence on time history of goal quantities: - - Approximate temporal dependence using shape functions: -single-hidden-layer neural network - Associate each nodal value with the output of a single-hidden-layer feed-forward neural network, one for each component: where Output: Input: Control parameters: À p ¡ x 0 ; y ¤ ( t ) ; u ¤ ( t ) ; t ; p c ¢ ¼ À p ¡ x 0 ; y ¤ ( t 0 ) ; u ¤ ( t 0 ) ; t ; p c ¢ À p ¡ x 0 ; y ¤ ( t 0 ) ; u ¤ ( t 0 ) ; ¿ ; p c ¢ ¼ ( 1 ¡ » ) À p k ¡ x 0 ; y ¤ ( t 0 ) ; u ¤ ( t 0 ) ; p c ¢ + » À p k + 1 ¡ x 0 ; y ¤ ( t 0 ) ; u ¤ ( t 0 ) ; p c ¢ o c = W T c ¾ ( V T c i c + a c ) + b c o c = ( À T p 0 ; À T p 1 ;:::; À T p M ¡ 1 ) T i c = ¡ x T 0 ; x ¤ T ( t 0 ) ; u ¤ T ( t 0 ) ¢ T p c = ( :::; W c ij ;:::; V c ij ;:::; a c i ;:::; b c i ;::: ) T

Reference Augmented Predictive Control POLITECNICO di MILANO DIA F u t ure P as t P re d i c t i onw i n d ow x 0 t 0 t 0 t 0 + T p x ( t ) ; t < t 0 u ( t ) ; t < t 0 u ¤ ( t 0 ) x ¤ ( t 0 ) NN À p k Neural-Network-Based Implementation x ¤ ( t ) u ¤ ( t )

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Outline Non-linear model predictive control; Reference Augmented Predictive Control (RAPC): motivations; Reference Augmented Model Identification; Reference Augmented Neural Control; Results; Conclusions and outlook.

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Vehicle Model and Simulation Environment Vehicle model Vehicle model: Blade element and inflow theory (Prouty, Peters); Quasi-steady flapping dynamics, aerodynamic damping correction; Look-up tables for aerodynamic coefficients of lifting surfaces; Effects of compressibility and downwash at the tail due to main rotor; Process and measurement noise, delays. Reflexive controller Reflexive controller: State reconstruction by Extended Kalman Filtering; Reference controller: output- feedback LQR at 50 Hz; Goal trajectory planned as in Bottasso et al

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Results

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Results Integral tracking error vs. length of prediction window: Significant improvement over LQR

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Results Turn rate vs. time: RAPC LQR

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Results Integral tracking error vs. model mismatch parameter: RAPC without model adaption RAPC with model adaption Significant improvement over LQR

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Results Main rotor collective & norm of control network parameters: Initial transient Adapted

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Conclusions reduced model identification Non-linear reduced model identification for capturing unmodeled or unresolved physics; promoted Linear controller promoted to non-linear; real-time Hard real-time capable (fixed number of ops, no iterations); independently Adaption of control action can be performed independently from adaption of reduced model; before adaptionsimplify adaption Reference model and reference control ensure good predictions even before adaption, avoid need for pre-training, simplify adaption since defect is small; adaption diagnostics Conceptually possible (but not investigated here) to do adaption diagnostics by monitoring defects; non-linearly stable Theoretically non-linearly stable (if identification of, successful); Basic concept demonstrated in a high-fidelity virtual environment. p c p m

Reference Augmented Predictive Control POLITECNICO di MILANO DIA Outlook Real-time implementation and integration in a rotorcraft UAV (in progress) at the Autonomous Flight Lab at PoliMI; Testing and extensive experimentation; Integration with vision for fully autonomous navigation in complex environments.