LEARNING POLICIES FOR BATTERY USAGE OPTIMIZATION IN ELECTRIC VEHICLES Stefano Ermon ECML-PKDD September 2012 Joint work with Yexiang Xue, Carla Gomes,

Slides:

Advertisements

Similar presentations

A Support Vector Method for Optimizing Average Precision

Advertisements

Introduction to Transportation Systems. PART II: FREIGHT TRANSPORTATION.

Probabilistic Planning (goal-oriented) Action Probabilistic Outcome Time 1 Time 2 Goal State 1 Action State Maximize Goal Achievement Dead End A1A2 I A1.

Query Optimization of Frequent Itemset Mining on Multiple Databases Mining on Multiple Databases David Fuhry Department of Computer Science Kent State.

1 University of Southern California Keep the Adversary Guessing: Agent Security by Policy Randomization Praveen Paruchuri University of Southern California.

U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Solving POMDPs Using Quadratically Constrained Linear Programs Christopher Amato.

VTrack: Accurate, Energy-Aware Road Traffic Delay Estimation Using Mobile Phones Arvind Thiagarajan, Lenin Ravindranath, Katrina LaCurts, Sivan Toledo,

ANDREW MAO, STACY WONG Regrets and Kidneys. Intro to Online Stochastic Optimization Data revealed over time Distribution of future events is known Under.

Class Project Due at end of finals week Essentially anything you want, so long as it’s AI related and I approve Any programming language you want In pairs.

Individualised Marketing: Travel behaviour change Equivalent to discovering another Iraq? Proven methods of reducing automobile travel can produce “nega-barrels”*

Planning under Uncertainty

Slide 1 Harnessing Wind in China: Controlling Variability through Location and Regulation DIMACS Workshop: U.S.-China Collaborations in Computer Science.

TRADING OFF PREDICTION ACCURACY AND POWER CONSUMPTION FOR CONTEXT- AWARE WEARABLE COMPUTING Presented By: Jeff Khoshgozaran.

Lecture 5: Learning models using EM

Integrating POMDP and RL for a Two Layer Simulated Robot Architecture Presented by Alp Sardağ.

1 Hybrid Agent-Based Modeling: Architectures,Analyses and Applications (Stage One) Li, Hailin.

U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Optimal Fixed-Size Controllers for Decentralized POMDPs Christopher Amato Daniel.

CS121 Heuristic Search Planning CSPs Adversarial Search Probabilistic Reasoning Probabilistic Belief Learning.

Pieter Abbeel and Andrew Y. Ng Reinforcement Learning and Apprenticeship Learning Pieter Abbeel and Andrew Y. Ng Stanford University.

Experimental Evaluation

Exploration in Reinforcement Learning Jeremy Wyatt Intelligent Robotics Lab School of Computer Science University of Birmingham, UK

Problems with the proposed CAFE Standards for 2020 Dennis Silverman Physics and Astronomy UC Irvine.

Flywheel Energy Storage for Regional Rail Vehicles Matthew Read 1, Roderick A Smith 1, Keith Pullen 2 1 Future Railway Research Centre, Department of Mechanical.

Team logo Electrification and "Publification" of the Transportation Infrastructure Claire Kearns-McCoy, Max Powers, CK Umachi Principles of Engineering.

Getting Green Building Automation. Why is Building Automation a Green Technology? There are programs starting all over the nation that focus on alternative.

CS Reinforcement Learning1 Reinforcement Learning Variation on Supervised Learning Exact target outputs are not given Some variation of reward is.

Clean Cities / 1 EAST BAY CLEAN CITIES COALITION Electric Drive Vehicles Overview Richard Battersby Director, East Bay Clean Cities Coalition Date.

Clean Cities / 1 COALITION NAME Electric Drive Vehicles Overview Presenter Title Date.

Sensys 2009 Speaker:Lawrence.  Introduction  Overview & Challenges  Algorithm  Travel Time Estimation  Evaluation  Conclusion.

The First International Transport Forum, May , Leipzig INDUCING TRANSPORT MODE CHOICE BEHAVIORIAL CHANGES IN KOREA: A Quantitative Analysis.

Presented by Kenneth R. Fischer McDonald Transit Associates, Inc.

Computational Stochastic Optimization: Bridging communities October 25, 2012 Warren Powell CASTLE Laboratory Princeton University

Andrew Carrier 1, Dominik Wechsler 1, Philip Jessop 1, Boyd Davis 2 1 Department of Chemistry 2 Queen’s-RMC Fuel Cell Research Centre Queen’s University.

Planning and Verification for Stochastic Processes with Asynchronous Events Håkan L. S. Younes Carnegie Mellon University.

ECES 741: Stochastic Decision & Control Processes – Chapter 1: The DP Algorithm 1 Chapter 1: The DP Algorithm To do:  sequential decision-making  state.

Optimizing Hybrid Vehicles via Route Prediction jon froehlich & john krumm HCI Intern Talk July 26 th, 2007.

Lecture 13: Energy Storage Energy Law and Policy Fall 2013.

A* Lasso for Learning a Sparse Bayesian Network Structure for Continuous Variances Jing Xiang & Seyoung Kim Bayesian Network Structure Learning X 1...

© 2009 IBM Corporation 1 Improving Consolidation of Virtual Machines with Risk-aware Bandwidth Oversubscription in Compute Clouds Amir Epstein Joint work.

Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.

Biswanath Panda, Mirek Riedewald, Daniel Fink ICDE Conference 2010 The Model-Summary Problem and a Solution for Trees 1.

1 CS 391L: Machine Learning: Experimental Evaluation Raymond J. Mooney University of Texas at Austin.

Earth’s Changing Environment Lecture 24 Increasing Transportation Efficiency.

1 S ystems Analysis Laboratory Helsinki University of Technology Kai Virtanen, Tuomas Raivio and Raimo P. Hämäläinen Systems Analysis Laboratory Helsinki.

Approximate Dynamic Programming Methods for Resource Constrained Sensor Management John W. Fisher III, Jason L. Williams and Alan S. Willsky MIT CSAIL.

Design Principles for Creating Human-Shapable Agents W. Bradley Knox, Ian Fasel, and Peter Stone The University of Texas at Austin Department of Computer.

1 ECE 517: Reinforcement Learning in Artificial Intelligence Lecture 8: Dynamic Programming – Value Iteration Dr. Itamar Arel College of Engineering Department.

1 Theory of Constraints Short-term Capacity Optimization.

Operational Research & ManagementOperations Scheduling Economic Lot Scheduling 1.Summary Machine Scheduling 2.ELSP (one item, multiple items) 3.Arbitrary.

The Application of Graphene-Based Supercapacitors in Conjunction with Today’s Technology By Jenna Cario Today’s Electrical Storage Technology Lithium ion.

1 1 Slide Simulation Professor Ahmadi. 2 2 Slide Simulation Chapter Outline n Computer Simulation n Simulation Modeling n Random Variables and Pseudo-Random.

Alternatively Fueled Vehicles The Pollution Solution?

Fast Query-Optimized Kernel Machine Classification Via Incremental Approximate Nearest Support Vectors by Dennis DeCoste and Dominic Mazzoni International.

Balance and Filtering in Structured Satisfiability Problems Henry Kautz University of Washington joint work with Yongshao Ruan (UW), Dimitris Achlioptas.

 14:00 LEED Presentation  14:30 Teamwork time ST1  Compare your individual result  Prepare the presentation and save it to Moodle  Get the computer.

Supervised Machine Learning: Classification Techniques Chaleece Sandberg Chris Bradley Kyle Walsh.

Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.

A Decision Tree Classification Model For Determining The Location For Solar Power Plant A PRESENTATION BY-  DISHANT MITTAL  DEV GAURAV VIT UNIVERSITY,VELLORE.

Unified Adaptivity Optimization of Clock and Logic Signals Shiyan Hu and Jiang Hu Dept of Electrical and Computer Engineering Texas A&M University.

Stochastic tree search and stochastic games

T-Share: A Large-Scale Dynamic Taxi Ridesharing Service

Analytics and OR DP- summary.

Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 7

ANTICIPATORY LOGISTICS

AV Autonomous Vehicles.

 Real-Time Scheduling via Reinforcement Learning

 Real-Time Scheduling via Reinforcement Learning

Energy Conservation Home, School, and Transportation

Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 7

Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 7

Presentation transcript:

LEARNING POLICIES FOR BATTERY USAGE OPTIMIZATION IN ELECTRIC VEHICLES Stefano Ermon ECML-PKDD September 2012 Joint work with Yexiang Xue, Carla Gomes, and Bart Selman Department of Computer Science, Cornell University

I NTRODUCTION In 2010, transportation contributed approximately 27 percent of total U.S. greenhouse gas emissions accounts for 45 percent of the net increase in total U.S. greenhouse gas emissions from [U.S Environmental Protection Agency, 2012] More sustainable transportation: low-carbon fuels strategies to reduce the number of vehicle miles traveled new and improved vehicle technologies operating vehicles more efficiently Nissan CEO has predicted that one in 10 cars will run on battery power alone by The U.S. has pledged US$2.4 billion in grants for electric cars and batteries. Our Work : Machine Learning and AI to make this technology more practical

I NTRODUCTION Major limitations in battery technology: Limited capacity (range) Price Limited lifespan (max number of charge/discharge cycles) Inefficient (energetically) for vehicle usage 1. Internal resistance: 2. Peukert's Law: the faster a battery is discharged with respect to the nominal rate, the smaller the actual delivered capacity is (exponential in the current I) Energy wasted as heat: r. I 2

M ULTIPLE - BATTERY SYSTEMS Both effects depend on variability of the output current: How can we keep output more stable? Cannot control demand.. Multiple-battery systems [Dille et al. 2010, Kotz et al 2001,…]: Include a smaller capacity but more efficient battery Hope: get the best of both worlds Large capacity High efficiency Reasonable cost time current time current Wastes more energy (variance) Same total energy output (integral)

M ULTIPLE - BATTERY SYSTEMS Use a supercapacitor that behaves like an ideal battery Intuition: battery is good at holding the charge for long times supercapacitor is efficient for rapid cycles of charge and discharge Use supercapacitor as a buffer to keep battery output stable Store when demand is low, then discharge when demand is high Smaller (1000 times) More expensive More efficient

M ULTIPLE - BATTERY M ANAGEMENT Performance depends critically on how the system is managed Difficult problem: Vehicle acceleration (-) Regenerative braking (+) Highly stochastic Example policy: “keep capacitor close to full capacity” ready for sudden accelerations suboptimal because there might not be enough space left to hold regenerative braking energy  Intuitively, the system needs to be able to predict future high-current events (positive or negative), preparing the capacitor to handle them Charge level

O BJECTIVE Goal: design an Intelligent Management System Intelligent Management System Past driving behavior Action: how to allocate the demand Vehicle conditions Mining a large dataset of crowdsouced commuter trips, we constructed DPDecTree Can keep battery output stable (less energy is wasted) Position, speed, time of the day, … (Real world trip, based on vehicle simulator) How much energy from battery? How much energy from capacitor? Should we charge/discharge the capacitor?

M ODELING Quadratic Programming formulation over T steps: (1): demand has to be met (2): cannot overcharge/overdraw the capacitor I 2 -score: sum of the squared battery output subject to Demand d Current from battery to motor QP (CVXOPT) can only solve relatively short trips (no real-time planning)

S PEEDING UP 1.Reduce the dimensionality (change of variables): 3T  T variables 2.Exploit the sequential nature of the problem: discretized problem can be solved by dynamic programming Faster than CVXOPT (~2 orders of magnitude) Suboptimal (discretized) but close What if we only partially know the future demand? Rolling horizon: Demand is stochastic (unkown) Can we construct a probabilistic model? Knowing the future 10 seconds is enough to be within 35% of omniscent optimal Example: QP score of in about 11 minutes. DP solver: score of in 15 seconds.

MDP M ODELING We formulate as an MDP: States = (charge levels, current demand, GPS coordinates, speed, acceleration, altitude, time of day, …) Admissible Actions= (i bm,i bc,i cm ) that meet the demand Cost= i 2 score, (i bm + i bc ) 2 squared battery output current Transition probabilities? we have an internal model for the batteries We need a model for vehicle dynamics + driving behavior We leverage a large crowd-sourced dataset of commuter trips (ChargeCar project) to learn the model C C(t+1)=C(t) +i(t) -o(t) i(t)o(t) Assumed to be independent

A VAILABLE D ATA ChargeCar Project ( Crowdsourced dataset of commuter trips across United States Publicly available

Sample based optimization Compute “posterior-optimal” action for every observed state s s S(s) MultiSet of all possible successors that have been observed Trip 1 Trip 2 Trip 3 Equivalent to learnining the transition probabilities and optimize the resulting MDP A trip is a sequence of states Given a state s, what’s the best action to take?

Training set generation Generate training set of (state, action) pairs Generate more examples by looking at other (hypothetical) charge levels per state (models are decoupled) Then use supervised learning to learn a policy (regression) Policy: mapping from states to actions Compact Generalizes to previously unseen states Crowd-souced Trips (State,Action) … (State,Action) Policy Sample based optimization Supervised Learning (regression)

Learning the policy ChargeCar algorithmic competition Dataset: 1,984 trips (average length 15 minutes) Training set: labeled pairs (state, optimal action) Judging set: 168 trips (8%) We use Bagged Decision Trees Split according to capacity when training set is too big. The resulting policy is called DPDecTree

Results Using DPDecTree, the battery output is significantly smoother  energy savings

ChargeCar competition results DatasetDPDecTreeMPLNaïve BufferBaselineOmniscent alik arnold mike thor illah gary Total % improvement, statistically significant (one-sided paired t-test and Wilcoxon Signed Rank test) Score = sum of squared battery output. Lower is better.

Conclusions Electric vehicles as a promising direction towards more sustainable transportation systems Battery technology is not mature Multiple-battery systems as a more cost-effective alternative AI/Machine learning techniques to improve performance: QP formulation for the battery optimization problem Use of sample-based optimization + supervised learning Outperforms other methods in the ChargeCar competition Growing interest in mining GPS trajectories (Urban Computing) Many datasets publicly available Our angle: focused on energy aspects (Computational Sustainability) Many other applications