Name of Principal Author and all other author(s): Michael P Bailey

Presentation transcript:

75th MORSS 712CD Cover Page
June 2007, at US Naval Academy, Annapolis, MD

Name of Principal Author and all other author(s): Michael P Bailey
Principal Author’s Organization and address: 3300 Russell Rd, Quantico, VA
Original title on 712 A/B: Validation Methodology for Agent-based Simulations
(Please use the same title listed on MORSS Form 712 A/B. If the title was changed please list the revised title below.)
Revised title:
Presented in: WG(s) # 33, CG, Special Session, Demonstration, Tutorial, or Focus Session #

If you would like your presentation included in the 75th MORSS Final Report CD it must:
1. Be unclassified, approved for public release, distribution unlimited, and is exempt from US export licensing and other export approvals including the International Traffic in Arms Regulations (22CFR120 et.seq.),
2. include MORS Form 712CD as the first page of the presentation and
3. a MORS form 712 A or B must be in the MORS Office no later than 14 June

The following presentation is believed to be: unclassified, approved for public release, distribution unlimited, and is exempt from US export licensing and other export approvals including the International Traffic in Arms Regulations (22CFR120 et.seq.)

VALIDATION METHODOLOGY for AGENT-BASED SIMULATION
Akst, Alt, Axtell, Bailey, Barry, Beanland, Blais, Bottom, Clark, Clemence, Davis, Duncan, Duong, Eberth, Espinosa, Henscheid, Holland, Horne, Ilachinski, Jackson, Kane, Koehler, Liesnowitz, Lucas, Lyle, Macall, McDonald, McGaughey, Meyers, Middleton, Middleton, Moya, Nichols, Park, Rausch, Roginski, Saunders, Sheldon, Starr, Steiner, Steiner, Stephens, Stutzman, Vane, Verna, Visco, Wang, Weisel, Wong, Works, Yilmaz, Young, Youngblood, Yu, Zandbergen

ABOUT ABSVal
WHICH SIMULATIONS are the target of interest
- Directly applicable to IW problem set
- In addition to ISAAC, Pythagoras, MANA, consider decision rules, knowledge-based systems, cellular automata, population dynamics
Discussion about VV&A
- Goal of validation: Match Tool to Application
- Conservation of Vagueness: definition of “match”
- DoD definitions & Processes
- Looking at UK MOD, AIAA, ASME, NASA, DOE
Two phases
- This phase: What we need to know to evaluate an ABS
- Next Phase: Experiment with methods

ABOUT ABSVal (slide repeated, with its items annotated as receiving "lots of attention" or "little attention")

Schism
Agent-based simulations use modular rules and local reasoning to produce realistic and/or interesting emergent aggregate behavior.
- Surprise is good**
Successful simulation testing (core to face/results validation) based on demonstrating credibility across the range of potential input.
- Surprise not good**
** Refined later in this talk
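To make "modular rules and local reasoning" concrete, here is a minimal sketch of a toy agent-based model. It is not drawn from ISAAC, Pythagoras, or MANA; the ring layout, the majority rule, and the names `majority_step`, `count_boundaries`, and `run` are all assumptions chosen for illustration. Each agent sees only its two neighbors, yet the aggregate settles into stable blocs that no single rule describes.

```python
def majority_step(opinions):
    """Synchronously update a ring of 0/1 agents.

    Hypothetical local rule: each agent adopts the majority opinion
    among itself and its two immediate neighbors.
    """
    n = len(opinions)
    return [
        1 if opinions[i - 1] + opinions[i] + opinions[(i + 1) % n] >= 2 else 0
        for i in range(n)
    ]


def count_boundaries(opinions):
    """Aggregate measure: number of adjacent pairs on the ring that disagree."""
    n = len(opinions)
    return sum(opinions[i] != opinions[(i + 1) % n] for i in range(n))


def run(opinions, steps):
    """Apply the local rule for a fixed number of steps and return the final state."""
    state = list(opinions)
    for _ in range(steps):
        state = majority_step(state)
    return state


initial = [0, 1, 0, 0, 1, 1, 0, 1, 0, 0, 1, 1]
final = run(initial, steps=5)
print(count_boundaries(initial), count_boundaries(final))
```

The aggregate measure (disagreeing neighbor pairs) is the kind of emergent quantity a validation effort would have to reason about, since it is not written down anywhere in the agent rule itself.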

Questions
What activities can I reasonably support with my ABS?
What are the limits?
What caveats are necessary?
Compared with traditional simulation solutions, how are my results to be used?
How can I make credibility statements about a simulation that is out of my (top-down) control?
- Value of training experiences
- Value of analytical results
Can I support the scientific method with this ABS?

ABSVal Products
General, institutionally acceptable processes and criteria for assessing the validity of an agent-based simulation used as part of a DoD-level analysis
- What information?
- What assurances and endorsements?
- What desirable qualities?

Benefit
Increased awareness of the value of analysis results supported by agent-based simulation(s)
[Potential] Increased credibility of results
[Potential] More valuable agent-based simulations
[Potential] Responsible analytical application of ABS by OA professionals
[Potential] Civilization of the discourse concerning ABS-generated analysis results
Beneficiaries: HBR, VVA, All Military Organizations using ABS, Analysis, Planning, Experimentation, Training, Acquisition, …

GOAL OF SIMULATION
a) Aggregate effects you understand
b) Calculate probability of simultaneous/sequential events
c) Challenge user’s intuition**

SEEING THE INSIDE AND OUTSIDE
a) Depict agent’s behavior
b) Depict aggregation methods
c) Serial aggregation (building blocks)
d) Prose, pictures, diagrams, tests
e) Visualization of outcomes, trends, cause-effect

DATA
a) Model exists because of the data
b) Data exists because of the model
c) Accuracy, precision
d) Covering the possible truths
e) Propagating uncertainty & model sensitivity analysis

VALIDATION SURROGATES
a) History of successful uses
b) Credible believers
c) Large, mature user community
d) Transparency
e) Relative validity
f) (Over-?) Fitting historical data

First Principles

Surprise!
Diagram labels: Surprise, Explore, Explain, Accept/reject, Production Runs
How do we tell about this experience?
What abstractions, data values, shortcuts, tricks, intentional oversights, modifications to expectations, code changes were required?

Representation
Dynamic Influence
- 1st-order effect
- Direct influence
- Relevant over large interval
- Plausibly relevant over limited interval
- Possibly influential
- Minor detail
- No relevance
Distillation
- Include only the highly-relevant dynamics
- Aggregation of effects
- Referent often loose/missing
Completeness vs. Parsimony

Statistical Methods
Balance Predictiveness vs. Parsimony
$x_i$'s are the levels of dynamics included/excluded (capacities)
$Y$ is the response variable (utility)
$Y = f(x_1, x_2, \ldots, x_n)$
$DI = \dfrac{SSE_{\text{with}}/df}{SSE_{\text{without}}/df}$
Qualitative assessment meets Critical Values
Bailey, M. P. and W. G. Kemple, The Scientific Method of Choosing Model Fidelity, WSC Proceedings.
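The ratio on this slide can be illustrated with a small worked sketch. Several things here are assumptions rather than the authors' stated method: that f is fitted by ordinary least squares with an intercept, that each df is the residual degrees of freedom of its own fit, and that "with" is the numerator. The function names (`solve`, `sse_per_df`, `dynamic_influence`) and the data are hypothetical. Under that reading, a DI well below 1 suggests the candidate dynamic earns its place in the model.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def sse_per_df(columns, y):
    """Least-squares fit of y on an intercept plus the given columns; return SSE / residual df."""
    n = len(y)
    design = [[1.0] + [col[i] for col in columns] for i in range(n)]
    p = len(design[0])
    xtx = [[sum(row[j] * row[k] for row in design) for k in range(p)] for j in range(p)]
    xty = [sum(row[j] * yi for row, yi in zip(design, y)) for j in range(p)]
    beta = solve(xtx, xty)
    sse = sum(
        (yi - sum(b * v for b, v in zip(beta, row))) ** 2
        for row, yi in zip(design, y)
    )
    return sse / (n - p)  # assumption: df is the residual degrees of freedom of this fit


def dynamic_influence(included, candidate, y):
    """DI = (SSE_with / df) / (SSE_without / df) for one candidate dynamic."""
    return sse_per_df(included + [candidate], y) / sse_per_df(included, y)


# Made-up data: x1 is already in the model, x2 is the dynamic under consideration.
x1 = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]
x2 = [1.0, 0.0, 1.0, 0.0, 1.0, 0.0, 1.0, 0.0]
noise = [0.1, -0.2, 0.05, 0.15, -0.1, 0.2, -0.15, -0.05]
y = [2.0 + 3.0 * a + 5.0 * b + e for a, b, e in zip(x1, x2, noise)]

di_x2 = dynamic_influence([x1], x2, y)
print(f"DI for x2: {di_x2:.4f}")
```

Because x2 drives much of the response in this made-up data, including it shrinks the per-degree-of-freedom error sharply and DI comes out far below 1. How small DI must be to justify the added detail is the "critical values" question the slide raises, and this sketch does not settle it.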

Goals
Understanding the meaning of Valid Enough
Techniques for uncovering validation shortcomings
- in the presence of a weak referent
Expressing validation boundaries
Being conservative with VV&A resources
Framework
- transparent, traceable, repeatable, communicate-able

In Sum
Achieve the Goals of Simulation Validation for ABS
- Concentrate on analytical applications
- Test-case-driven & practical
- Institutional acceptability
- Vast collection of potential partners