Thanks to G. Bradshaw, W. Bridewell, S. Dzeroski, H. A. Simon, L. Todorovski, R. Valdes-Perez, and J. Zytkow for discussions that led to many of these.

Slides:



Advertisements
Similar presentations
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, CA 94304
Advertisements

Pat Langley Dileep George Stephen Bay Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford,
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California
Pat Langley Institute for the Study of Learning and Expertise Javier Sanchez CSLI / Stanford University Ljupco Todorovski Saso Dzeroski Jozef Stefan Institute.
Pat Langley Institute for the Study of Learning and Expertise and Center for the Study of Language and Information Stanford University, Stanford, California.
Pat Langley Institute for the Study of Learning and Expertise Palo Alto, California and Center for the Study of Language and Information Stanford University,
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona Computational Discovery.
Pat Langley Institute for the Study of Learning and Expertise 2164 Staunton Court, Palo Alto, California and School of Computing and Informatics Arizona.
Next Generation Science Standards Intro to NGSS.
Copyright © Allyn & Bacon (2007) Research is a Process of Inquiry Graziano and Raulin Research Methods: Chapter 2 This multimedia product and its contents.
Copyright © Allyn & Bacon (2010) Research is a Process of Inquiry Graziano and Raulin Research Methods: Chapter 2 This multimedia product and its contents.
Understanding the Research Process
Post-Positivist Perspectives on Theory Development
Roy Kennedy Massachusetts Bay Community College Wellesley Hills, MA Introductory Chemistry, 3 rd Edition Nivaldo Tro 2009, Prentice Hall Chapter 1 The.
Pat Langley School of Computing and Informatics Arizona State University Tempe, Arizona Institute for the Study of Learning and Expertise Palo Alto, California.
Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California
Machine Creativity. Outline BackgroundBackground –The problem and its importance. –The known algorithms and systems. Summary of the Creativity Machine.
Developing Ideas for Research and Evaluating Theories of Behavior
Computational Thinking Related Efforts. CS Principles – Big Ideas  Computing is a creative human activity that engenders innovation and promotes exploration.
Virginia Standard of Learning BIO.1a-m
Thanks to K. Arrigo, G. Bradshaw, S. Borrett, W. Bridewell, S. Dzeroski, H. Simon, L. Todorovski, and J. Zytkow for their contributions to this research,
Achieving Authentic Inquiry in Your Classroom Presented by Eric Garber.
Thursday 9/22 Date Activity Page 9/22Hypotheses, Theories and Laws.
1.What is science? 2.Why should we study science? 3.What did we do before science? 4.What role does Math have in Science? What We Will Address.
Gregor Johann Mendel Studied peas in his monastry garden Kept careful records of all procedures and findings “controlled” pollination between plants with.
Knowledge representation
1 Brief Review of Research Model / Hypothesis. 2 Research is Argument.
Taxonomies and Laws Lecture 10. Taxonomies and Laws Taxonomies enumerate scientifically relevant classes and organize them into a hierarchical structure,
Big Idea 1: The Practice of Science Description A: Scientific inquiry is a multifaceted activity; the processes of science include the formulation of scientifically.
Educational Psychology Define and contrast descriptive, correlational and experimental studies, giving examples of how each of these have been used in.
Discovering Dynamic Models Lecture 21. Dynamic Models: Introduction Dynamic models can describe how variables change over time or explain variation by.
Nature of Science. NOS Card Exchange Step 1: Obtain 8 cards (that are different from one another). Step 2: Trade cards with classmates in order to amass.
1 Issues in Assessment in Higher Education: Science Higher Education Forum on Scientific Competencies Medellin-Colombia Nov 2-4, 2005 Dr Hans Wagemaker.
ITR: Collaborative research: software for interpretation of cosmogenic isotope inventories - a combination of geology, modeling, software engineering and.
Science This introductory science course is a prerequisite to other science courses offered at Harrison Trimble. Text: Nelson, Science 10 Prerequisite:
Pat Langley Adam Arvay Department of Computer Science University of Auckland Auckland, NZ Heuristic Induction of Rate-Based Process Models Thanks to W.
Introduction to Research
Copyright © by Holt, Rinehart and Winston. All rights reserved. ResourcesChapter menu Copyright © by Holt, Rinehart and Winston. All rights reserved. ResourcesChapter.
The Field of Psychology Gaining Insight into Behavior Behavior results from physiological (physical) processes and cognitive (intellectual) processes.
RHS 303. TRANSITION OF THEORY AND TREATMENT nature of existence and gives meaning to and guides the action Philosophical Base: Philosophy of occupational.
Science & Technology Ch.1.
Introduction to Science Informatics Lecture 1. What Is Science? a dependence on external verification; an expectation of reproducible results; a focus.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Developing and Evaluating Theories of Behavior.
What Is Science?. Learning Objectives  State the goals of science.  Describe the steps used in scientific methodology.
WHAT IS THE NATURE OF SCIENCE?. SCIENTIFIC WORLD VIEW 1.The Universe Is Understandable. 2.The Universe Is a Vast Single System In Which the Basic Rules.
Discovering Descriptive Knowledge Lecture 18. Descriptive Knowledge in Science In an earlier lecture, we introduced the representation and use of taxonomies.
Introduction to Physical Science. A-Science- A-Science- Is a way of learning about the universe and it’s natural laws (Gravity) 1- Skills of scientist.
Chapter 1 The Chemical World. Tro's "Introductory Chemistry", Chapter 1 2 What Is Chemistry? What chemists try to do is discover the relationships between.
Discovering Structural Models Lecture 19. Structural Models in Science Structural models encode the spatial relationships among the components of some.
What Is Chemistry? What chemists try to do is discover the relationships between the particle structure of matter and the properties of matter we observe.
Unpacking the Elements of Scientific Reasoning Keisha Varma, Patricia Ross, Frances Lawrenz, Gill Roehrig, Douglas Huffman, Leah McGuire, Ying-Chih Chen,
Lesson Overview Lesson Overview What Is Science? Lesson Overview 1.1 What Is Science?
Chapter 1 – Introducing Psychology Section 1 - Why Study Psychology Section 2 – A Brief History in Psychology Section 3 – Psychology as a Profession.
Dendral: A Case Study Lecture 25.
Welcome to Physics--Jump in!
SOCIOLOGICAL RESEARCH Importance of social research Help solve social problems by understanding how they come about, and why they persist. Makes clear.
Models The PMT is one of many models used in science. A model is anything that allows us to better understand a concept. It may be something to look at.
Chapter 1 Introduction to Research in Psychology.
 Continue to develop a common understanding of what STEM education is/could/should be here at Killip.
WHAT MODELS DO THAT THEORIES CAN’T Lilia Gurova Department of Cognitive Science and Psychology New Bulgarian University.
Objectives The objectives of this lecture is to:
International Conference on Sequence Analysis and Related Methods
Introduction to Physical Science
Lee, Jung-Woo Interdisciplinary Program in Cognitive Science
Pat Langley Department of Computer Science University of Auckland
Chapter 1 The Nature Of Chemistry
Developing and Evaluating Theories of Behavior
Introduction to Psychology Chapter 1
Discovery Informatics
Presentation transcript:

Thanks to G. Bradshaw, W. Bridewell, S. Dzeroski, H. A. Simon, L. Todorovski, R. Valdes-Perez, and J. Zytkow for discussions that led to many of these ideas, which was partly funded by ONR Grant No. N Pat Langley Department of Computer Science University of Auckland Silicon Valley Campus Carnegie Mellon University Themes and Progress in Computational Scientific Discovery

Examples of Scientific Discoveries Science is a distinguished by its reliance on formal laws, models, and theories of observed phenomena. We often refer to the process of finding such accounts as scientific discovery. Kepler’s laws of planetary motion Newton’s theory of gravitation Krebs’ citric acid cycle Dalton’s atomic theory

Philosophy of Science  character of scientific observations and experiments;  structure of scientific theories, laws, and models;  nature of scientific explanations and predictions;  evaluation of scientific theories, models, and laws. The philosophy of science has studied science since the 19th Century, focusing on the: However, philosophers of science typically avoided the topic of scientific discovery.

Many have claimed that scientific discovery cannot be analyzed in rational terms. Popper (1934) wrote: The initial stage, the act of conceiving or inventing a theory, seems to me neither to call for logical analysis nor to be susceptible of it … My view may be expressed by saying that every discovery contains an ‘irrational element’, or ‘a creative intuition’ … He was not alone. Hempel and many others believed discovery was inherently irrational and beyond understanding. Mystical Views of Scientific Discovery

His ideas provided a powerful new approach to understanding the nature of scientific discovery. Moreover, it offered ways to automate this mysterious process. Scientific Discovery as Problem Solving  Search through a space of connected problem states;  Generated from earlier states by mental operators;  Guided by heuristics that keep the search tractable. Simon (1966) offered another view – that scientific discovery is a variety of problem solving that involves:

The Task of Scientific Discovery We can state the discovery task in terms of the inputs provided and the outputs produced:   Given: A set of scientific data or phenomena to be modeled;   Given: A space of candidate laws, hypotheses, or models stated in an established scientific formalism;   Given: Knowledge and heuristics for the scientific domain;   Find: Laws or models that describe or explain the data or phenomena (and that generalize well). We can develop AI systems that carry out search through this space of candidate accounts.

Some Laws Discovered by Bacon (Langley et al., 1983) Basic algebraic relations: Ideal gas lawPV = aNT + bN Kepler’s third lawD 3 = [(A – k) / t] 2 = j Coulomb’s lawFD 2 / Q 1 Q 2 = c Ohm’s lawTD 2 / (LI – rI) = r Relations with intrinsic properties: Snell’s law of refractionsin I / sin R = n 1 / n 2 Archimedes’ lawC = V + i Momentum conservationm 1 V 1 = m 2 V 2 Black’s specific heat lawc 1 m 1 T 1 + c 2 m 2 T 2 = (c 1 m 1 + c 2 m 2 ) T f

Bacon.1–Bacon.5 Abacus, Coper Fahrehneit, E*, Tetrad, IDS N Hume, ARC DST, GP N LaGrange SDS SSF, RF5, LaGramge Dalton, Stahl RL, Progol Gell-Mann BR-3, Mendel Pauli Stahlp, Revolver  Dendral  AM GlauberNGlauber IDS Q, Live IE Coast, Phineas, AbE, Kekada Mechem, CDP Astra, GP M HR BR-4 Numeric lawsQualitative lawsStructural modelsProcess models Legend Research on computational scientific discovery covers many forms of laws and models. Early Progress in Scientific Discovery Most early work focused on historical examples, but more recent efforts have aided the scientific enterprise.

Successes of Computational Scientific Discovery AI systems of this type have helped to discover new knowledge in many scientific fields: Qualitative chemical factors in mutagenesis (King et al., 1996) Quantitative laws of metallic behavior (Sleeman et al., 1997) Quantitative conjectures in graph theory (Fajtlowicz et al., 1988)   Qualitative conjectures in number theory (Colton et al., 2000) Temporal laws of ecological behavior (Todorovski et al., 2000) Reaction pathways in catalytic chemistry (Valdes-Perez, 1994, 1997) Each of these has led to publications in the refereed literature of the relevant scientific field (Langley, 2000).

Books on Scientific Discovery Research on computational scientific discovery has produced a number of books on the topic. These further demonstrate the diversity of problems and methods while emphasizing their underlying unity

 Emphasized the availability of large amounts of data;  Used computational methods to find regularities in the data;  Adopted heuristic search through a space of hypotheses;  Initially focused on commercial applications and data sets. During the 1990s, a new paradigm known as data mining and knowledge discovery emerged that: Most work used notations invented by computer scientists, unlike work on scientific discovery, which used scientific formalisms. Data mining has been applied to scientific data, but the results seldom bear a resemblance to scientific knowledge. The Data Mining Movement

Discovering Explanatory Models The early stages of any science focus on descriptive laws that summarize empirical regularities. Mature sciences instead emphasize the creation of models that explain phenomena in terms of: Inferred components and structures of entities; Hypothesized processes about entities’ interactions. Explanatory models move beyond description to provide deeper accounts linked to theoretical constructs. Can we develop computational systems that address this more sophisticated side of scientific discovery?

Classic Work: DENDRAL (Lindsay et al.,1980) The DENDRAL system inferred a molecule’s chemical bonds given its component formula and a mass spectrogram. E.g., from the formula C 6 H 5 OH and other relevant information, the program produced structures like: HCHC HCOH HCHC CHCH CHCH C DENDRAL relied on heuristic search to infer structural models, using knowledge from 20 th Century chemistry as a guide.

Classic Work: MECHEM (Valdes-Perez, 1994) MECHEM was a graphical interactive system that generated plausible pathways to explain chemical reactions. The screenshot shows, clock- wise from the upper left: the main menu; the current reaction; a sample mechanism; a set of constraints; and the system’s output log. These made MECHEM more accessible to chemists.

Inductive Process Modeling (Bridewell et al., 2008) process exponential_growth variables: P {population} equations: d[P,t] = [0, 1,  ]  P process logistic_growth variables: P {population} equations: d[P,t] = [0, 1,  ]  P  (1  P / [0, 1,  ]) process constant_inflow variables: I {inorganic_nutrient} equations: d[I,t] = [0, 1,  ] process consumption variables: P1 {population}, P2 {population}, nutrient_P2 equations: d[P1,t] = [0, 1,  ]  P1  nutrient_P2, d[P2,t] =  [0, 1,  ]  P1  nutrient_P2 process no_saturation variables: P {number}, nutrient_P {number} equations: nutrient_P = P process saturation variables: P {number}, nutrient_P {number} equations: nutrient_P = P / (P + [0, 1,  ]) model AquaticEcosystem variables: nitro, phyto, zoo, nutrient_nitro, nutrient_phyto observables: nitro, phyto, zoo process phyto_exponential_growth equations: d[phyto,t] = 0.1  phyto process zoo_logistic_growth equations: d[zoo,t] = 0.1  zoo / (1  zoo / 1.5) process phyto_nitro_consumption equations: d[nitro,t] =  1  phyto  nutrient_nitro, d[phyto,t] = 1  phyto  nutrient_nitro process phyto_nitro_no_saturation equations: nutrient_nitro = nitro process zoo_phyto_consumption equations: d[phyto,t] =  1  zoo  nutrient_phyto, d[zoo,t] = 1  zoo  nutrient_phyto process zoo_phyto_saturation equations: nutrient_phyto = phyto / (phyto + 0.5) Heuristic Search observations generic processes process model phyto, nitro, zoo, nutrient_nitro, nutrient_phyto entities constraints Always-together[growth(P), loss(P)] Exactly-one[lotka-volterra(P, G), ivlev(P, G), watts(P, G)] At-most-one[photoinhibition(P, E)] Necessary[nutrient-mixing(N), remineralization(N, D)]

Recent Progress: Biological Models King et al. (2009) have constructed an integrated system for biological discovery that: Designs auxotrophic growth studies with yeast gene knockouts; Runs these experiments using a robotic manipulator; Measures the growth rates for each experimental condition; and Revises its causal model for how genes influence metabolism. This closes the loop between experiment design, data collection, and model construction in biology. But note that Zytkow et al. (AAAI-90) reported an even earlier robot scientist in the field of electrochemistry.

Recent Progress: Cosmogenic Dating Anderson et al. (2014) report ACE, an AI system for cosmogenic dating that: Designs inputs nucleotide densities for rocks from a landform; Generates process accounts for how the landform was produced; Weighs arguments for and against each process explanation. ACE has been downloaded ~600 times and is used actively by many geologists to understand their data. The system is user-extensible and, years after it launch, has led to zero requests for help from computer scientists.

Big Data and Scientific Discovery  Scaling to large and heterogeneous data sets;  Scaling to large and complex scientific models;  Scaling to large spaces of candidate models. Digital collection and storage have led to rapid growth of data in many areas. The big data movement seeks to capitalize on this content, but, in science at least, we must address three distinct issues: We need far more work on the last two issues, for which methods from computational scientific discovery are well suited.

Summary Remarks There has been a long history of work on computational scientific discovery, including methods for constructing:   Descriptive laws stated as numeric equations   Explanatory models of structures and processes Recent research has focused on the latter, which is associated with more mature fields of science. Work in this paradigm discovers knowledge stated in formalisms and concepts that are familiar to scientists. Challenges involve dealing not with ‘big data’, but with complex models and large search spaces.

Publications on Computational Scientific Discovery Bridewell, W., & Langley, P. (2010). Two kinds of knowledge in scientific discovery. Topics in Cognitive Science, 2, 36–52. Bridewell, W., Langley, P., Todorovski, L., & Dzeroski, S. (2008). Inductive process modeling. Machine Learning, 71, Bridewell, W., Sanchez, J. N., Langley, P., & Billman, D. (2006). An interactive environment for the modeling and discovery of scientific knowledge. International Journal of Human-Computer Studies, 64, Dzeroski, S., Langley, P., & Todorovski, L. (2007). Computational discovery of scientific knowledge. In S. Dzeroski & L. Todorovski (Eds.), Computational discovery of communicable scientific knowledge. Berlin: Springer. Langley, P. (2000). The computational support of scientific discovery. International Journal of Human-Computer Studies, 53, 393–410. Langley, P., Simon, H. A., Bradshaw, G. L., & Zytkow, J. M. (1987). Scientific discovery: Computational explorations of the creative processes. Cambridge, MA: MIT Press. Langley, P., & Zytkow, J. M. (1989). Data-driven approaches to empirical discovery. Arti- ficial Intelligence, 40, 283–312. Todorovski, L., Bridewell, W., & Langley, P. (2012). Discovering constraints for inductive process modeling. Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence. Toronto: AAAI Press.

In Memoriam Herbert A. Simon (1916 – 2001) In 2001, the field of computational scientific discovery lost two of its founding fathers. Both were interdisciplinary researchers who published in computer science, psychology, philosophy, and statistics. Herb Simon and Jan Zytkow were excellent role models for us all. Jan M. Zytkow (1945 – 2001)

End of Presentation