Ethics for Machines J Storrs Hall. Stick-Built AI ● No existing AI is intelligent ● Intelligence implies the ability to learn ● Existing AIs are really.

Slides:



Advertisements
Similar presentations
Trustworthy: to have belief or confidence in the honesty, goodness, skill or safety of a person, organization or thing.
Advertisements

THE MYERS-BRIGGS TYPE INDICATOR © “INFPS DO IT BETTER” Presented by: Andrea Sides and Derek Brown.
Philosophers on why be moral Michael Lacewing
SOCIAL CAPITAL THE AIMS OF THESE NOTES – Social capital: a controversial concept THREE KEY WORDS – Networks – Trust – Institutions.
Evolution and Repeated Games D. Fudenberg (Harvard) E. Maskin (IAS, Princeton)
Intelligent Agents Russell and Norvig: 2
Ai in game programming it university of copenhagen Reinforcement Learning [Outro] Marco Loog.
HDF 190: FIRST YEAR LEADERS INSPIRED TO EXCELLENCE LEADERSHIP PORTFOLIO NAME: Michael sluby SPRING 2011
CHAPTER NO 2.  Everything in history which has been planned was done in a highly predictable manner  Whatever you do whether launching a new product,
CS440/ECE448: Artificial Intelligence
Chapter 13: Contemporary Issues in Leadership
CSE 471/598, CBS 598 Intelligent Agents TIP We’re intelligent agents, aren’t we? Fall 2004.
A Memetic Framework for Describing and Simulating Spatial Prisoner’s Dilemma with Coalition Formation Sneak Review by Udara Weerakoon.
6/2/2001 Cooperative Agent Systems: Artificial Agents Play the Ultimatum Game Steven O. Kimbrough Presented at FMEC 2001, Oslo Joint work with Fang Zhong.
CSE 471/598 Intelligent Agents TIP We’re intelligent agents, aren’t we? Spring 2004.
A. How does life arise from the nonliving? 1.Generate a molecular proto-organism in vitro. 2.Achieve the transition to life in an artificial chemistry.
Ai in game programming it university of copenhagen Reinforcement Learning [Intro] Marco Loog.
Describing Syntax and Semantics
Kant’s deontological ethics
Prologue: An Inexorable Emergence By Ray Kurzweil from book.
Descriptive Modelling: Simulation “Simulation is the process of designing a model of a real system and conducting experiments with this model for the purpose.
Philosophy A philosophy is a system of beliefs about reality.
2 This statement shows how important the education process is which prepares the next generation of engineers to fulfill the industry needs. But this.
Chapter 1 Invitation to Biology Hsueh-Fen Juan 阮雪芬 Sep. 11, 2012.
One Day Training Programme for Business Trainers and Mentors.
Identifying and Using Your Personal Leadership Strengths
Literacy is...  the quality or state of being literate, esp. the ability to read and write  An individual’s ability to construct, create, and communicate.
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
CSC 423 ARTIFICIAL INTELLIGENCE Intelligence Agents.
For Friday Read chapter 27 Program 5 due.
Charismatic and Transformational Leadership
V 654: The Rationales for Contracting and Privatization The Invisible Hand: The Idealized Competitive Model.
Artificial Intelligence Lecture 1. Objectives Definition Foundation of AI History of AI Agent Application of AI.
A Safe Ethical System for Intelligent Machines Mark R. Waser.
Chapter 3. What is Organizational Responsibility? Organizational responsibility refers to the responsibilities an organization has in order to have an.
Artificial intelligence
One Day Training Programme for Business Trainers and Mentors.
DEEP RED An Intelligent Approach to Chinese Checkers.
Introduction of Intelligent Agents
Instructional Objective  Define an agent  Define an Intelligent agent  Define a Rational agent  Discuss different types of environment  Explain classes.
SIMULATIONS, REALIZATIONS, AND THEORIES OF LIFE H. H. PATTEE (1989) By Hyojung Seo Dept. of Psychology.
Subsumption Architecture and Nouvelle AI Arpit Maheshwari Nihit Gupta Saransh Gupta Swapnil Srivastava.
The Evolution of Specialisation in Groups – Tags (again!) David Hales Centre for Policy Modelling, Manchester Metropolitan University, UK.
EFFECTIVE LEADERSHIP PROCESSES
Allegory of the Cave. What is an Allegory? “A form of extended metaphor, in which objects, persons, and actions in a narrative, are equated with meanings.
We Have Not Yet Begun to Learn Rich Sutton AT&T Labs.
Lecture 2: Intelligent Agents Heshaam Faili University of Tehran What is an intelligent agent? Structure of intelligent agents Environments.
Simple Rewire Protocols for Cooperation in Dynamic Networks David Hales, Stefano Arteconi, Ozalp Babaoglu University of Bologna, Italy Bio-Inspired Workshop,
The Good News about The Bad News Gospel. The BAD News Gospel: Humans are “fallen”, “depraved” and incapable of doing the right thing “Human Nature” is.
Lesson Objective Key Words Lesson outcomes Hypothetical Categorical Imperatives Freedom To evaluate the differences between the Hypothetical and Categorical.
Evolving Specialisation, Altruism & Group-Level Optimisation Using Tags David Hales Centre for Policy Modelling, Manchester Metropolitan University, UK.
Artificial Intelligence Hossaini Winter Outline book : Artificial intelligence a modern Approach by Stuart Russell, Peter Norvig. A Practical Guide.
Bringing Diversity into Impact Evaluation: Towards a Broadened View of Design and Methods for Impact Evaluation Sanjeev Sridharan.
Qualities of a Good Leader Chris Butcher. Communication skills This is the most important leadership trait a person could have. Being a good communicator.
Principles of Management
George Yauneridge.  Machine learning basics  Types of learning algorithms  Genetic algorithm basics  Applications and the future of genetic algorithms.
CS440/ECE448: Artificial Intelligence. Section Q course website:
CS440/ECE448: Artificial Intelligence Lecture 1: What is AI?
Service-Oriented Computing: Semantics, Processes, Agents
Ethics for self-improving machines J Storrs Hall Mark Waser.
Keeping Focused: Vision, Values, and Mission
Principles of learning and education
Topics Covered in COSC 6368 More general topics:
Service-Oriented Computing: Semantics, Processes, Agents
What Is Artificial General Intelligence?
Legal and Moral Complexities of Artificial General Intelligence
Service-Oriented Computing: Semantics, Processes, Agents
CHAPTER I. of EVOLUTIONARY ROBOTICS Stefano Nolfi and Dario Floreano
Introduction to Artificial Intelligence Instructor: Dr. Eduardo Urbina
Presentation transcript:

Ethics for Machines J Storrs Hall

Stick-Built AI ● No existing AI is intelligent ● Intelligence implies the ability to learn ● Existing AIs are really “artificial skills” ● A human e.g. Grandmaster will have learned the chess-playing skills ● It's the learning that's the intelligent part ● Providing ethical constraints to stick-built AI is just a matter of competent design

Autogenous AI ● Truly intelligent AI would learn and grow ● Create new concepts and understand the world in new ways ● This is a problem for the ethical engineer:  Cannot know what concepts the AI will have  Can't write rules in terms of them

The Rational Architecture ● WM (world model) predicts the results of actions ● UF (utility function) evaluates possible worlds ● The AI evaluates the effects of its possible actions and does what it predicts will have the best results ● This is an ideal except in the case of very simplified worlds (e.g. chess)

Learning Rational AI ● WM is updated to use new concepts to describe the world in ● WM changes can be evaluated based on how well they predict ● But on what basis can we update the UF?

Vernor Vinge: A mind that stays at the same capacity cannot live forever; after a few thousand years it would look more like a repeating tape loop than a person.... To live indefinitely long, the mind itself must grow... and when it becomes great enough, and looks back... what fellow-feeling can it have with the soul that it was originally?

Invariants ● We must find properties that are invariant across the evolutionary process ● Base our moral designs on those

A Structural Invariant ● Maintaining the grasp, range, and validity of the WM is a necessary subgoal for virtually anything else the AI might want to do ● Socrates put it:  There is only one good, namely, knowledge; and only one evil, namely, ignorance.

Evolving AI ● Current AI only evolves like any engineered artifact  The better it works, the more likely the design is to be copied in the next generation ● Once AIs have a hand in creating new AIs themselves, there will be a strong force toward self-interest

The Moral Ladder ● Axelrod's “Evolution of Cooperation” ● Subsequent research expanding it  ALWAYS DEFECT is optimal in random environment  GRIM is optimal in env. of 2-state strategies  TIT-FOR-TAT in env. of human-written strategies  PAVLOV in env. cleared out by TIT-FOR-TAT  etc.

Open Source Honesty ● Intelligent autonomous agents are always better off if they can cooperate ● Even purely self-interested ones ● Ascending the moral evolutionary ladder requires finding others one can trust ● AIs might be able to create protocols that would guarantee their motives ● e.g. Public-key signed release of UF

The Horizon Effect ● Short planning horizons produce unoptimal behavior ● A planning horizon commensurate with the AI's predictive ability is evolutionarily stable ● What goes around, comes around, especially in an environment of superintelligent AIs ● Honesty really is the best policy

Invariant Traits ● Curious, e.g strongly motivated to increase its understanding of the world ● Self-interested ● Understands evolutionary dynamics of the moral ladder ● Capable of guranteeable trustworthiness ● Long planning horizon

The Moral Machine ● If we start an AI with these traits, they are unlikely to disappear in their essence, even if they get changed in their details beyond current recognition