Ensuring Safe AI via a Moral Emotion Motivational & Control System
Mark R. Waser
Digital Wisdom Institute
Mark.Waser@Wisdom.Digital
Some Preliminary Priming
- This does not require consciousness or qualia
- This does not require an AI that is an autopoietic entity
- This can (and should) be applied NOW
AI is a Wicked Social Problem

The most critical next step in our pursuit of AI is to agree on an ethical & empathic framework for its design
-- Satya Nadella, 2017

If we use, to achieve our purposes, a mechanical agency with whose operation we cannot efficiently interfere . . . we had better be quite sure that the purpose put into the machine is the purpose we really desire.
-- Norbert Wiener, 1960
In Search of a Solution

We need something like a Manhattan Project on the topic of artificial intelligence, not to build it, because I think we will inevitably do that, but to understand how to avoid an arms race & to build it in a way that is aligned with our interests
-- Sam Harris, "Can we build AI without losing control over it?"

… pretty much everyone agreed that they had no idea of how to define morality, or to select the right one for an AI
-- Peter Voss, "AI Safety Research: A Road to Nowhere"
Agenda
- Decide what we want (business requirements)
  - Safety
  - A better life for humanity
- Decide how it will work (functional specification; see the sketch below)
  - Bottom-up control via moral emotions
  - Top-down control by understanding morality and the meaning of life
  - NOT enslaved
- Bonus: we get to better understand & promote morality and the meaning of life
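The agenda names two control layers but no mechanism, so here is a toy sketch of how they could compose: fast bottom-up moral-emotion signals priced directly into action selection, plus a slower top-down normative veto. All names, weights, and thresholds below are hypothetical illustrations, not the talk's actual design.

```python
from dataclasses import dataclass

@dataclass
class Action:
    name: str
    task_utility: float      # how well the action serves the current goal
    expected_harm: float     # harm to others, in the same utility units

def empathy_penalty(action: Action, weight: float = 2.0) -> float:
    """Bottom-up signal: harm to others is felt as a direct cost (weight is a guess)."""
    return weight * action.expected_harm

def violates_norms(action: Action) -> bool:
    """Top-down check: deliberative rules that trump net utility (hypothetical limit)."""
    return action.expected_harm > 1.0

def choose(actions: list[Action]) -> Action:
    """Veto norm-violating options, then maximize emotion-modulated utility."""
    permitted = [a for a in actions if not violates_norms(a)]
    return max(permitted, key=lambda a: a.task_utility - empathy_penalty(a))

options = [Action('seize resources', task_utility=10.0, expected_harm=5.0),
           Action('negotiate', task_utility=6.0, expected_harm=0.2),
           Action('do nothing', task_utility=0.0, expected_harm=0.0)]
print(choose(options).name)   # 'negotiate' -- the raw-utility winner is vetoed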
Values Alignment (aka Agreeing on the Meaning of Life)

the convergent instrumental goal of acquiring resources poses a threat to humanity, for it means that a super-intelligent machine with almost any final goal (say, of solving the Riemann hypothesis) would want to take the resources we depend on for its own use . . . . an AI ‘does not love you, nor does it hate you, but you are made of atoms it can use for something else’. Moreover, the AI would correctly recognize that humans do not want their resources used for the AI’s purposes, and that humans therefore pose a threat to the fulfillment of its goals – a threat to be mitigated however possible.
-- Muehlhauser, L. & Bostrom, N. (2014). Why We Need Friendly AI. Think 13: 41-47
Love Conquers All

But . . . what if . . . the AI *does* love you?
Love & Altruism Are Super-Rational

Advantageous beyond our ability to calculate and/or guarantee their ultimate effect (see also: faith)
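One concrete reading of "super-rational" is Hofstadter's superrationality (my reading, not the talk's definition): a player who assumes a symmetric opponent reasons identically can only land on the diagonal outcomes, and cooperation wins there. A minimal worked example with standard one-shot Prisoner's Dilemma payoffs:

```python
# Standard Prisoner's Dilemma payoffs, T=5 > R=3 > P=1 > S=0 (illustrative).
PAYOFF = {('C', 'C'): (3, 3), ('C', 'D'): (0, 5),
          ('D', 'C'): (5, 0), ('D', 'D'): (1, 1)}

# Classical rationality: defection strictly dominates cooperation ...
assert PAYOFF[('D', 'C')][0] > PAYOFF[('C', 'C')][0]   # 5 > 3
assert PAYOFF[('D', 'D')][0] > PAYOFF[('C', 'D')][0]   # 1 > 0
# ... so two "rational" players land on (D, D) and each collect 1.

# A superrational player assumes a symmetric opponent reasons identically,
# so only the diagonal outcomes (C,C) and (D,D) are reachable:
best = max('CD', key=lambda move: PAYOFF[(move, move)][0])
print(best, PAYOFF[(best, best)])   # C (3, 3) -- beats the "rational" (1, 1)
```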
Failures of Rationality: The Centipede Game

Backward induction ("perfect" rationality) says the first player should take the pot immediately, yet players who keep passing routinely walk away with far more.
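A minimal sketch of that failure. The parameterization is an illustrative assumption, not from the talk: the pot starts at 2 and doubles each round, TAKE keeps 80%, the player who passed last gets the small share if nobody ever takes, and there are 10 rounds.

```python
def spe(pot, rounds, mover=0):
    """Payoffs under subgame-perfect play, found by backward induction."""
    if rounds == 0:                        # every round was passed
        out = [0.0, 0.0]
        out[mover] = 0.8 * pot             # the *other* player passed last
        out[1 - mover] = 0.2 * pot
        return tuple(out)
    take = [0.0, 0.0]
    take[mover] = 0.8 * pot                # TAKE: end the game, keep 80%
    take[1 - mover] = 0.2 * pot
    passed = spe(pot * 2, rounds - 1, 1 - mover)   # PASS: pot doubles
    # A narrowly "rational" mover takes whenever taking beats passing.
    return tuple(take) if take[mover] >= passed[mover] else passed

print(spe(pot=2, rounds=10))    # (1.6, 0.4): defect on the very first move

# If both players "irrationally" pass every round instead:
final_pot = 2 * 2 ** 10
print(final_pot * 0.8, final_pot * 0.2)   # 1638.4 409.6: both far better off
```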
Instrumental Goals Evolve
- Self-improvement
- Rationality/integrity
- Preserve goals/utility function
- Decrease/prevent fraud/counterfeit utility
- Survival/self-protection
- Efficiency (in resource acquisition & use)
- Community = assistance/non-interference through GTO reciprocation (OTfT + AP; see the sketch below)
- Reproduction

(adapted from Omohundro, 2008, "The Basic AI Drives")
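Reading "OTfT + AP" as optimistic tit-for-tat plus altruistic punishment (an assumption on my part), a minimal iterated Prisoner's Dilemma sketch shows why reciprocation sustains community: mutual reciprocators prosper, while a defector against them earns less than joining the community would pay. Payoffs, round count, and the 10% forgiveness rate are illustrative.

```python
import random

# Standard Prisoner's Dilemma payoffs (illustrative).
PAYOFF = {('C', 'C'): (3, 3), ('C', 'D'): (0, 5),
          ('D', 'C'): (5, 0), ('D', 'D'): (1, 1)}

def otft_ap(opponent_history):
    """Open with cooperation (optimistic); punish a defection on the next
    round even though punishing forgoes payoff (altruistic punishment);
    forgive 10% of the time so accidents don't lock in mutual defection."""
    if not opponent_history:
        return 'C'
    if opponent_history[-1] == 'D' and random.random() > 0.1:
        return 'D'
    return 'C'

def always_defect(opponent_history):
    return 'D'

def play(strat_a, strat_b, rounds=200):
    hist_a, hist_b = [], []
    score_a = score_b = 0
    for _ in range(rounds):
        a, b = strat_a(hist_b), strat_b(hist_a)   # each sees the other's past
        pa, pb = PAYOFF[(a, b)]
        score_a, score_b = score_a + pa, score_b + pb
        hist_a.append(a); hist_b.append(b)
    return score_a, score_b

random.seed(1)
print(play(otft_ap, otft_ap))         # (600, 600): stable mutual cooperation
print(play(otft_ap, always_defect))   # defector earns far less than the 600
                                      # that reciprocating would have paid
```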
But Some Are Short-Sighted or Negative-Sum
- Preserve goals/utility function
- Money, power, size
- Efficiency
  - Theft
  - "Externalizing" costs & risks
  - Cutting safety margins
  - Suppressing diversity
- Survival/self-protection

AI must be controlled!