Presentation is loading. Please wait.

Presentation is loading. Please wait.

Towards Robot Theatre Marek Perkowski Department of Electrical and Computer Engineering, Portland State University, Portland, Oregon, 97207-0751.

Similar presentations


Presentation on theme: "Towards Robot Theatre Marek Perkowski Department of Electrical and Computer Engineering, Portland State University, Portland, Oregon, 97207-0751."— Presentation transcript:

1 Towards Robot Theatre Marek Perkowski Department of Electrical and Computer Engineering, Portland State University, Portland, Oregon, 97207-0751

2 Week 2 Lectures 3 and 4

3 Humanoid Robots and Robot Toys

4 Talking Robots Many talking robots exist, but they are still very primitive Work with elderly and disabled Actors for robot theatre, agents for advertisement, education and entertainment. Designing inexpensive natural size humanoid caricature and realistic robot heads We concentrate on Machine Learning techniques used to teach robots behaviors, natural language dialogs and facial gestures. Dog.com from Japan Work in progress

5 Robot with a Personality? Future robots will interact closely with non-sophisticated users, children and elderly, so the question arises, how they should look like? If human face for a robot, then what kind of a face? Handsome or average, realistic or simplified, normal size or enlarged? Why is Kismet so successful? We believe that a robot that will interact with humans should have some kind of “personality” and Kismet so far is the only robot with “personality”. The famous example of a robot head is Kismet from MIT.

6 Robot face should be friendly and funny The Muppets of Jim Henson are hard to match examples of puppet artistry and animation perfection. We are interested in robot’s personality as expressed by its: –behavior, –facial gestures, –emotions, –learned speech patterns.

7 Behavior, Dialog and Learning Robot activity as a mapping of the sensed environment and internal states to behaviors and new internal states (emotions, energy levels, etc). Our goal is to uniformly integrate verbal and non-verbal robot behaviors. Words communicate only about 35 % of the information transmitted from a sender to a receiver in a human-to-human communication. The remaining information is included in para-language. Emotions, thoughts, decision and intentions of a speaker can be recognized earlier than they are verbalized. NASA

8

9 Morita’s Theory

10 Robot Metaphors and Models

11 Animatronic “Robot” or device brain effectors

12 Perceiving “Robot” brain sensors

13 Reactive Robot is the simplest behavioral robot Brain is a mapping sensors This is the simplest robot that satisfies the definition of a robot effectors

14 Reactive Robot in environment brain sensors This is the simplest robot that satisfies the definition of a robot effectors ENVIRONMENT is a feedback

15 Braitenberg Vehicles and Quantum Automata Robots

16 Another Example: Braitenberg Vehicles and Quantum BV

17 Braitenberg Vehicles

18 Emotional Robot has a simple form of memory or state Brain is a Finite State Machine sensors This is the simplest robot that satisfies the definition of a robot effectors

19 Behavior as an interpretation of a string Newton, Einstein and Bohr. Hello Professor Hello Sir Turn Left. Turn right. behavior

20 Behavior as an interpretation of a tree Newton, Einstein and Bohr. Hello Professor Hello Sir Turn Left. Turn right. behavior Grammar. Derivation. Alphabets.

21 Our Base Model and Designs

22 Neck and upper body movement generation

23 Robot Head Construction, 1999 Furby head with new control Jonas Jonas We built and animated various kinds of humanoid heads with from 4 to 20 DOF, looking for comical and entertaining values. High school summer camps, hobby roboticists, undergraduates

24 Mister Butcher 4 degree of freedom neck Latex skin from Hollywood

25 Robot Head Construction, 2000 Skeleton Alien We use inexpensive servos from Hitec and Futaba, plastic, playwood and aluminum. The robots are either PC-interfaced, use simple micro-controllers such as Basic Stamp, or are radio controlled from a PC or by the user.

26 Adam Marvin the Crazy Robot Technical Construction, 2001 Details

27 Virginia Woolf heads equipped with microphones, USB cameras, sonars and CDS light sensors 2001

28 Max Image processing and pattern recognition uses software developed at PSU, CMU and Intel (public domain software available on WWW). Software is in Visual C++, Visual Basic, Lisp and Prolog. BUG (Big Ugly Robot) 2002

29 Visual Feedback and Learning based on Constructive Induction 2002 Uland Wong, 17 years old

30 Professor Perky 1 dollar latex skin from China We compared several commercial speech systems from Microsoft, Sensory and Fonix. Based on experiences in highly noisy environments and with a variety of speakers, we selected Fonix for both ASR and TTS for Professor Perky and Maria robots. We use microphone array from Andrea Electronics. Professor Perky with automated speech recognition (ASR) and text-to-speech (TTS) capabilities 2002, Japan

31 Maria, 2002/2003 20 DOF

32 Construction details of Maria location of controlling rods location of head servos location of remote servos Custom designed skin skull

33 Animation of eyes and eyelids

34 Cynthia, 2004, June

35 Currently the hands are not moveable. We have a separate hand design project.

36 Software/Hardware Architecture Network- 10 processors, ultimately 100 processors. Robotics Processors. ACS 16 Speech cards on Intel grant More cameras Tracking in all robots. Robotic languages – Alice and Cyc-like technologies.

37 Face detection localizes the person and is the first step for feature and face recognition. Acquiring information about the human: face detection and recognition, speech recognition and sensors.

38 Face features recognition and visualization.

39 Use of Multiple- Valued (five- valued) variables Smile, Mouth_Open and Eye_Brow_Raise for facial feature and face recognition.

40 HAHOE KAIST ROBOT THEATRE, KOREA, SUMMER 2004 Sonbi, the Confucian ScholarPaekchong, the bad butcher Czy znacie dobra sztuke dla teatru robotow?

41 Editing movements

42 Yangban the Aristocrat and Pune his concubine The Narrator

43

44

45 We base all our robots on inexpensive radio- controlled servo technology.

46 We are familiar with latex and polyester technologies for faces Martin Lukac and Jeff Allen wait for your help, whether you want to program, design behaviors, add muscles, improve vision, etc.

47 New Silicone Skins

48 A simplified diagram of software explaining the principle of using machine learning based on constructive induction to create new interaction modes of a human and a robot.

49 Probabilistic and Finite State Machines

50 Probabilistic State Machines to describe emotions Happy state Ironic state Unhappy state “you are beautiful” / ”Thanks for a compliment” “you are blonde!” / ”I am not an idiot” P=1 P=0.3 “you are blonde!” / Do you suggest I am an idiot?” P=0.7

51 Facial Behaviors of Maria Do I look like younger than twenty three? Maria asks:  “yes”  “no” 0.3 0.7 Response: Maria smiles Maria frowns

52 Probabilistic Grammars for performances Who? What? Where? Speak ”Professor Perky”, blinks eyes twice Speak “In the classroom”, shakes head P=0.1 Speak “Was drinking wine” P=0.1 P=0.3 P=0.5 Speak ”Professor Perky” Speak ”Doctor Lee” Speak “in some location”, smiles broadly Speak “Was singing and dancing” P=0.5 P=0.1 …. P=0.1

53 Human-controlled modes of dialog/interaction Robot asks Human teaches Human commandsHuman asks Robot performs “Hello Maria” “Thanks, I have a question” “Thanks, I have a lesson” “Thanks, I have a command” “Lesson finished” “Questioning finished” “Command finished” “Stop performance” “Question”

54 Dialog and Robot’s Knowledge

55 Robot-Receptionist Initiated Conversation Robot What can I do for you? Human Robot asks This represents operation mode

56 Robot-Receptionist Initiated Conversation Robot What can I do for you? Human I would like to order a table for two Robot asks

57 Robot-Receptionist Initiated Conversation Robot Smoking or non- smoking? Human Robot asks

58 Robot-Receptionist Initiated Conversation Robot Smoking or non- smoking? Human I do not understand Robot asks

59 Robot-Receptionist Initiated Conversation Robot Do you want a table in a smoking or non-smoking section of the restaurant? Non-smoking section is near the terrace. Human Robot asks

60 Robot-Receptionist Initiated Conversation Robot Do you want a table in a smoking or non-smoking section of the restaurant? Non-smoking section is near the terrace. Human A table near the terrace, please Robot asks

61 Human-Initiated Conversation Robot Human Hello Maria Robot asks initialization

62 Human-Initiated Conversation Robot Human Hello Maria What can I do for you? Robot asks

63 Human-Asking Robot Human Question Human asks Question Robot asks

64 Human-Asking Robot Human Question Human asks Yes, you ask a question.

65 Human-Asking Robot Human What book wrote Lee? Human asks Yes, you ask a question.

66 Human-Asking Robot Human What book wrote Lee? Human asks I have no sure information.

67 Human-Asking Robot Human Try to guess. Human asks I have no sure information.

68 Human-Asking Robot Human Try to guess. Human asks Lee wrote book “Flowers”.

69 Human-Asking Robot Human This is not true. Human asks Lee wrote book “Flowers”.

70 Human-Teaching Robot Human Questioning finished Human teaches “Questioning finished” Robot asks Human asks Thanks, I have a lesson Human ends questioning

71 Human-Teaching Robot Human Questioning finished Human teaches “Questioning finished” Robot asks Human asks Thanks, I have a lesson Robot enters asking mode What can I do for you?

72 Human-Teaching Robot Human Thanks, I have a lesson Human teaches “Questioning finished” Robot asks Human asks Thanks, I have a lesson Human starts teaching What can I do for you?

73 Human-Teaching Robot Human Thanks, I have a lesson Yes Human teaches

74 Human-Teaching Robot Human I give you question- answer pattern Yes Human teaches

75 Human-Teaching Robot Human Question pattern: What book Smith wrote? Yes Human teaches

76 Robot Human Answer pattern: Smith wrote book “Automata Theory” Yes Human teaches Human-Teaching

77 Human-Teaching Robot Human Checking question: What book wrote Smith? Yes Human teaches

78 Human-Teaching Robot Human Checking question: What book wrote Smith? Smith wrote book “Automata Theory” Human teaches

79 Human-Teaching Robot Human I give you question- answer pattern Yes Human teaches

80 Human-Teaching Robot Human Question pattern: Where is room of Lee? Yes Human teaches

81 Human-Teaching Robot Human Answer pattern: Lee is in room 332 Yes Human teaches

82 Human-Checking what robot learned Robot Human Lesson finished Human asks Question Robot asks Human teaches “Lesson finished”

83 Human-Checking what robot learned Robot Human Lesson finished Human asks Question Robot asks Human teaches “Lesson finished” What can I do for you?

84 Human-Checking what robot learned Robot Human Question Human asks Question Robot asks Human teaches “Lesson finished” What can I do for you?

85 Human-Asking Robot Human Question Human asks Question Robot asks Human teaches “Lesson finished” Yes, you ask a question.

86 Human-Asking Robot Human What book wrote Lee? Human asks Yes, you ask a question.

87 Human-Asking Robot Human What book wrote Lee? Human asks I have no sure information.

88 Human-Asking Robot Human Try to guess. Human asks I have no sure information.

89 Human-Asking Robot Human Try to guess. Human asks Lee wrote book “Automata Theory” Observe that robot found similarity between Smith and Lee and generalized (incorrectly)

90 Behavior, Dialog and Learning The dialog/behavior has the following components: –(1) Eliza-like natural language dialogs based on pattern matching and limited parsing. Commercial products like Memoni, Dog.Com, Heart, Alice, and Doctor all use this technology, very successfully – for instance Alice program won the 2001 Turing competition. –This is a “conversational” part of the robot brain, based on pattern-matching, parsing and black-board principles. –It is also a kind of “operating system” of the robot, which supervises other subroutines.

91 (2) Subroutines with logical data base and natural language parsing (CHAT). –This is the logical part of the brain used to find connections between places, timings and all kind of logical and relational reasonings, such as answering questions about Japanese geography. Behavior, Dialog and Learning

92 (3) Use of generalization and analogy in dialog on many levels. –Random and intentional linking of spoken language, sound effects and facial gestures. –Use of Constructive Induction approach to help generalization, analogy reasoning and probabilistic generations in verbal and non-verbal dialog, like learning when to smile or turn the head off the partner. Behavior, Dialog and Learning

93 (4) Model of the robot, model of the user, scenario of the situation, history of the dialog, all used in the conversation. (5) Use of word spotting in speech recognition rather than single word or continuous speech recognition. (6) Continuous speech recognition (Microsoft) (7) Avoidance of “I do not know”, “I do not understand” answers from the robot. –Our robot will have always something to say, in the worst case, over-generalized, with not valid analogies or even nonsensical and random. Behavior, Dialog and Learning

94 Constructive Induction

95 What is constructive induction? Constructive induction is a logic-based method of teaching a robot of new knowledge. It can be compared to neural networks. Teaching is constructing some structure of a logic function: –Decision tree –Sum of Products –Decomposed structue

96

97 Name (examples) Age (output) d SmileHeightHair Color Joan Kid (0) a(3)b(0)c(0) Mike Teenager (1) a(2)b(1)c(1) Peter Mid-age (2) a(1)b(2)c(2) Frank Old (3) a(0)b(3)c(3) Example “Age Recognition” Examples of data for learning, four people, given to the system

98 Smile - a Very often often moderately rarely Values 3210 Height - b Very Tall TallMiddleShort Values 3210 Color - c GreyBlackBrownBlonde Values 3210 Example “Age Recognition” Encoding of features, values of multiple-valued variables

99 Multi-valued Map for Data ab\ c0123 00---- 01---3 02---- 03---- 10---- 11---- 12--2- 13---- 20---- 21-1-- 22---- 23---- 300--- 31---- 32---- 33---- d = F( a, b, c ) ab\ c0123 00---- 01---3 02---- 03---- 10---- 11---- 12--2- 13---- 20---- 21-1-- 22---- 23---- 300--- 31---- 32---- 33---- Groups show a simple induction from the Data

100 Old people smile rarely ab\ c0123 00---- 01---3 02---- 03---- 10---- 11---- 12--2- 13---- 20---- 21-1-- 22---- 23---- 300--- 31---- 32---- 33---- Groups show a simple induction from the Data Middle-age people smile moderately Teenagers smile often Children smile very often Grey hair blonde hair

101 Another example: teaching movements Input variables Output variables

102 Generalization of the Ashenhurst- Curtis decomposition model

103 This kind of tables known from Rough Sets, Decision Trees, etc Data Mining

104 Decomposition is hierarchical At every step many decompositions exist Which decomposition is better? Original table First variant of decomposition Second variant

105 Constructive Induction: Technical Details U. Wong and M. Perkowski, A New Approach to Robot’s Imitation of Behaviors by Decomposition of Multiple-Valued Relations, Proc. 5 th Intern. Workshop on Boolean Problems, Freiberg, Germany, Sept. 19-20, 2002, pp. 265-270. A. Mishchenko, B. Steinbach and M. Perkowski, An Algorithm for Bi-Decomposition of Logic Functions, Proc. DAC 2001, June 18-22, Las Vegas, pp. 103-108. A. Mishchenko, B. Steinbach and M. Perkowski, Bi- Decomposition of Multi-Valued Relations, Proc. 10 th IWLS, pp. 35-40, Granlibakken, CA, June 12-15, 2001. IEEE Computer Society and ACM SIGDA.

106 Decision Trees, Ashenhurst/Curtis hierarchical decomposition and Bi-Decomposition algorithms are used in our software These methods create our subset of MVSIS system developed under Prof. Robert Brayton at University of California at Berkeley [2]. – The entire MVSIS system can be also used. The system generates robot’s behaviors (C program codes) from examples given by the users. This method is used for embedded system design, but we use it specifically for robot interaction. Constructive Induction

107 Ashenhurst Functional Decomposition Evaluates the data function and attempts to decompose into simpler functions. if A  B = , it is disjoint decomposition if A  B  , it is non-disjoint decomposition B - bound set A - free set F(X) = H( G(B), A ), X = A  B X

108 A Standard Map of function ‘z’ Bound Set Free Set a b \ c z Columns 0 and 1 and columns 0 and 2 are compatible column compatibility = 2 Explain the concept of generalized don’t cares

109 NEW Decomposition of Multi- Valued Relations if A  B = , it is disjoint decomposition if A  B  , it is non-disjoint decomposition F(X) = H( G(B), A ), X = A  B Relation A B X

110 Forming a CCG from a K-Map z Bound Set Free Set a b \ c Columns 0 and 1 and columns 0 and 2 are compatible column compatibility index = 2 C1C1 C2C2 C0C0 Column Compatibility Graph

111 Forming a CIG from a K-Map Columns 1 and 2 are incompatible chromatic number = 2 z a b \ c C1C1 C2C2 C0C0 Column Incompatibility Graph

112 A unified internal language is used to describe behaviors in which text generation and facial gestures are unified. This language is for learned behaviors. Expressions (programs) in this language are either created by humans or induced automatically from examples given by trainers. Constructive Induction

113 Conclusion. What did we learn (1) the more degrees of freedom the better the animation realism. Art and interesting behavior above certain threshold of complexity. (2) synchronization of spoken text and head (especially jaw) movements are important but difficult. Each robot is very different. (3) gestures and speech intonation of the head should be slightly exaggerated – superrealism, not realism.

114 Conclusion. What did we learn(cont) (4) Noise of servos: –the sound should be laud to cover noises coming from motors and gears and for a better theatrical effect. –noise of servos can be also reduced by appropriate animation and synchronization. (5) TTS should be enhanced with some new sound-generating system. What? (6) best available ATR and TTS packages should be applied. (7) OpenCV from Intel is excellent. (8) use puppet theatre experiences. We need artists. The weakness of technology can become the strength of the art in hands of an artist.

115 (9) because of a too slow learning, improved parameterized learning methods should be developed, but also based on constructive induction. (10) open question: funny versus beautiful. (11) either high quality voice recognition from headset or low quality in noisy room. YOU CANNOT HAVE BOTH WITH CURRENT ATR TOOLS. (12) low reliability of the latex skins and this entire technology is an issue. Conclusion. What did we learn(cont)

116 We won an award in PDXBOT 2004. We showed our robots to several audiences International Intel Science Talent Competition and PDXBOT 2004, 2005 Robot shows are exciting Our Goal is to build toys for 21-st Century and in this process, change the way how engineers are educated.

117 What to remember? Robot as a mapping from inputs to outputs Braitenberg Vehicles State machines, grammars and probabilistic state machines Natural language conversation with a robot Image processing for a interactive robot. Constructive induction for behavior and language acquisition.

118 Projects: Project 1 –Lego NXT. 2 people. Editor for state-machine and probabilistic state machine base robot behavior of mobile robots with sensors. Project 2 –Vision for KHR-1 robot; Immitation. 2 people. Matthias Sunardi – group leader. Project 3 –Head design for a humanoid robot

119 Projects: Project 4 –Leg design for a humanoid robot Project 5 –Hand design for a humanoid robot Project 6 –EyeSim simulator – no robot needed. Project 7 –Conversation with a humanoid robot (dialog and speech).

120 Projects: Project 8 –Editor for an animatronic robot theatre Project 9 – Quantum-Computer Controlled Robot Project 10 – Project 11 –


Download ppt "Towards Robot Theatre Marek Perkowski Department of Electrical and Computer Engineering, Portland State University, Portland, Oregon, 97207-0751."

Similar presentations


Ads by Google