Animating Virtual Humans in Intelligent Multimedia Storytelling Minhua Eunice Ma and Paul Mc Kevitt School of Computing and Intelligent Systems Faculty.

Slides:



Advertisements
Similar presentations
HOMER: A Creative Story Generation System Student: Dimitrios N. Konstantinou Supervisor: Prof. Paul Mc Kevitt School of Computing and Intelligent Systems.
Advertisements

1Notes  Assignment 0 marks should be ready by tonight (hand back in class on Monday)
Managing Data Resources
EE442—Multimedia Networking Jane Dong California State University, Los Angeles.
KAIST CS780 Topics in Interactive Computer Graphics : Crowd Simulation A Task Definition Language for Virtual Agents WSCG’03 Spyros Vosinakis, Themis Panayiotopoulos.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 8 Slide 1 System models.
Multimedia Search and Retrieval: New Concepts, System Implementation, and Application Qian Huang, Atul Puri, Zhu Liu IEEE TRANSACTION ON CIRCUITS AND SYSTEMS.
WMES3103 : INFORMATION RETRIEVAL
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
© De Montfort University, D Graphics and VRML Howell Istance and Chris Hand* De Montfort University * now at
A Standard Humanoid Representation on The Web: VRML/X3D Humanoid Animation Working Group (H-anim) Che-Jen Chen May/30/2003.
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
Smart Learning Services Based on Smart Cloud Computing
Programming Tools and Applications. Programming Tools 3D systems – Maya – Blender – Unity – Ogre3D Libraries – OpenGL – Direct3D.
CONFUCIUS: An Intelligent MultiMedia Storytelling Interpretation and Presentation System Minhua Eunice Ma Supervisor: Prof. Paul Mc Kevitt School of Computing.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
VRML Dr. Alun Moon What is VRML The Virtual Reality Modeling Language (VRML) is a file format for describing interactive 3D objects.
GUI: Specifying Complete User Interaction Soft computing Laboratory Yonsei University October 25, 2004.
1 Darmstadt, October 02, 2007 Amalia Ortiz Asociación VICOMTech Mikeletegi Pasealekua Donostia - San Sebastián (Gipuzkoa)
Chapter 4 System Models A description of the various models that can be used to specify software systems.
DEVSView: A DEVS Visualization Tool Wilson Venhola.
System models Abstract descriptions of systems whose requirements are being analysed Abstract descriptions of systems whose requirements are being analysed.
1. 2 Purpose of This Presentation ◆ To explain how spacecraft can be virtualized by using a standard modeling method; ◆ To introduce the basic concept.
Building character animation for intelligent storytelling with the H-Anim standard Minhua Eunice Ma and Paul Mc Kevitt School of Computing and Intelligent.
Digital Multimedia, 2nd edition Nigel Chapman & Jenny Chapman Chapter 8 This presentation © 2004, MacAvon Media Productions Animation.
Parser-Driven Games Tool programming © Allan C. Milne Abertay University v
APML, a Markup Language for Believable Behavior Generation Soft computing Laboratory Yonsei University October 25, 2004.
SceneMaker: Multimodal Visualisation of Natural Language Film Scripts Dr. Minhua Eunice Ma School of Computing & Intelligent Systems Faculty of Computing.
© 2005 Prentice Hall, Decision Support Systems and Intelligent Systems, 7th Edition, Turban, Aronson, and Liang 5-1 Chapter 5 Business Intelligence: Data.
PETRA – the Personal Embedded Translation and Reading Assistant Werner Winiwarter University of Vienna InSTIL/ICALL Symposium 2004 June 17-19, 2004.
By NIST/ITL/IAD, Mike Rubinfeld, January 16, 2002 Page 1 L3 Overview L3 Standards Overview By Mike Rubinfeld Chairman, INCITS/L3 (MPEG & JPEG) NIST, Gaithersburg,
Temporal Relations in Visual Semantics of Verbs Minhua Eunice Ma and Paul Mc Kevitt School of Computing and Intelligent Systems Faculty of Engineering.
CONFUCIUS: an Intelligent MultiMedia storytelling interpretation & presentation system Minhua Eunice Ma Supervisor: Prof. Paul Mc Kevitt School of Computing.
1 Mpeg-4 Overview Gerhard Roth. 2 Overview Much more general than all previous mpegs –standard finished in the last two years standardized ways to support:
SceneMaker: Automatic Visualisation of Screenplays School of Computing & Intelligent Systems Faculty of Computing & Engineering University of Ulster, Magee,
Chapter 7 System models.
System models l Abstract descriptions of systems whose requirements are being analysed.
Pertemuan 19 PEMODELAN SISTEM Matakuliah: D0174/ Pemodelan Sistem dan Simulasi Tahun: Tahun 2009.
Software Engineering, 8th edition Chapter 8 1 Courtesy: ©Ian Somerville 2006 April 06 th, 2009 Lecture # 13 System models.
Sommerville 2004,Mejia-Alvarez 2009Software Engineering, 7th edition. Chapter 8 Slide 1 System models.
Introduction to Flash Animation CS 318. Topics Introduction to Flash and animation The Flash development environment Creating Flash animations  Layers.
A Multi-agent Approach for the Integration of the Graphical and Intelligent Components of a Virtual Environment Rui Prada INESC-ID.
Database Environment Chapter 2. Data Independence Sometimes the way data are physically organized depends on the requirements of the application. Result:
Toward a Unified Scripting Language 1 Toward a Unified Scripting Language : Lessons Learned from Developing CML and AML Soft computing Laboratory Yonsei.
1 1. Representing and Parameterizing Agent Behaviors Jan Allbeck and Norm Badler 연세대학교 컴퓨터과학과 로봇 공학 특강 학기 유 지 오.
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
1 CS145 Lecture 26 What’s next?. 2 What software questions do we study? Where is software headed?
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Intelligent MultiMedia Storytelling System (IMSS) - Automatic Generation of Animation From Natural Language Input By Eunice Ma Supervisor: Prof. Paul Mc.
PGNET, Liverpool JMU, June 2005 MediaHub: An Intelligent MultiMedia Distributed Platform Hub Glenn Campbell, Tom Lunney, Paul Mc Kevitt School of Computing.
WEB 3D – A PEEK UNDER THE HOOD Keerti Joshi. W HAT ’ S THE FUTURE ? Web 3.0 = ?? The Web ain't just for 2D any more !
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
IMSTD:Intelligent Multimedia System for teaching Databases By : NAZLIA OMAR Supervisors: Prof. Paul Mc Kevitt Dr. Paul Hanna School of Computing and Mathematical.
1cs426-winter-2008 Notes. 2 Kinematics  The study of how things move  Usually boils down to describing the motion of articulated rigid figures Things.
 To explain why the context of a system should be modelled as part of the RE process  To describe behavioural modelling, data modelling and object modelling.
MULTIMEDIA DATA MODELS AND AUTHORING
VIRTUAL REALITY (VR) INTRODUCTION AND BASIC APPLICATIONS الواقع الافتراضي : مقدمة وتطبيقات Dr. Naji Shukri Alzaza Assist. Prof. of Mobile technology Dean.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
Armand Rousso describe the X3D Technology
Unity 3D Rolfe Bozier 24-Apr-2017
Advanced Computer Systems
MPEG-4 Binary Information for Scenes (BIFS)
Visual Information Retrieval
Web Ontology Language for Service (OWL-S)
Multimedia Content Description Interface
Prepared by: Engr . Syed Atir Iftikhar
Presentation transcript:

Animating Virtual Humans in Intelligent Multimedia Storytelling Minhua Eunice Ma and Paul Mc Kevitt School of Computing and Intelligent Systems Faculty of Engineering University of Ulster, Magee Derry, Northern Ireland

PGNet 2005 Liverpool, 28 June 2005 Outline  State-of-the-art virtual human animation standards VRML/X3D & MPEG-4 for object modelling H-Anim & MPEG-4 SNHC for humanoid modelling VHML & STEP for human animation modelling Natural language to 3D animation  Language visualisation (animation) in intelligent multimodal storytelling system, CONFUCIUS Humanoid animation in CONFUCIUS Multiple animation channels Space sites of virtual humans Virtual object manipulation  Conclusion & future work

PGNet 2005 Liverpool, 28 June 2005 Four levels of virtual human representation VRML (X3D) H-Anim VHML (BAML) XML-based STEP script-based Level 1 3D object modelling Level 2 3D human modelling Level 3 Human animation modelling high level animation low level animation MPEG-4 CONFUCIUS Level 4 Natural language to animation AnimNL MPEG-4 SNHC Current virtual human representation languages can be classified to four groups according to the levels of abstraction, starting from 3D geometry modelling to language animation.

PGNet 2005 Liverpool, 28 June 2005 Level 1: 3D object modelling VRML (X3D) H-Anim VHML (BAML) XML-based STEP script-based Level 1 3D object modelling Level 2 3D human modelling Level 3 Human animation modelling high level animation low level animation MPEG-4 CONFUCIUS Level 4 Natural language to animation AnimNL MPEG-4 SNHC  VRML (Virtual Reality Modelling Language) is a hierarchical scene description language that defines the geometry and behaviour of a 3D scene. X3D is the successor to VRML.  MPEG-4 uses BIFS (Binary Format for Scenes) for real-time streaming. BIFS borrows many concepts from VRML. BIFS and VRML can be seen as different representations of the same data.

PGNet 2005 Liverpool, 28 June 2005 VRML (X3D) H-Anim VHML (BAML) XML-based STEP script-based Level 1 3D object modelling Level 2 3D human modelling Level 3 Human animation modelling high level animation low level animation MPEG-4 CONFUCIUS Level 4 Natural language to animation AnimNL MPEG-4 SNHC  H-Anim is a stardard VRML97 representation for humanoids. It defines standard human Joints articulation, segments dimensions, and sites for “end effector” and attachment points for clothing.  MPEG-4 SNHC (Synthetic/Natural Hybrid Coding) incorporates H-Anim and provides an efficient way to animate virtual human and tools for the efficient compression of the animation parameters associated with the H-Anim human model. Level 2: 3D human modelling

PGNet 2005 Liverpool, 28 June 2005 H-Anim joint-segment hierarchy  An H-Anim file contains a joint- segment hierarchy.  Each joint node may contain other joint nodes and a segment node that describes the body part associated with the joint.  Each segment is a normal VRML transform node describing the body part's geometry and texture.  H-Anim humanoids can be animated using keyframing, inverse kinematics, & other animation techniques.

PGNet 2005 Liverpool, 28 June 2005 H-Anim models on the Web Virtual human models Nancy 1 Baxter, Nana 2 Y.T., Hiro 3 Dilbert 3 Max 3 Jake 3 Dork 4 AuthorsCindy Ballreich Christian Babski Matt Beitler Michael Miller URLs:

PGNet 2005 Liverpool, 28 June 2005 VRML (X3D) H-Anim VHML (BAML) XML-based STEP script-based Level 1 3D object modelling Level 2 3D human modelling Level 3 Human animation modelling high level animation low level animation MPEG-4 CONFUCIUS Level 4 Natural language to animation AnimNL MPEG-4 SNHC  VHML (Virtual Human Mark-up Language) is an XML-based language which provides an intuitive way to define virtual human animation. It is composed of several sub-languages: DMML, FAML, BAML, SML, and EML.  STEP is a scripting language for human actions. It has a Prolog-like syntax, which makes it compatible with most standard logic programming languages. Level 3: Human animation modelling

PGNet 2005 Liverpool, 28 June 2005 VHML & STEP examples Standing on my knees I beg you pardon A.A VHML example script(walk_forward_step(Agent),ActionList):- ActionList=[parallel( [script_action( walk_pose(Agent), move(Agent,front,fast) ])]. B. A STEP example

PGNet 2005 Liverpool, 28 June 2005 VRML (X3D) H-Anim VHML (BAML) XML-based STEP script-based Level 1 3D object modelling Level 2 3D human modelling Level 3 Human animation modelling high level animation low level animation MPEG-4 CONFUCIUS Level 4 Natural language to animation AnimNL MPEG-4 SNHC  High level animation applications converting natural language to virtual human animation. Little research on virtual human animation focuses on this level.  The AnimNL project aims to enable people to use natural language instructions to tell virtual humans what to do  CONFUCIUS also deals with language animation  Research on this level will lead to powerful web-based applications Level 4: Natural language to animation

PGNet 2005 Liverpool, 28 June 2005 Architecture of CONFUCIUS 3D authoring tools existing 3D models & virtual human models Visual/audio knowledge (3D models & animations, audio encapsulated in graphic models) Knowledge base Surface transformer Media allocator Natural Language Processing Text-to- Speech Animation engine (with nonspeech audio) Synchronizing 3D virtual world with speech in VRML Natural language sentences Language knowledge (WordNet, LCS database, FDG parser) mapping semantic representation Presentation agent (Merlin the Narrator) Narration integration Multimodal presentation

PGNet 2005 Liverpool, 28 June 2005 match basic motions in library? User interaction animation controller environment placement N Y VRML file of the virtual story world Motion instantiation Either loading a precreated keyframe animation or providing animation specification for animation generation Semantic Representation If the event predicate matches basic human motions in animation library Apply spatial info & place OBJ/HUMAN into a specified environment Camera controller Automatic camera placement & apply cinematic rules Humanoid animation in CONFUCIUS

PGNet 2005 Liverpool, 28 June 2005 Multiple animation channels  3rd level human animation modeling languages (VHML, STEP) provide a facility to specify both sequential and parallel temporal relations  Simultaneous animations cause the Dining Philosopher's problem for higher level animation using predefined animation data (multiple animations may request to access same body parts at the same time)  Multiple animation channels allow characters to run multiple animations at the same time, e.g. walking with the lower body while waving with the upper body  Multiple animation channels often disable one channel when a specific animation is playing on another channel to avoid conflicts with another animation Involved joints /Animationssacroiliacl_hipr_hip…r_shoulder walk222…1 jump222…1 wave000…2 run222…1 scratch head000…2 sit222…1 ………………

PGNet 2005 Liverpool, 28 June 2005 Space sites of virtual humans  Types of virtual objects Small props, manipulated by hands or feet, e.g. cup, hat, ball Big props, source or targets of actions, e.g. table, chair, tree Stage props have internal structure, e.g. house, restaurant, chapel  Site tags of virtual humans Manipulating small props, 6 sites on hands (three sites for each hand), one site on head (skull_tip), one site for each foot tip For big props placement, 5 sites indicating five directions around the human body: x_front, x_back, x_left, x_right, x_bottom. Big props like a table or chairs usually placed at these positions. For stage props setting, 5 more space tags indicating further places: far_front, far_back, far_left, far_right, far_top. Stage props (e.g. a house) often locate at these far sites. grip, pincer grip pushing pointing

PGNet 2005 Liverpool, 28 June 2005 Virtual object manipulation 1.Store applicable objects in the animation file of an action and using lexical knowledge of nouns to infer hypernymy relations between objects 2.Including the manipulation hand postures and movements within the object description, besides its intrinsic object properties. These objects have the ability to describe in details their functionality and their possible interactions with virtual humans. 4 stored hand postures for interacting with 3D objects index pointing (press a button) grip (hold cup handle, knob, a bottle) pincer grip (use thumb and index finger to pick up small objects) palm push (push a piece of furniture) Two approaches to organize knowledge required for successful grasping

PGNet 2005 Liverpool, 28 June 2005 Conclusion  Classified virtual human representation languages into four levels of abstraction  CONFUCIUS is an overall framework of intelligent multimedia storytelling, using 3D modelling/animation techniques with natural language understanding technologies to achieve higher level virtual human animation  A number of projects are currently based on virtual human animation, working on various application domains. Few of them takes modern NLP approach that a high level human animation system should be based on.  The value of CONFUCIUS lies in generation of 3D animation from natural language by automating the processes of language parsing, semantic representation and animation production.  Potential application areas: computer games, animation production and direction, multimedia presentation, shared virtual worlds  Future work: coordination & synchronization of multiple virtual humans