1 Chinese-Speaking 3D Talking Head Project No: H08040 Sang Siew Hoon Supervisor: Dr Ng Teck Khim.

Slides:

Advertisements

Similar presentations

Short introduction to the use of PEARL General properties First tier assessments Higher tier assessments Before looking at first and higher tier assessments,

Advertisements

Poltys CA Outbound Dialer Module Training Presentation.

Usage of the memoQ web service API by LSP – a case study

Overview This session is aimed at both PeopleSoft Financials users and Security Administrators. We will discuss plans for the 9.2 upgrade including.

STATEMATE A Working Environment for the Development of Complex Reactive Systems.

SIUE DENTAL SCHOOL VIRTUAL ENVIRONMENT BY STEVE KLAAS SIUE-GEOG 421: DIGITAL ELEVATION MODELING DR. SHUNFU HU, FALL 2013.

Designing Facial Animation For Speaking Persian Language Hadi Rahimzadeh June 2005.

Automatic Lip- Synchronization Using Linear Prediction of Speech Christopher Kohnert SK Semwal University of Colorado, Colorado Springs.

Page 1 Integrating Multiple Data Sources using a Standardized XML Dictionary Ramon Lawrence Integrating Multiple Data Sources using a Standardized XML.

Video Rewrite Driving Visual Speech with Audio Christoph Bregler Michele Covell Malcolm Slaney Presenter : Jack jeryes 3/3/2008.

LYU0603 A Generic Real-Time Facial Expression Modelling System Supervisor: Prof. Michael R. Lyu Group Member: Cheung Ka Shun ( ) Wong Chi Kin ( )

Create Photo-Realistic Talking Face Changbo Hu * This work was done during visiting Microsoft Research China with Baining Guo and Bo Zhang.

MPEG-4, NETWORKED MULTIMEDIA STANDARD

Lip Feature Extraction Using Red Exclusion Trent W. Lewis and David M.W. Powers Flinders University of SA VIP2000.

1 Expression Cloning Jung-yong Noh Ulrich Neumann Siggraph01.

Principles of Information Systems, Sixth Edition 1 Systems Investigation and Analysis Chapter 12.

MSIS 110: Introduction to Computers; Instructor: S. Mathiyalakan1 Systems Investigation and Analysis Chapter 12.

Sunee Holland University of South Australia School of Computer and Information Science Supervisor: Dr G Stewart Von Itzstein.

1 Newspaper Digitisation Workflows Rose Holley- Manager ANDP Presentation to Cultural Heritage Digitisation professionals 26 November 2008.

1 Australian Newspapers Digitisation Program Development of the Newspapers Content Management System Rose Holley – ANDP Manager ANPlan/ANDP Workshop, 28.

Facial Animation By: Shahzad Malik CSC2529 Presentation March 5, 2003.

The Camera Mouse: Visual Tracking of Body Features to Provide Computer Access for People With Severe Disabilities.

Humanoid Robot Head May Team Members: Client/Faculty Advisor: Dan Potratz (CprE) Tim Meer (EE) Dr. Alex Stoytchev Cody Genkinger (CprE) Jason Pollard.

Helsinki University of Technology Laboratory of Computational Engineering Modeling facial expressions for Finnish talking head Michael Frydrych, LCE,

Electronic visualization laboratory, university of illinois at chicago Designing an Expressive Avatar of a Real Person 9/20/2010 Sangyoon Lee, Gordon Carlson,

Eyes Alive Sooha Park - Lee Jeremy B. Badler - Norman I. Badler University of Pennsylvania - The Smith-Kettlewell Eye Research Institute Presentation Prepared.

99ATS Turbocharge your Hiring Process !!. ON TARGET Solution offered by 99ATS Overview Introduction Gaps in Recruitment Process Screenshot overview of.

Face Animation Overview with Shameless Bias Toward MPEG-4 Face Animation Tools Dr. Eric Petajan Chief Scientist and Founder face2face animation, inc.

New Features in Release 9.2 (July 27, 2009). 2 Release 9.2 New Features Updated Shopping Experience Home/Shop page Shop at the top search New Hosted Supplier.

Chapter 7. BEAT: the Behavior Expression Animation Toolkit

Three Topics Facial Animation 2D Animated Mesh MPEG-4 Audio.

APML, a Markup Language for Believable Behavior Generation Soft computing Laboratory Yonsei University October 25, 2004.

Zavod za telekomunikacije Igor S. Pandžić Department of telecommunications Faculty of electrical engineering and computing University of Zagreb, Croatia.

4Focus Remote Control & HUD App for AR.Drone & Windows 8.

1 Mpeg-4 Overview Gerhard Roth. 2 Overview Much more general than all previous mpegs –standard finished in the last two years standardized ways to support:

November 15, Already there, or soon: multiple scenes on same window, save projects across sessions. 2.2D graphs and dynamic bar charts (User specifies.

Realistic Modeling of Animatable Faces in MPEG-4 Marco Fratarcangeli and Marco Schaerf University of Rome “La Sapienza”

S The European Up-Front Risk Assessment Tool (EUFRAT) The European Up-Front Risk Assessment Tool (EUFRAT) (EUFRAT)

Lecture 15 – Social ‘Robots’. Lecture outline This week Selecting interfaces for robots. Personal robotics Chatbots AIML.

Presented by Matthew Cook INFO410 & INFO350 S INFORMATION SCIENCE Paper Discussion: Dynamic 3D Avatar Creation from Hand-held Video Input Paper Discussion:

Principles of Information Systems, Sixth Edition Systems Investigation and Analysis Chapter 12.

1 Reconstructing head models from photograph for individualized 3D-audio processing Matteo Dellepiane, Nico Pietroni, Nicolas Tsingos, Manuel Asselot,

MIRALab Where Research means Creativity SVG Open 2005 University of Geneva 1 Converting 3D Facial Animation with Gouraud shaded SVG A method.

Toward a Unified Scripting Language 1 Toward a Unified Scripting Language : Lessons Learned from Developing CML and AML Soft computing Laboratory Yonsei.

Software quality factors

Feedback Elisabetta Bevacqua, Dirk Heylen,, Catherine Pelachaud, Isabella Poggi, Marc Schröder.

Principles of Information Systems, Sixth Edition Systems Investigation and Analysis Chapter 12.

ITER- TBM Planning and Costing Activity DCLL TBM Mechanical Design ( ) & TBM-Port Interface ( ) Presented by Mo Dagher December

Supervisor: Dr. Elsayed Eissa Hemayed. o Marwa Ibrahim Lamey. Mayada Ibrahim Aly. o Mona Sherif Ahmed. o Suad Mohamed Barakat. o Marwa Ibrahim Lamey.

Animated Speech Therapist for Individuals with Parkinson Disease Supported by the Coleman Institute for Cognitive Disabilities J. Yan, L. Ramig and R.

Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.

Introduction to BIM Module 07 – Materials, Lights, and Rendering.

Field Effect Transistors (2)

Facial Motion Cloning Using Global Shape Deformation Marco Fratarcangeli and Marco Schaerf University of Rome “La Sapienza”

Change Blindness Images Li-Qian Ma 1, Kun Xu 1, Tien-Tsin Wong 2, Bi-Ye Jiang 1, Shi-Min Hu 1 1 Tsinghua University 2 The Chinese University of Hong Kong.

Virtual Tutor Application v1.0 Ruth Agada Dr. Jie Yan Bowie State University Computer Science Department.

Expressionbot: An Emotive Lifelike Robotic Face for Face-to- Face Communication Ali Mollahossenini, Gabriel Gairzer, Eric Borts, Stephen Conyers, Richard.

Facial Expression Analysis Theoretical Results –Low-level and mid-level segmentation –High-level feature extraction for expression analysis (FACS – MPEG4.

An Emotive Lifelike Robotics Face for Face-to-Face Communication

MikeTalk:An Adaptive Man-Machine Interface

Radio Propagation Simulation Based on Automatic 3D Environment Reconstruction D. He A novel method to simulate radio propagation is presented. The method.

Presents: Rally To Java Conversion Suite

Two extensions of our text-to-visual-speech (TTVS) system:

Multimodal Caricatural Mirror

Project #2 Multimodal Caricatural Mirror Intermediate report

Department Supervisor Location

EP SELF RM TERMINATED STEP UP TO COST SAVINGS SPACE SAVINGS

End-to-End Speech-Driven Facial Animation with Temporal GANs

Microsoft Dynamics CRM Record Cloning

Presentation transcript:

1 Chinese-Speaking 3D Talking Head Project No: H08040 Sang Siew Hoon Supervisor: Dr Ng Teck Khim

2 System Objectives create a realistic Chinese-speaking 3D talking model create a realistic Chinese-speaking 3D talking model -automatic -accurate and realistic -portable

3 Problems Lack of references on Chinese Lack of references on Chinese Involves various fields Involves various fields -Face modeling -Text-to-speech -Face Animation (Focus)

4 System Overview 5 modules 1. Face Model Preparation 2. Chinese Visemes 3. Text-to-Speech (TTS) 4. Animation 5. Rendering

5

6 Face Model Preparation

7 Face Modeling 3DS Max Face Modeling 3DS Max

8 Face Model Preparation Defining MPEG-4 feature points Defining MPEG-4 feature points

9 Face Model Preparation Assigning lip vertices Assigning lip vertices

10 Face Model Preparation Cloning morph targets Cloning morph targets FAP#3 open_jawFAP#4 lower_t_midlip

11 Chinese Visemes Visemes - the visual equivalent of a phoneme 1. Classification of Chinese Syllables 2. Definition of Chinese Visemes 3. Refining Dynamic Visemes

12 Chinese Visemes Definition of Chinese Visemes Definition of Chinese Visemes

13 Chinese Visemes Definition of Chinese Visemes Definition of Chinese Visemes

14 Chinese Visemes Definition of Chinese Visemes Definition of Chinese Visemes

15 Chinese Visemes Refining Dynamic Visemes Refining Dynamic Visemes 1. + e only  replace er 2. zh, z, y + I  drop i 3. d, l, g, j, zh, z, y, w + u | ü  drop 3. d, l, g, j, zh, z, y, w + u | ü  drop 4. j, y + ending with an  replace the an by en 5. j + complex finals headed by i and followed by more than one  drop i in complex finals

16 Text-to-Speech

17 Text-to-Speech

18 Animation 1. Coarticulation 2. Automatic Generation of Face Animation 3. Enhancing Realism

19 Animation Enhancing realism Enhancing realism - Eye blinks - Eyebrow raising - Gaze - Head Rotation

20 Results Accurate and realistic Accurate and realistic No discrepancies for new face models No discrepancies for new face models Automated results need slight manual intervention Automated results need slight manual interventionhttp://

21 Conclusion Contributions Contributions 1. own system to define Chinese visemes - adds to the current research works of Chinese talking heads (currently limited). Chinese talking heads (currently limited). 2. an automatic Chinese lip-synchronization system - saves an animator much time and effort

22 Conclusion System Limitations System Limitations 1. TTS limitation 2. FAPs conflicts 3. Not integrated – inconvenient for untrained users

23 Conclusion Future Works Future Works 1. Automatic audio signal processing 2. Resolve FAPs values conflicts 3. Automatic feature points assignment system 4. Integration of system