1 +86-551-5331800 +86-551-5331801 TEL: FAX: WEBSITE: © 2002 iFLYTEK. All rights reserved. This presentation is for informational.

Slides:



Advertisements
Similar presentations
Stage 2: The GOA Tool.
Advertisements

MSE602 ENGINEERING INNOVATION MANAGEMENT. TECHNOLOGY INNOVATION TECHNOLOGICAL INNOVATION PROCESSES APPROACHES TO INNOVATION CONCEPTS.
Cisco Confidential © 2012 Cisco and/or its affiliates. All rights reserved. 1 Cisco Partner Plus: Premium Enablement Accelerate Your Competitive Edge.
Ms. Carolyn Zhang’s Mandarin Class
MARKETING MANAGEMENT.
New Technology Environment Technology as a Strategic Asset Tom Lehman Lehman Associates, LLC Lehman Reports Association TRENDS Live September, 2014.
Speech Synthesis Markup Language V1.0 (SSML) W3C Recommendation on September 7, 2004 SSML is an XML application designed to control aspects of synthesized.
Speech Synthesis Markup Language SSML. Introduced in September 2004 XML based Assists the generation of synthetic speech Specifies the way speech is outputted.
The Living Literacy Framework and the E&I Literacy Action Plan Valerie Neaves Alberta Works Programs Alberta Asset Building Collaborative March 17, 2011.
Framing the Market Opportunity
HOLLOWAY CONSULTING. Class Announcements  Service Learning Assignment:  Progress Report should be completed one week after initial meeting with the.
©2007 by the McGraw-Hill Companies, Inc. All rights reserved. 2/e PPTPPT.
ObjectivesObjectives 1.A definition of planning and an understanding of the purposes of planning 2.Insights into how the major steps of the planning.
Unit 6 Teaching Pronunciation
MULTI LINGUAL ISSUES IN SPEECH SYNTHESIS AND RECOGNITION IN INDIAN LANGUAGES NIXON PATEL Bhrigus Inc Multilingual & International Speech.
© Prentice Hall, © Prentice Hall, ObjectivesObjectives 1.A definition of planning and an understanding of the purposes of planning.
Software Factory Assembling Applications with Models, Patterns, Frameworks and Tools Anna Liu Senior Architect Advisor Microsoft Australia.
Introduction to Business Analysis (and Competitive Strategy) Dr. Theodore H. K. Clark Associate Professor and Academic Director of MSc in Information Systems.
Chapter three Phonology
Chapter 15 Speech Synthesis Principles 15.1 History of Speech Synthesis 15.2 Categories of Speech Synthesis 15.3 Chinese Speech Synthesis 15.4 Speech Generation.
1 Speech synthesis 2 What is the task? –Generating natural sounding speech on the fly, usually from text What are the main difficulties? –What to say.
Marketing Management Chapter 1.
Google Online Marketing Challenge (GOMC)
Speech Synthesis Markup Language -----Aim at Extension Dr. Jianhua Tao National Laboratory of Pattern Recognition (NLPR) Institute of Automation, Chinese.
Microsoft Visual Basic 2012 CHAPTER ONE Introduction to Visual Basic 2012 Programming.
Study of Chinese Seed Project Informationization Shang Shuqi Qingdao Agricultural University Shandong,China Tel
Toshiba (China) R&D Center LOU Xiaoyan, LI Jian Research and Development Center, Toshiba China Suggestions on Tone and Word Boundary of Mandarin for SSML.
Data-driven approach to rapid prototyping Xhosa speech synthesis Albert Visagie Justus Roux Centre for Language and Speech Technology Stellenbosch University.
Guide to the Software Engineering Body of Knowledge Chapter 1 - Introduction.
Communicative Language Teaching Vocabulary
STANDARDIZATION OF SPEECH CORPUS Li Ai-jun, Yin Zhi-gang Phonetics Laboratory, Institute of Linguistics, Chinese Academy of Social Sciences.
How IPA is Used in SSML and PLS Paolo Baggia, Loquendo Wed. August 9 th, 2006.
4.03 Perform pre-sales activities to facilitate sales presentation.
Conversational Applications Workshop Introduction Jim Larson.
The development of Chinese characters
© 2012 Pearson Prentice Hall. All rights reserved. Strategy, Balanced Scorecard.
Working group meeting January Time sheets Accounting Topic sheets Handouts Quality plan Anything else? Topics for consideration.
Exploring XML-based Technologies and Procedures for Quality Evaluation from a Real-life Case Perspective Folkert de Vriend 1 & Giulio Maltese 2 1 Speech.
Korea Maritime and Ocean University NLP Jung Tae LEE
Welcome to Ms. Carolyn Zhang’s Mandarin Class. Middle School The Middle School years work as a bridge when choice is slowly introduced into the system,
Overview of CSSML Yan Jun, Department Manager Anhui USTC iFLYTEK Co., Ltd University of Science & Tech of China.
PETRA – the Personal Embedded Translation and Reading Assistant Werner Winiwarter University of Vienna InSTIL/ICALL Symposium 2004 June 17-19, 2004.
Introducing New Market Offerings. Managing New-Product Development Successful new product development should be: Customer-centered Team-centered Systematic.
Chapter 1 Management accounting: information for creating value and managing resources Copyright  2009 McGraw-Hill Australia Pty Ltd PowerPoint Slides.
Welcome to Ms. Carolyn Zhang’s Mandarin Class. Self-Introduction Education Background (English B.A & Education M.D Three years in Lower School The relationship.
Copyright  2003 McGraw-Hill Australia Pty Ltd, PPTs t/a Management Accounting: An Australian Perspective 3/e by Langfield-Smith, Thorne & Hilton Slides.
Rutgers Multimedia Chinese Teaching System (RMCTS) MERLOT International Conference, August 7-10, 2008.
A Study of Taiwanese High School Students' Production and Perception Performance in English Non-High Front Vowels Graduate Student: Wan-chun Tseng Advisor:
Introductions. One purpose of the introduction Your introduction needs to attract your reader! This is sometimes called a “hook.”
© Lehman Associations, LLC 2013 Technology as Strategy™ Tom Lehman Lehman Associates, LLC Lehman Reports 2014 Technology Institute NYSAE April, 2014.
ELEE 4303 Digital II Introduction to Verilog. ELEE 4303 Digital II Learning Objectives Get familiar with background of HDLs Basic concepts of Verilog.
New-Product Development and Product Life-Cycle Strategies
An Introduction to S3ML Beijing InfoQuick SinoVoice Speech Technology Corp. CHEN Ming, LV Shinan, LI Xiulin.
2-1 Visit UMT online at © UMT 2004 MKT100Version: PRINCIPLES OF MARKETING University of Management and Technology 1901 N. Fort.
Presentation by : ZhangCarolyn (Zhang, Yajie --- 张雅杰 ) Online Project.
The new GCSE 2018: Specification change as an opportunity to build best practice.
Copyright  2006 McGraw-Hill Australia Pty Ltd PPTs t/a Management Accounting: Information for managing and creating value 4e Slides prepared by Kim Langfield-Smith.
Lectures 2 & 3: Software Process Models Neelam Gupta.
Michael Saucier - OSIsoft Cliff Reeves - Microsoft Your Portal to Performance An Introduction to the RtPM Platform Copyright c 2004 OSIsoft Inc. All rights.
The Road to Literacy Development Native English Speakers vs. ELLs.
Microsoft Visual Basic 2015 CHAPTER ONE Introduction to Visual Basic 2015 Programming.
Consumer Software Companies in China 2015 Published on : October
ENTREPRENEURSHIP SABIR MALIK LECTURE 07. The Marketing Plan.
L197 Beginners' Chinese Module Team, The OpenUniversity Learning Chinese Characters – with ink, keyboard and mobile Apps Department of Languages The Open.
BALANCED SCORECARD ANALYSIS. What Is a Balanced Scorecard? A Measurement System? A Management System? A Management Philosophy?
Introduction 3 Learning Chinese: Learning Mandarin Chinese, Three Content Areas 1. Sounds: Mandarin, Pinyin 2. Shapes: Written Characters, Hanzi 3. Meanings:
Introduction to Visual Basic 2008 Programming
Introductions.
Chinese.
Presentation transcript:

TEL: FAX: WEBSITE: © 2002 iFLYTEK. All rights reserved. This presentation is for informational purposes only. iFLYTEK MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. ANHUI USTC iFLYTEK CO., LTD

Network Speech Technology Industry in China --Qiang Bai ANHUI USTC iFLYTEK Co., LTD.

3 General introduction of the speech industry

4 General introduction of Chinese speech industry Chinese speech technology achieved great progress in the past four years with the boom of Chinese economy. Chinese speech technology gradually occupies the key position in the Pan-Asian market by taking part into everyones everyday life.Chinese speech technology achieved great progress in the past four years with the boom of Chinese economy. Chinese speech technology gradually occupies the key position in the Pan-Asian market by taking part into everyones everyday life. In consideration of the weak foundation and brief history, Chinese speech technology market is still in a primary stage. The total market is limited, and much work is needed to cultivate.In consideration of the weak foundation and brief history, Chinese speech technology market is still in a primary stage. The total market is limited, and much work is needed to cultivate. Started in the mid 1980s, supported by the national government, now driving by the iFLYTEK, the research of Chinese speech synthesis technology has advanced rapidly. The synthesis effect could satisfy most of the practical application.Started in the mid 1980s, supported by the national government, now driving by the iFLYTEK, the research of Chinese speech synthesis technology has advanced rapidly. The synthesis effect could satisfy most of the practical application.

5 General introduction of Chinese speech industry The speech technology market of China is special. iFLYTEK occupied a 82% market share, with the predominant market position. Other suppliers are focusing on the lower-end market, far behind iFLYTEK in both technology and market.The speech technology market of China is special. iFLYTEK occupied a 82% market share, with the predominant market position. Other suppliers are focusing on the lower-end market, far behind iFLYTEK in both technology and market. Speech synthesis technology is the mainstream of market appliance. Several industries includes the major customers. Call centers o f telecommunication (value added services), financial services almost form the 70% of the present total market.Speech synthesis technology is the mainstream of market appliance. Several industries includes the major customers. Call centers o f telecommunication (value added services), financial services almost form the 70% of the present total market. The potential market of speech technology is huge and profitable.The potential market of speech technology is huge and profitable.

6 Present Situation of the technology

7 Challenges Chinese is special and complex. The text analysis and rhythm analysis of Chinese are hard to conduct.Chinese is special and complex. The text analysis and rhythm analysis of Chinese are hard to conduct. Theres no blank between two Chinese characters, the word boundary is hard to define.Theres no blank between two Chinese characters, the word boundary is hard to define. One Chinese character might have different pronunciation in different context, some pronunciation just exist in name.One Chinese character might have different pronunciation in different context, some pronunciation just exist in name. Special signs should be translate according to the Chinese reading habit. As a language with four tones, the rhythm feature of Chinese is very complex, such as tone sandhi and r-colloring.Special signs should be translate according to the Chinese reading habit. As a language with four tones, the rhythm feature of Chinese is very complex, such as tone sandhi and r-colloring. Mandarin is rhythm based, the mark up system is Pinyin but not international phonetic alphabet.Mandarin is rhythm based, the mark up system is Pinyin but not international phonetic alphabet. During the construction of grammar rule and dictionary, a systematical transcript framework is needed to mark the exceptive phenomenon. At the same time, some sophisticate employees with the announcer background is needed.During the construction of grammar rule and dictionary, a systematical transcript framework is needed to mark the exceptive phenomenon. At the same time, some sophisticate employees with the announcer background is needed. How to find out the best pair to concatenation in the huge corpus? Based on the intelligent text analysis technology, how to predict the most fluent speech parameter?How to find out the best pair to concatenation in the huge corpus? Based on the intelligent text analysis technology, how to predict the most fluent speech parameter? Those are all the challenges in Chinese speech synthesis. Those are all the challenges in Chinese speech synthesis. 7

8 Innovation iFLYTEK has the intelligent text analysis technology, which could solve the problems of word segmentation, polyphonic characters, special signs and rhythm level questions.iFLYTEK has the intelligent text analysis technology, which could solve the problems of word segmentation, polyphonic characters, special signs and rhythm level questions. By using a huge speech database(3000sentences), together with the human markups, the fluent sound could be generate through the data driven, prosodic prediction and unit selection algorithm.By using a huge speech database(3000sentences), together with the human markups, the fluent sound could be generate through the data driven, prosodic prediction and unit selection algorithm. With the latest hmm-based, and absolute data driven method, we could generate the fluent voice with a small speech database( ).With the latest hmm-based, and absolute data driven method, we could generate the fluent voice with a small speech database( ). We could simulate the target persons pronunciation by the MLLR based voice conversion method.We could simulate the target persons pronunciation by the MLLR based voice conversion method. 8

9 Innovation iFLYTEK is the chair party for Chinese national governments speech technology standard working group.iFLYTEK is the chair party for Chinese national governments speech technology standard working group. iFLYTEK has finished the Mandarin speech synthesis system general technology specification. This specification has been released as the national standard.iFLYTEK has finished the Mandarin speech synthesis system general technology specification. This specification has been released as the national standard. This standard laid the ground work for Chinese speech technologys rapid growthThis standard laid the ground work for Chinese speech technologys rapid growth the Mandarin speech synthesis system general technology specification includes the CSSML mark up language, which could satisfy the need for those exceptional characters synthesis.the Mandarin speech synthesis system general technology specification includes the CSSML mark up language, which could satisfy the need for those exceptional characters synthesis. 9

Innovation iFLYTEK is aspiring to take part in setting the international SSML standard. Thats a great support to the Chinese language.iFLYTEK is aspiring to take part in setting the international SSML standard. Thats a great support to the Chinese language. Based on the CSSMLs result, iFLYTEK offers the international SSML standard the suggestions of how to improve Chinese.Based on the CSSMLs result, iFLYTEK offers the international SSML standard the suggestions of how to improve Chinese. Following are the latest updated terms for Chinese in the new version of SSML1.1Following are the latest updated terms for Chinese in the new version of SSML1.1 –New sign is defined to support the work of specify the word boundary. –Extend the meaning of special phoneme sign to further support Chinese Pinyins mark up. –Offers a system to define the name. especially for the Chinese name in which some characters would change their pronunciations. –Offers the way to describe dialects. 10

11 Keep step forward With 20 years persistent efforts, the speech technology has achieved great progress. Year Naturalness < Naturalness is the key label of the synthesized speech. The subjective scoring method is introduced to express the similarity between the human voices and the synthesized voices. MOS Mean Opinion Score 5 is the best 1 is the worst.

Obstructive factors in the application of speech technology

13 Obstructive factors Artificial services are still the mainstream for the costArtificial services are still the mainstream for the cost China is rich in labor resources, artificial call centers are simple and cheap. So a full-scale speech technology solution is far ahead.China is rich in labor resources, artificial call centers are simple and cheap. So a full-scale speech technology solution is far ahead. The average annual salary of the sophisticated call center workers in Asia, (2006. $) India 3,334 China 2,558 Malaysia 5,442 Philippines 3,348 Thailand 3,656 Singapore 13,677 Source:callcentres.net

14 Obstructive factors Too many dialects increase the difficulty of the speech technologys popularization.Too many dialects increase the difficulty of the speech technologys popularization. Chinese is a multi- national country, with 56 nationalities in all. Chinese language is complex, statistic shows therere more than 3000 dialects in China. Thats a great challenge to the development of Chinese speech technology.Chinese is a multi- national country, with 56 nationalities in all. Chinese language is complex, statistic shows therere more than 3000 dialects in China. Thats a great challenge to the development of Chinese speech technology. –Based on the existing effect that speech technology could achieve, the practical application effect could be improved through customization. But the premise is a thoroughly understanding of the customers need and purpose. iFLYTEK is leading in speech technology market for we are experienced and we have the professional solutions and the splendid team. – iFLYTEK gained the national support for the project of set up the Chinese dialects evaluation and recognition database in 2004.

15 Obstructive factors Customers acceptance to the IT solution needs improvementCustomers acceptance to the IT solution needs improvement Comparing to the developed country, the informatization construction of China is weak and simple. Mostly because the Chinese customers rely more on the hardware rather than the software. Statistic shows that in the oversea countries the ratio of hardware, software and service is 1:2:4, while in china the ratio is less than 4:2:1.Comparing to the developed country, the informatization construction of China is weak and simple. Mostly because the Chinese customers rely more on the hardware rather than the software. Statistic shows that in the oversea countries the ratio of hardware, software and service is 1:2:4, while in china the ratio is less than 4:2:1. –The differed acceptance to software and hardware influence the living condition of Chinese software and service industry. Theres a long time period to overpass until the living condition turns better and everyones concept change. Customers habits need to be cultivate.Customers habits need to be cultivate. –Due to the development of Chinese informatization, telecommunication comes to Chinese residents just a few years; artificial services are the mainstream now. Self-service supported by the speech technology is not so popular as the developed country.

16 Obstructive factors The actualize capacity need progressThe actualize capacity need progress System Integration corporations (SI) are the main body to develop the application of speech technology in China. Therere more than 900 such companies in China without unit focus and steering. Thus much energy are spending on the duplicative develop and illicit competition. The whole industry is not canonical and ordered yet.System Integration corporations (SI) are the main body to develop the application of speech technology in China. Therere more than 900 such companies in China without unit focus and steering. Thus much energy are spending on the duplicative develop and illicit competition. The whole industry is not canonical and ordered yet. –Speech technology are gradually sophisticating, but SI companies still stay at the preliminary stage. Some work like customer needs generation, product design, testing, customization, customers cultivation etc. are in projecting absence. –As the leader of Chinese speech technology, iFLYTEK focused on the basic theories research, practical applications development, customers cultivate, partners training, etc. Now she has been the effective guarantee to the success of speech technology.

17 Current Situation of commercial application

18 Driving force of speech technologys application The top priority of northern Americans company is to satisfy the customers need, to help the customer find the information and service that they need as quick as possible. At the same time, they try to improve the efficiency and reduce the labor cost.The top priority of northern Americans company is to satisfy the customers need, to help the customer find the information and service that they need as quick as possible. At the same time, they try to improve the efficiency and reduce the labor cost. Source: Benchmark Portal, May 2005 Find the information Reduce the pressure Cut the cost Improve loyalty

19 Driving force of speech technologys application The top priority of Chinese company is profit. They want to use speech technology to offer more information and value added services. So, in china, the big customers all belong to the telecommunication industry.The top priority of Chinese company is profit. They want to use speech technology to offer more information and value added services. So, in china, the big customers all belong to the telecommunication industry. Major Goals Customers Deploy Speech Increase the income To offer the information and service Increase competitive edge Efficiency Source: iFYTEK Survey

20 Driving force of speech technologys application The developing speech technology is driven by some actual reasons. At the same time, weve found that Chinese customers comment on the speech technology is different from the sophisticated northern American market.The developing speech technology is driven by some actual reasons. At the same time, weve found that Chinese customers comment on the speech technology is different from the sophisticated northern American market. Source: NUANCE V-World Survey

21 Present Chinese speech technology market In the major application industries like telecommunication, financing, energy, transportation etc, iFLYTEK occupies a 82% market share; representing the trend of the market.In the major application industries like telecommunication, financing, energy, transportation etc, iFLYTEK occupies a 82% market share; representing the trend of the market. iFLYTEK has more than 800 partners in China. The total application cases are more than 6000, each second, millions of customers get their information and services through the iFLYTEK speech technology.iFLYTEK has more than 800 partners in China. The total application cases are more than 6000, each second, millions of customers get their information and services through the iFLYTEK speech technology. In China, there are many application cases in the value added services. Speech recognition technology is now being applied in the major industries like telecommunication, financing, energy, transportation etc.In China, there are many application cases in the value added services. Speech recognition technology is now being applied in the major industries like telecommunication, financing, energy, transportation etc.

Applications in all major industries

23 Chinese speech recognition applications

24 National project Key projectMulti-language information service for 2008 Olympic GamesKey projectMulti-language information service for 2008 Olympic Games iFLYTEK has been nominated as the only speech technology provider, and been named as the Best participant by the national 863 committee.iFLYTEK has been nominated as the only speech technology provider, and been named as the Best participant by the national 863 committee. The major function of it is information searching through telephone.

TEL: FAX: WEBSITE: © 2002 iFLYTEK. All rights reserved. This presentation is for informational purposes only. iFLYTEK MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. Thanks!