Download presentation
Presentation is loading. Please wait.
Published byLilian Henbest Modified over 10 years ago
2
1 +86-551-5331800 +86-551-5331801 http://www.iflytek.com TEL: FAX: WEBSITE: © 2002 iFLYTEK. All rights reserved. This presentation is for informational purposes only. iFLYTEK MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. ANHUI USTC iFLYTEK CO., LTD
3
Network Speech Technology Industry in China --Qiang Bai ANHUI USTC iFLYTEK Co., LTD.
4
3 General introduction of the speech industry
5
4 General introduction of Chinese speech industry Chinese speech technology achieved great progress in the past four years with the boom of Chinese economy. Chinese speech technology gradually occupies the key position in the Pan-Asian market by taking part into everyones everyday life.Chinese speech technology achieved great progress in the past four years with the boom of Chinese economy. Chinese speech technology gradually occupies the key position in the Pan-Asian market by taking part into everyones everyday life. In consideration of the weak foundation and brief history, Chinese speech technology market is still in a primary stage. The total market is limited, and much work is needed to cultivate.In consideration of the weak foundation and brief history, Chinese speech technology market is still in a primary stage. The total market is limited, and much work is needed to cultivate. Started in the mid 1980s, supported by the national government, now driving by the iFLYTEK, the research of Chinese speech synthesis technology has advanced rapidly. The synthesis effect could satisfy most of the practical application.Started in the mid 1980s, supported by the national government, now driving by the iFLYTEK, the research of Chinese speech synthesis technology has advanced rapidly. The synthesis effect could satisfy most of the practical application.
6
5 General introduction of Chinese speech industry The speech technology market of China is special. iFLYTEK occupied a 82% market share, with the predominant market position. Other suppliers are focusing on the lower-end market, far behind iFLYTEK in both technology and market.The speech technology market of China is special. iFLYTEK occupied a 82% market share, with the predominant market position. Other suppliers are focusing on the lower-end market, far behind iFLYTEK in both technology and market. Speech synthesis technology is the mainstream of market appliance. Several industries includes the major customers. Call centers o f telecommunication (value added services), financial services almost form the 70% of the present total market.Speech synthesis technology is the mainstream of market appliance. Several industries includes the major customers. Call centers o f telecommunication (value added services), financial services almost form the 70% of the present total market. The potential market of speech technology is huge and profitable.The potential market of speech technology is huge and profitable.
7
6 Present Situation of the technology
8
7 Challenges Chinese is special and complex. The text analysis and rhythm analysis of Chinese are hard to conduct.Chinese is special and complex. The text analysis and rhythm analysis of Chinese are hard to conduct. Theres no blank between two Chinese characters, the word boundary is hard to define.Theres no blank between two Chinese characters, the word boundary is hard to define. One Chinese character might have different pronunciation in different context, some pronunciation just exist in name.One Chinese character might have different pronunciation in different context, some pronunciation just exist in name. Special signs should be translate according to the Chinese reading habit. As a language with four tones, the rhythm feature of Chinese is very complex, such as tone sandhi and r-colloring.Special signs should be translate according to the Chinese reading habit. As a language with four tones, the rhythm feature of Chinese is very complex, such as tone sandhi and r-colloring. Mandarin is rhythm based, the mark up system is Pinyin but not international phonetic alphabet.Mandarin is rhythm based, the mark up system is Pinyin but not international phonetic alphabet. During the construction of grammar rule and dictionary, a systematical transcript framework is needed to mark the exceptive phenomenon. At the same time, some sophisticate employees with the announcer background is needed.During the construction of grammar rule and dictionary, a systematical transcript framework is needed to mark the exceptive phenomenon. At the same time, some sophisticate employees with the announcer background is needed. How to find out the best pair to concatenation in the huge corpus? Based on the intelligent text analysis technology, how to predict the most fluent speech parameter?How to find out the best pair to concatenation in the huge corpus? Based on the intelligent text analysis technology, how to predict the most fluent speech parameter? Those are all the challenges in Chinese speech synthesis. Those are all the challenges in Chinese speech synthesis. 7
9
8 Innovation iFLYTEK has the intelligent text analysis technology, which could solve the problems of word segmentation, polyphonic characters, special signs and rhythm level questions.iFLYTEK has the intelligent text analysis technology, which could solve the problems of word segmentation, polyphonic characters, special signs and rhythm level questions. By using a huge speech database(3000sentences), together with the human markups, the fluent sound could be generate through the data driven, prosodic prediction and unit selection algorithm.By using a huge speech database(3000sentences), together with the human markups, the fluent sound could be generate through the data driven, prosodic prediction and unit selection algorithm. With the latest hmm-based, and absolute data driven method, we could generate the fluent voice with a small speech database(500-1000).With the latest hmm-based, and absolute data driven method, we could generate the fluent voice with a small speech database(500-1000). We could simulate the target persons pronunciation by the MLLR based voice conversion method.We could simulate the target persons pronunciation by the MLLR based voice conversion method. 8
10
9 Innovation iFLYTEK is the chair party for Chinese national governments speech technology standard working group.iFLYTEK is the chair party for Chinese national governments speech technology standard working group. iFLYTEK has finished the Mandarin speech synthesis system general technology specification. This specification has been released as the national standard.iFLYTEK has finished the Mandarin speech synthesis system general technology specification. This specification has been released as the national standard. This standard laid the ground work for Chinese speech technologys rapid growthThis standard laid the ground work for Chinese speech technologys rapid growth the Mandarin speech synthesis system general technology specification includes the CSSML mark up language, which could satisfy the need for those exceptional characters synthesis.the Mandarin speech synthesis system general technology specification includes the CSSML mark up language, which could satisfy the need for those exceptional characters synthesis. 9
11
Innovation iFLYTEK is aspiring to take part in setting the international SSML standard. Thats a great support to the Chinese language.iFLYTEK is aspiring to take part in setting the international SSML standard. Thats a great support to the Chinese language. Based on the CSSMLs result, iFLYTEK offers the international SSML standard the suggestions of how to improve Chinese.Based on the CSSMLs result, iFLYTEK offers the international SSML standard the suggestions of how to improve Chinese. Following are the latest updated terms for Chinese in the new version of SSML1.1Following are the latest updated terms for Chinese in the new version of SSML1.1 –New sign is defined to support the work of specify the word boundary. –Extend the meaning of special phoneme sign to further support Chinese Pinyins mark up. –Offers a system to define the name. especially for the Chinese name in which some characters would change their pronunciations. –Offers the way to describe dialects. 10
12
11 Keep step forward With 20 years persistent efforts, the speech technology has achieved great progress. Year1995199820012004 Naturalness <3.03.03.84.3 Naturalness is the key label of the synthesized speech. The subjective scoring method is introduced to express the similarity between the human voices and the synthesized voices. MOS Mean Opinion Score 5 is the best 1 is the worst.
13
Obstructive factors in the application of speech technology
14
13 Obstructive factors Artificial services are still the mainstream for the costArtificial services are still the mainstream for the cost China is rich in labor resources, artificial call centers are simple and cheap. So a full-scale speech technology solution is far ahead.China is rich in labor resources, artificial call centers are simple and cheap. So a full-scale speech technology solution is far ahead. The average annual salary of the sophisticated call center workers in Asia, (2006. $) India 3,334 China 2,558 Malaysia 5,442 Philippines 3,348 Thailand 3,656 Singapore 13,677 Source:callcentres.net
15
14 Obstructive factors Too many dialects increase the difficulty of the speech technologys popularization.Too many dialects increase the difficulty of the speech technologys popularization. Chinese is a multi- national country, with 56 nationalities in all. Chinese language is complex, statistic shows therere more than 3000 dialects in China. Thats a great challenge to the development of Chinese speech technology.Chinese is a multi- national country, with 56 nationalities in all. Chinese language is complex, statistic shows therere more than 3000 dialects in China. Thats a great challenge to the development of Chinese speech technology. –Based on the existing effect that speech technology could achieve, the practical application effect could be improved through customization. But the premise is a thoroughly understanding of the customers need and purpose. iFLYTEK is leading in speech technology market for we are experienced and we have the professional solutions and the splendid team. – iFLYTEK gained the national support for the project of set up the Chinese dialects evaluation and recognition database in 2004.
16
15 Obstructive factors Customers acceptance to the IT solution needs improvementCustomers acceptance to the IT solution needs improvement Comparing to the developed country, the informatization construction of China is weak and simple. Mostly because the Chinese customers rely more on the hardware rather than the software. Statistic shows that in the oversea countries the ratio of hardware, software and service is 1:2:4, while in china the ratio is less than 4:2:1.Comparing to the developed country, the informatization construction of China is weak and simple. Mostly because the Chinese customers rely more on the hardware rather than the software. Statistic shows that in the oversea countries the ratio of hardware, software and service is 1:2:4, while in china the ratio is less than 4:2:1. –The differed acceptance to software and hardware influence the living condition of Chinese software and service industry. Theres a long time period to overpass until the living condition turns better and everyones concept change. Customers habits need to be cultivate.Customers habits need to be cultivate. –Due to the development of Chinese informatization, telecommunication comes to Chinese residents just a few years; artificial services are the mainstream now. Self-service supported by the speech technology is not so popular as the developed country.
17
16 Obstructive factors The actualize capacity need progressThe actualize capacity need progress System Integration corporations (SI) are the main body to develop the application of speech technology in China. Therere more than 900 such companies in China without unit focus and steering. Thus much energy are spending on the duplicative develop and illicit competition. The whole industry is not canonical and ordered yet.System Integration corporations (SI) are the main body to develop the application of speech technology in China. Therere more than 900 such companies in China without unit focus and steering. Thus much energy are spending on the duplicative develop and illicit competition. The whole industry is not canonical and ordered yet. –Speech technology are gradually sophisticating, but SI companies still stay at the preliminary stage. Some work like customer needs generation, product design, testing, customization, customers cultivation etc. are in projecting absence. –As the leader of Chinese speech technology, iFLYTEK focused on the basic theories research, practical applications development, customers cultivate, partners training, etc. Now she has been the effective guarantee to the success of speech technology.
18
17 Current Situation of commercial application
19
18 Driving force of speech technologys application The top priority of northern Americans company is to satisfy the customers need, to help the customer find the information and service that they need as quick as possible. At the same time, they try to improve the efficiency and reduce the labor cost.The top priority of northern Americans company is to satisfy the customers need, to help the customer find the information and service that they need as quick as possible. At the same time, they try to improve the efficiency and reduce the labor cost. Source: Benchmark Portal, May 2005 Find the information Reduce the pressure Cut the cost Improve loyalty
20
19 Driving force of speech technologys application The top priority of Chinese company is profit. They want to use speech technology to offer more information and value added services. So, in china, the big customers all belong to the telecommunication industry.The top priority of Chinese company is profit. They want to use speech technology to offer more information and value added services. So, in china, the big customers all belong to the telecommunication industry. Major Goals Customers Deploy Speech Increase the income To offer the information and service Increase competitive edge Efficiency Source: iFYTEK Survey
21
20 Driving force of speech technologys application The developing speech technology is driven by some actual reasons. At the same time, weve found that Chinese customers comment on the speech technology is different from the sophisticated northern American market.The developing speech technology is driven by some actual reasons. At the same time, weve found that Chinese customers comment on the speech technology is different from the sophisticated northern American market. Source: NUANCE V-World Survey
22
21 Present Chinese speech technology market In the major application industries like telecommunication, financing, energy, transportation etc, iFLYTEK occupies a 82% market share; representing the trend of the market.In the major application industries like telecommunication, financing, energy, transportation etc, iFLYTEK occupies a 82% market share; representing the trend of the market. iFLYTEK has more than 800 partners in China. The total application cases are more than 6000, each second, millions of customers get their information and services through the iFLYTEK speech technology.iFLYTEK has more than 800 partners in China. The total application cases are more than 6000, each second, millions of customers get their information and services through the iFLYTEK speech technology. In China, there are many application cases in the value added services. Speech recognition technology is now being applied in the major industries like telecommunication, financing, energy, transportation etc.In China, there are many application cases in the value added services. Speech recognition technology is now being applied in the major industries like telecommunication, financing, energy, transportation etc.
23
Applications in all major industries
24
23 Chinese speech recognition applications
25
24 National project Key projectMulti-language information service for 2008 Olympic GamesKey projectMulti-language information service for 2008 Olympic Games iFLYTEK has been nominated as the only speech technology provider, and been named as the Best participant by the national 863 committee.iFLYTEK has been nominated as the only speech technology provider, and been named as the Best participant by the national 863 committee. The major function of it is information searching through telephone.
26
25 +86-551-5331800 +86-551-5331801 http://www.iflytek.com TEL: FAX: WEBSITE: © 2002 iFLYTEK. All rights reserved. This presentation is for informational purposes only. iFLYTEK MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY. Thanks!
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.