Pen Research Jay Pittman Development Lead Tablet PC Handwriting Recognition Microsoft Corporation Jay Pittman Development Lead Tablet PC Handwriting Recognition.

Slides:



Advertisements
Similar presentations
U.S. Government Language Requirements U.S. Government Language Requirements 7 September 2000 Everette Jordan Department of Defense
Advertisements

© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Feature: Identity Management - Login © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or.
Paragon Software Group presents PenReader. Paragon Software Group – International Holding Founded in 1994 Location Germany (HQ), NL, Russia, USA, Japan.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Windows 7 Training. Windows ® 7 Compatibility Session 0 Isolation Isolation of Windows 7 Services.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
© 2010 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Feature: Reprint Outstanding Transactions Report © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product.
Feature: Purchase Requisitions - Requester © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
MIX 09 4/15/ :14 PM © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Feature: Payroll and HR Enhancements © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or.
1 Linguistic Resources needed by Nuance Jan Odijk Cocosda/Write Workshop.
July 20, 2005Microsoft Tablet PC Microsoft’s Cursive Recognizer Jay Pittman and the entire Microsoft Handwriting Recognition Research and Development Team.
Windows 7 Training. Windows ® 7 Compatibility Installer Detection.
Windows 7 Training Microsoft Confidential. Windows ® 7 Compatibility Version Checking.
Multitenant Model Request/Response General Model.
In the knowledge society of the 21st century, language competence and inter-cultural understanding are not optional extras, they are an essential part.
Feature: Purchase Order Prepayments II © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are.
Announcing Demo Announcing.
Feature: OLE Notes Migration Utility
Feature: Web Client Keyboard Shortcuts © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are.
UNLIMITED. SIMULTANEOUS. NO CHECK-OUT. eREFERENCE.
Feature: SmartList Usability Enhancements © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
Session 1.
Built by Developers for Developers…. © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
 Rico Mariani Architect Microsoft Corporation.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Feature: Assign an Item to Multiple Sites © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Feature: Print Remaining Documents © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or.
1 Translate and Translator Toolkit Universally accessible information through translation Jeff Chin Product Manager Michael Galvez Product Manager.
© 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or.
Feature: Document Attachment –Replace OLE Notes © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product.
Feature: Suggested Item Enhancements – Sales Script and Additional Information © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows.
Feature: Customer Combiner and Modifier © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are.
Feature: Employee Self Service Timecard Entry © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
Ian Ellison-Taylor General Manager Microsoft Corporation PC27.
© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or.
demo Instance AInstance B Read “7” Write “8”

customer.
demo © 2008 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names.
demo Demo.
demo QueryForeign KeyInstance /sm:body()/x:Order/x:Delivery/y:TrackingId1Z
Feature: Suggested Item Enhancements – Analysis and Assignment © 2013 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and.
Windows Azure SQL Data Sync Name Title Microsoft Corporation.
projekt202 © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are.
The CLR CoreCLRCoreCLR © 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product.
© 2009 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks.
© 2008 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or.

Tel: Fax: P.O. Box: 22392, Dubai - UAE
Advanced Directives: What to Assess with Seniors
Localization and Globalization in Windows Runtime Apps
Возможности Excel 2010, о которых следует знать
Title of Presentation 11/22/2018 3:34 PM
Office Mac /30/2018 © 2010 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Title of Presentation 12/2/2018 3:48 PM
1/3/2019 1:21 PM © 2007 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
8/04/2019 9:13 PM © 2006 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered.
Виктор Хаджийски Катедра “Металургия на желязото и металолеене”
Title of Presentation 5/12/ :53 PM
Шитманов Дархан Қаражанұлы Тарих пәнінің
Title of Presentation 5/24/2019 1:26 PM
日本初公開!? Vista の新機能を実演 とっちゃん わんくま同盟 7/23/2019 9:09 AM
Title of Presentation 7/24/2019 8:53 PM
Presentation transcript:

Pen Research Jay Pittman Development Lead Tablet PC Handwriting Recognition Microsoft Corporation Jay Pittman Development Lead Tablet PC Handwriting Recognition Microsoft Corporation

TDNN Time-Delayed Neural Network Ink is cut into segments via simple algorithm For Latin script, we just cut at bottoms For Arabic or Hindi, we may need a different algorithm Features are computed per segment Chebychev coefficients Variety of size metrics: width, height, offsets to neighbors, etc. Each supported letter or character has an output node Many have 2 outputs, for beginning of character and continuation of character Very large training sets are collected from tens of thousands of native writers Time-Delayed Neural Network Ink is cut into segments via simple algorithm For Latin script, we just cut at bottoms For Arabic or Hindi, we may need a different algorithm Features are computed per segment Chebychev coefficients Variety of size metrics: width, height, offsets to neighbors, etc. Each supported letter or character has an output node Many have 2 outputs, for beginning of character and continuation of character Very large training sets are collected from tens of thousands of native writers

Beam Search Very similar to speech recognition systems System lexicon Simple list of words (like a spellchecker) Stored as a trie Plus regular expressions for numbers, dates, times, currency, phone numbers, etc. Low-scoring sequences are “culled” from the trie as we go Converts character recognizer into word recognizer We can recognize a word even if you mangle one of the letters Supports sloppier (and therefore faster) handwriting Very similar to speech recognition systems System lexicon Simple list of words (like a spellchecker) Stored as a trie Plus regular expressions for numbers, dates, times, currency, phone numbers, etc. Low-scoring sequences are “culled” from the trie as we go Converts character recognizer into word recognizer We can recognize a word even if you mangle one of the letters Supports sloppier (and therefore faster) handwriting

… … … Output Matrix dog68 clog57 dug51 doom42 divvy37 ooze35 cloy34 doxy29 client22 dozy13 Ink Segments Top 10 List d 92 a 88 b 14 c 86 o 77 a 73 l 76 t 5 g 68 t 8 b 6 o 65 g 57 t 12 TDNN Space TDNN a b d o g a b t t c l o g t Lexicon e a … … … … … Beam Search a* a d* d g* g o* o TDNN + Beam Search

Personalization New In Vista Shape adaptation Collect samples from you Simple idea: continue same training we do at Microsoft, but only on your samples Implicit Explicit Text adaptation Collect your personal words from Word and outgoing s New In Vista Shape adaptation Collect samples from you Simple idea: continue same training we do at Microsoft, but only on your samples Implicit Explicit Text adaptation Collect your personal words from Word and outgoing s Capital I? dev RTM KKOMO dogfooding Qi trie featurize Herry

Open System Recognizer API is published Any recognizer may support this API A non-Microsoft recognizer can be installed, and it will be invoked by the inking platform Non-Microsoft recognizers available now in Japan, China, and Russia Compete with my group Or cover languages we don’t cover Or cover non-text (music, math, chemistry) Sorry: recognition result API is very text oriented Strings in Unicode Perhaps your text might be XML Recognizer API is published Any recognizer may support this API A non-Microsoft recognizer can be installed, and it will be invoked by the inking platform Non-Microsoft recognizers available now in Japan, China, and Russia Compete with my group Or cover languages we don’t cover Or cover non-text (music, math, chemistry) Sorry: recognition result API is very text oriented Strings in Unicode Perhaps your text might be XML Microsoft Tablet PC / Tablet PC Platform / Programming the Tablet PC / Creating a Recognizer Start / All Programs / Microsoft Tablet PC Platform SDK / Microsoft Tablet PC Platform SDK Documentation

Latin Orthography XP has U.S. English, U.K. English, German, French, Spanish, and Italian Vista adds Dutch and Brazilian Portuguese Currently working on Swedish, Danish, Norwegian (Bokmal), Finnish, Polish, Czech, Portugal Portuguese, Catalan, Romanian, Croatian, and Serbian No ship estimates are available Serbian is written in both Latin and Cyrillic Making plans for the next batch: Probably Bahasa Indonese (Indonesian), Hungarian, Turkish, Slovak, Slovene, Lithuanian, Estonian, Latvian, Vietnamese, Tagalog (Filipino), others TBD World’s largest orthography Largest count of languages Largest combined count of literates XP has U.S. English, U.K. English, German, French, Spanish, and Italian Vista adds Dutch and Brazilian Portuguese Currently working on Swedish, Danish, Norwegian (Bokmal), Finnish, Polish, Czech, Portugal Portuguese, Catalan, Romanian, Croatian, and Serbian No ship estimates are available Serbian is written in both Latin and Cyrillic Making plans for the next batch: Probably Bahasa Indonese (Indonesian), Hungarian, Turkish, Slovak, Slovene, Lithuanian, Estonian, Latvian, Vietnamese, Tagalog (Filipino), others TBD World’s largest orthography Largest count of languages Largest combined count of literates

East Asian Orthographies Ideographic Orthographies Completely different code base Focus on large count of characters XP has Japanese, Chinese (Simplified), Chinese (Traditional), and Korean Vista adds personalization, and improves cursive recognition Completely different code base Focus on large count of characters XP has Japanese, Chinese (Simplified), Chinese (Traditional), and Korean Vista adds personalization, and improves cursive recognition

Cyrillic and Greek Orthographies Same TDNN + beam search technology works equally well in Greek and Cyrillic But we must collect new training sets Cyrillic has more upper/lowercase confusion Russian Collection completed Currently working on recognizer No estimate on shipping date Some work also underway in Serbian Cyrillic: Russian, Ukrainian, Bulgarian, Serbian, Byelorussian, Macedonian, Kazakh Other former-Soviet Turkic republics are in a state of transition back to Latin scripts Serbian is written in both Latin and Cyrillic Same TDNN + beam search technology works equally well in Greek and Cyrillic But we must collect new training sets Cyrillic has more upper/lowercase confusion Russian Collection completed Currently working on recognizer No estimate on shipping date Some work also underway in Serbian Cyrillic: Russian, Ukrainian, Bulgarian, Serbian, Byelorussian, Macedonian, Kazakh Other former-Soviet Turkic republics are in a state of transition back to Latin scripts Serbian is written in both Latin and Cyrillic

Greek ΕλληνικήέδωσανμουγλώσσαΤη Ink by Dr. John Drakopoulos I was given a language that is Hellenic

Russian Ink by Vladimir Smirnov Кремль. Кремль, Ото слышал всех я про Kremlin, Kremlin. I’ve heard from everyone about [it]

Bi-Directional Orthographies Arabic and Hebrew Text written right-to-left, but numbers written left-to-right Arabic has its own digits Hebrew uses “western” digits No uppercase / lowercase distinction Arabic is cursive-only Up to 4 forms per letter (initial, medial, final, isolated) Hebrew is print-only 5 letters have a separate final form Abjabs (consonant alphabets) We have collections underway Some initial experimentation in both languages No estimate on shipping date Arabic script variations used for Farsi (Persian), Urdu, Kurdish, Azeri (in Iran), Pashto, Darwi, Baluchi, Sindhi (in Pakistan), Kashmiri (in Pakistan ) Text written right-to-left, but numbers written left-to-right Arabic has its own digits Hebrew uses “western” digits No uppercase / lowercase distinction Arabic is cursive-only Up to 4 forms per letter (initial, medial, final, isolated) Hebrew is print-only 5 letters have a separate final form Abjabs (consonant alphabets) We have collections underway Some initial experimentation in both languages No estimate on shipping date Arabic script variations used for Farsi (Persian), Urdu, Kurdish, Azeri (in Iran), Pashto, Darwi, Baluchi, Sindhi (in Pakistan), Kashmiri (in Pakistan )

Bi-Directional Example Arabic ١٢٣ 123 عمره [his] age عاماً Year[s] Ink by Ahmed Kamal

Bi-Directional Example Hebrew מבבל. Plimpton 322 [is] from Babylonia. 322 פלימפטון Ink by Ethan Zoller

Brahmic Orthographies Left to right, no uppercase / lowercase Abugidas (syllabic alphabets) Default vowel (short “Ə” or “uh” sound) is not written All other vowels require a vowel sign added to the consonant This includes a “no-vowel sign” Called halant (“choked”) in Hindi, virama in other Indic languages Vowel sound follows consonant sound There are also independent vowels Hindi Devanagari collection in progress Some initial experimentation No estimate on shipping date Brahmic scripts: Devanagari (Hindi, Marathi), Thai, Bengali, Gujarati, Gurmukhi (Punjabi), Tamil, Telugu, Kannada, Malayalam, Oriya, Sinhala, Khmer, Lao, Tibetan, Myanmar, Sindhi (in India), Kashmiri (in India) Left to right, no uppercase / lowercase Abugidas (syllabic alphabets) Default vowel (short “Ə” or “uh” sound) is not written All other vowels require a vowel sign added to the consonant This includes a “no-vowel sign” Called halant (“choked”) in Hindi, virama in other Indic languages Vowel sound follows consonant sound There are also independent vowels Hindi Devanagari collection in progress Some initial experimentation No estimate on shipping date Brahmic scripts: Devanagari (Hindi, Marathi), Thai, Bengali, Gujarati, Gurmukhi (Punjabi), Tamil, Telugu, Kannada, Malayalam, Oriya, Sinhala, Khmer, Lao, Tibetan, Myanmar, Sindhi (in India), Kashmiri (in India)

Vowel Signs “KAA” “KU” “KUU” “KO” “KOW” “KE” “KEE” “KA” “KAI” “KAY” “KR” “K”

Consonant Clusters ST halant RING Unicode buffer: A transliteration of the English word “string”

Brahmic Comparison K Kh G Gh C Ch Devanagari: क ख ग घ च छ Bengali: ক খ গ ঘ চ ছ Gurmukhi: ਕ ਖ ਗ ਘ ਚ ਛ Gujarati: ક ખ ગ ઘ ચ છ Tamil: க ச Telugu: క ఖ గ ఘ చ ఛ Kannada: ಕ ಖ ಗ ಘ ಚ ಛ K Kh G Gh C Ch Devanagari: क ख ग घ च छ Bengali: ক খ গ ঘ চ ছ Gurmukhi: ਕ ਖ ਗ ਘ ਚ ਛ Gujarati: ક ખ ગ ઘ ચ છ Tamil: க ச Telugu: క ఖ గ ఘ చ ఛ Kannada: ಕ ಖ ಗ ಘ ಚ ಛ

Demo Title Name Title Group Name Title Group

Early Feedback As for Vista, I am in love with the B2 Tablet implementation! I can see where the effort is going! The TIP is awesome and the handwriting recognition is vastly improved. There is hardly a time where I have to correct my input - no matter how sloppy I am. Unsolicited feedback from a Vista Beta II user whose identity I do not know

© 2006 Microsoft Corporation. All rights reserved. Microsoft, Windows, Windows Vista and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries. The information herein is for informational purposes only and represents the current view of Microsoft Corporation as of the date of this presentation. Because Microsoft must respond to changing market conditions, it should not be interpreted to be a commitment on the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information provided after the date of this presentation. MICROSOFT MAKES NO WARRANTIES, EXPRESS, IMPLIED OR STATUTORY, AS TO THE INFORMATION IN THIS PRESENTATION.

Backup Slides

Devanaguri Example प “PA” [R sign] period [abbreviation] headline ल “LA” period [abbreviation] headline म “MA” ि [vowel sign] “I” ा [vowel sign] “AA” ि [vowel sign] “I” Pr. Ltd. [Private Limited]