Pre-SWOT Report. Online Handwritten Arabic OCR (Online Handwritten Recognition: OHR) Dr. Ashraf Al-Marakby Eng. Hesham Osman Eng. Randa Al-Anwar Dr. Mohamed.

Pre-SWOT Report. Online Handwritten Arabic OCR (Online Handwritten Recognition: OHR) Dr. Ashraf Al-Marakby Eng. Hesham Osman Eng. Randa Al-Anwar Dr. Mohamed Waleed Fakhr Dr. Mohsen Rashwan Eng. Eman Mostafa

1- Introduction and challenges
The widespread use of pen-based handheld devices such as PDAs, smartphones, and tablet PCs increases the demand for high-performance online handwriting recognition systems. These systems recognize text while the user is writing with an online writing device, capturing the temporal (dynamic) information of the writing. This information includes the number, duration, and order of the strokes (a stroke is the writing from pen-down to pen-up).
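As an illustration of the temporal information described above, the following minimal Python sketch stores each stroke as a time-ordered list of pen samples and derives the number, duration, and order of the strokes. The `Stroke` class, the `(x, y, t)` sample layout, and the `temporal_summary` function are assumptions made for this example; they are not taken from any particular OHR product or toolkit.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# One pen sample: (x, y, t), with t in seconds. This layout is an assumption.
Point = Tuple[float, float, float]


@dataclass
class Stroke:
    """Pen samples recorded between one pen-down and the next pen-up."""
    points: List[Point]

    @property
    def start(self) -> float:
        return self.points[0][2]

    @property
    def duration(self) -> float:
        return self.points[-1][2] - self.points[0][2]


def temporal_summary(strokes: List[Stroke]) -> Dict[str, object]:
    """Return the number of strokes, their writing order, and their durations."""
    # Indices of the strokes sorted by pen-down time.
    order = sorted(range(len(strokes)), key=lambda i: strokes[i].start)
    return {
        "num_strokes": len(strokes),
        "order": order,
        # Durations listed in writing order.
        "durations": [strokes[i].duration for i in order],
    }
```

An offline (scanned) image of the same word would carry only the `x` and `y` values; the `t` component is what distinguishes online data.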

Main Challenges in Arabic OHR
- Unconstrained writing problem
- Dotting problem
- Delayed strokes problem: association between letters and their diacritical marks
- Overlapping problem
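The delayed strokes problem can be illustrated with a deliberately simple heuristic, sketched below in Python. It assumes that a stroke whose horizontal centre falls inside the horizontal extent of an earlier body stroke is a delayed stroke (a dot or mark) belonging to that body stroke. The function names and the rule itself are assumptions for illustration only; a real system would need a more robust rule and would still have to map each mark to an individual letter.

```python
from typing import List, Optional, Tuple

# A stroke as (x, y) samples in time order (assumed representation).
Stroke = List[Tuple[float, float]]


def x_span(stroke: Stroke) -> Tuple[float, float]:
    """Horizontal extent (min x, max x) of a stroke."""
    xs = [x for x, _ in stroke]
    return min(xs), max(xs)


def attach_delayed_strokes(strokes: List[Stroke]) -> List[Optional[int]]:
    """For each stroke, in writing order, return the index of the body
    stroke it is attached to, or None if it is treated as a body stroke."""
    bodies: List[int] = []
    owner: List[Optional[int]] = []
    for i, stroke in enumerate(strokes):
        lo, hi = x_span(stroke)
        centre = (lo + hi) / 2
        host = None
        for j in bodies:
            b_lo, b_hi = x_span(strokes[j])
            if b_lo <= centre <= b_hi:
                host = j  # first earlier body stroke that covers the centre
                break
        if host is None:
            bodies.append(i)  # nothing earlier covers it: a new body stroke
        owner.append(host)
    return owner
```

For two body strokes written right to left followed by two dots, the sketch attaches each dot to the body stroke above or below which it was placed.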

2- Applications
- Education domain: a huge number of attractive applications for students and teachers on tablet PCs and other devices (smart pens, smart boards, etc.)
- Online mapping of notes to text in online data collection (questionnaires, forms, etc.)
- A huge number of mobile applications for business people and others

3- State of the art in products (Latin script)
OHR is a highly mature technology for Latin script, with excellent performance. Microsoft, RitePen, VisionObjects, and QuickScript are a few of the very successful OHR solution providers, covering more than 20 languages. Most require no training and allow for a user-defined dictionary and user adaptation. Performance is claimed to be excellent for unconstrained, continuous writing.

4- State of the art in products (Arabic script)
Sakhr and ImagiNet both offer OHR products for most MS-based devices (HTC, Pocket PC, PDA). VisionObjects and QuickScript also support Arabic OHR and claim good performance. A comparison of these products on a standard benchmark is needed to identify the strengths and weaknesses of each.

5- State of the art in research for Arabic OHR
- Main focus: recognition of truly unconstrained, continuous cursive writing.
- Focus on developing algorithms that can run in real time and on limited resources.
- Significant recent efforts: most recent research employs recurrent neural networks, HMMs, and the fusion of other pattern recognition techniques, and also makes use of the offline image as an extra source of information.

Competition: ICDAR 2009
- The database consists of 23,251 Arabic words handwritten by more than 130 different writers (ADAB database: Tunisian city names).
- For testing, 2,400 words are used, written by 24 writers different from those in the training set.
- Best performance was obtained by the VisionObjects team: 99%. The system uses neural networks with other PR techniques.
- Second best is MDLSTM by Alex Graves: 96%, using a hierarchy of multidimensional recurrent neural networks.

6- Required Modules
- Pre-processing tools: delayed strokes, smoothing, resampling, etc.
- PAW (piece of Arabic word) or letter segmentation and extraction tool
- Language models
- Feature extraction tools
- Statistical training tools: HTK, SRI, Matlab, and many neural network tools
- Error analysis tools: need to be implemented
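Two of the pre-processing steps named above, resampling and smoothing, can be sketched in a few lines of Python. The sketch below is one common way to do them, assuming a stroke is a list of `(x, y)` points: `resample` places a fixed number of points at equal arc-length intervals using linear interpolation, and `smooth` applies a moving average. Both function names and the default window size are assumptions for this example, not a description of any specific tool in the list.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]


def resample(stroke: List[Point], n: int) -> List[Point]:
    """Return n points equally spaced along the stroke's arc length."""
    if n < 2 or len(stroke) < 2:
        return list(stroke)
    # Cumulative arc length at each original sample.
    cum = [0.0]
    for (x0, y0), (x1, y1) in zip(stroke, stroke[1:]):
        cum.append(cum[-1] + math.hypot(x1 - x0, y1 - y0))
    total = cum[-1]
    if total == 0.0:
        return [stroke[0]] * n  # pen did not move (for example, a dot)
    out: List[Point] = []
    seg = 0
    for k in range(n):
        target = total * k / (n - 1)
        # Advance to the segment that contains the target arc length.
        while seg < len(stroke) - 2 and cum[seg + 1] < target:
            seg += 1
        span = cum[seg + 1] - cum[seg]
        t = 0.0 if span == 0.0 else (target - cum[seg]) / span
        (x0, y0), (x1, y1) = stroke[seg], stroke[seg + 1]
        out.append((x0 + t * (x1 - x0), y0 + t * (y1 - y0)))
    return out


def smooth(stroke: List[Point], window: int = 3) -> List[Point]:
    """Moving-average smoothing; the window shrinks at both ends."""
    half = window // 2
    out: List[Point] = []
    for i in range(len(stroke)):
        chunk = stroke[max(0, i - half): i + half + 1]
        out.append((sum(p[0] for p in chunk) / len(chunk),
                    sum(p[1] for p in chunk) / len(chunk)))
    return out
```

Resampling removes the dependence on writing speed and device sampling rate, while smoothing reduces jitter from the digitizer; the order in which the two are applied is a design choice left open here.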

7- Required Resources
- Major question: how many PAWs are there, and how many of them are most frequently used? (An estimate of 500 is given.)
- Word-annotated corpus (estimated 2000 pages by 2000 writers)
- Character/PAW-annotated corpus for initial models, covering 10 instances of each PAW
- Dictionaries with PAW transcriptions
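The question of how many PAWs exist and which are most frequent can be explored on any word list with a short script. The Python sketch below splits a word into PAWs using the rule that certain Arabic letters never join to the following letter, so a PAW ends after them. It assumes plain, undiacritized text; the letter set, the special handling of the standalone hamza, and the function names are assumptions for this illustration.

```python
from collections import Counter
from typing import Iterable, List

# Letters that do not join to the following letter (a PAW ends after them):
# alef and its hamza/madda forms, dal, thal, ra, zain, waw, waw with hamza,
# ta marbuta, alef maksura. Written as escapes to avoid encoding problems.
NON_CONNECTORS = set(
    "\u0627\u0622\u0623\u0625\u062F\u0630\u0631\u0632\u0648\u0624\u0629\u0649"
)
HAMZA = "\u0621"  # standalone hamza joins neither neighbour


def split_paws(word: str) -> List[str]:
    """Split an undiacritized Arabic word into its PAWs."""
    paws: List[str] = []
    current = ""
    for ch in word:
        if ch == HAMZA:
            if current:
                paws.append(current)  # hamza also breaks the previous PAW
            paws.append(ch)
            current = ""
            continue
        current += ch
        if ch in NON_CONNECTORS:
            paws.append(current)
            current = ""
    if current:
        paws.append(current)
    return paws


def paw_frequencies(words: Iterable[str]) -> Counter:
    """Count how often each PAW occurs over a list of words."""
    counts: Counter = Counter()
    for word in words:
        counts.update(split_paws(word))
    return counts
```

Running `paw_frequencies` over a large vocabulary and calling `most_common()` on the result would give a data-driven check of the estimate quoted above.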

8- Available Resources and Gaps
- ADAB database is the only large one available (limited domain, limited number of writers).
- Annotation and segmentation tools required.
- More data required.

9- LR proposed by ALTEC
For the training data, we suggest 10,000 writers, one page per person. In the first phase, we will start with 2000 writers, each writing two pages (an average of 50 words per page), which gives about 200,000 words. We could retain 150,000 words for training and 50,000 for benchmarking. The vocabulary issue must be addressed. We also need to ensure fair coverage of the PAWs. Cairo University has annotation tools to assist manual segmentation of the online data, and Dr. Sherif Abdou will kindly make them available to ALTEC.
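The corpus sizes quoted above follow from simple arithmetic, sketched here in Python. The function names are hypothetical. The `writers_needed` helper additionally assumes that the training and benchmarking sets are separated by writer, as in the ICDAR 2009 test set; the proposal itself does not state how the 150,000/50,000 split would be made.

```python
WORDS_PER_PAGE = 50  # average words per page, as stated in the text


def corpus_words(writers: int, pages_per_writer: int,
                 words_per_page: int = WORDS_PER_PAGE) -> int:
    """Approximate corpus size in words."""
    return writers * pages_per_writer * words_per_page


def writers_needed(target_words: int, pages_per_writer: int,
                   words_per_page: int = WORDS_PER_PAGE) -> int:
    """Whole writers needed to reach target_words.

    Assumes the split is made by writer, which is an assumption here.
    """
    per_writer = pages_per_writer * words_per_page
    return -(-target_words // per_writer)  # ceiling division


phase_one = corpus_words(2000, 2)            # 200000 words
full_target = corpus_words(10000, 1)         # 500000 words
train_writers = writers_needed(150_000, 2)   # 1500 writers
bench_writers = writers_needed(50_000, 2)    # 500 writers
```

Under that assumption, the first phase would reserve the pages of about 500 of the 2000 writers for benchmarking.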

10- Preliminary SWOT analysis
Strengths:
1. Expertise in DSP, pattern recognition, image processing, NLP, and stochastic methods
2. Potential to have huge amounts of annotated data
Weaknesses:
1. No comprehensive benchmarking available for Arabic OHR
2. No standard training database available to the research community for Arabic OHR
Opportunities:
1. Large market for such a technology: over 300 million native speakers, plus numerous other interested parties, over a wide range of platforms (tablet PCs, smartphones, etc.)

Threats:
1. Other R&D groups all over the world (esp. in the US) are working hard and racing for more reliable products and for more applications.
2. Microsoft could make its Arabic OHR product open source when it is done.

11- Survey
- Specify the application that OHR will be used for.
- What data is used (or intended) to train the system?
- What benchmark do you test your system on?
- Would you be interested in contributing to the data collection? In what capacity?
- Would you be interested in buying annotated Arabic OHR data?
- Would you be interested in taking part in a competition?
- How many people in your team work in this area? What are their qualifications?
- What platforms are supported or targeted by your application?
- What market share do you anticipate for your application?
- Would your application support any other languages? Explain.

List of Survey Targets
- Sakhr
- ImagiNet
- RDI
- Orange - Cairo
- IBM - Cairo
- Cairo University
- Ain Shams University
- Arab Academy (AAST)
- AUC
- GUC
- Nile University
- Azhar University
- Helwan University
- Assiut University
- Other research centers from outside Egypt
- Other companies that are users of the technology

12- Key Figures in this Field
- Dr. Alex Graves (TU Munich, Germany)
- Dr. Stephan Knerr (CEO, VisionObjects)
- Dr. Hazem AbdelAzeem (Egypt)