Center for Research in Urdu Language Processing PAN Localization Project A Regional Initiative to Develop Local Language Computing Capacity in Asia ثناء.

Slides:



Advertisements
Similar presentations
Pan African Localization Network
Advertisements

Introduction to Computational Linguistics
Interaction between Academia and Microsoft in Speech and Language Systems Kentaro Toyama Microsoft Research Chandar Sundaram, Andy Abbar, Alex Acero, Mythreyee.
Teaching Courses in Scientific Computing 30 September 2010 Roger Bielefeld Director, Advanced Research Computing.
Tafseer Ahmed Department of Computer Science University of Karachi Urdu on Linux International Support.
Language & Nation. Countries contain linguistic minorities –Where linguistic minorities are large they are more influential –Where linguistic minorities.
Status and Challenges of Local Language Computing and BRAC University’s Initiative Naushad UzZaman Research Programmer Center for Research on Bangla Language.
Languages & The Media, 5 Nov 2004, Berlin 1 New Markets, New Trends The technology side Stelios Piperidis
MULTI LINGUAL ISSUES IN SPEECH SYNTHESIS AND RECOGNITION IN INDIAN LANGUAGES NIXON PATEL Bhrigus Inc Multilingual & International Speech.
Block Community Portals Networking rural communities in the North East National Informatics Centre DIT, MC&IT, GOI.
Speech Translation on a PDA By: Santan Challa Instructor Dr. Christel Kemke.
CSE111: Great Ideas in Computer Science Dr. Carl Alphonce 219 Bell Hall Office hours: M-F 11:00-11:
GSC16-OBS-03 ITU-T GSC – 16 Observer Presentation Karen Higginbottom, JTC 1 Chair.
Spoken Language Systems: The Unfinished Agenda Raj Reddy School of Computer Science Carnegie Mellon University Pittsburgh September 21, 2006 The entire.
HLT Research and Development for Baltic Languages in Tilde Andrejs Vasiļjevs, Raivis Skadiņš Tilde Riga, October 27, 2004.
Internationalization of Java Platform Presenter: Ataru Nakazawa Advisor: Xiaoping Jia Date: January 23, 2004.
ÓC-DAC Noida’2004 Efforts in Language & Speech Technology Natural Language Processing Lab Centre for Development of Advanced Computing (Ministry of Communications.
Text-To-Speech System for Marathi Miss. Deepa V. Kadam Indian Institute of Technology, Bombay.
26 April 2001 Unicode and Windows XP, IUC 18 (Hong Kong) Unicode and Windows XP Cathy Wissink Program Manager, Globalization Windows Division Microsoft.
Information and Communication Technologies in the field of general education in Armenia NATIONAL CENTER OF EDUCATIONAL TECHNOLOGIES.
CAREERS IN LINGUISTICS OUTSIDE OF ACADEMIA CAREERS IN INDUSTRY.
Panel: Importance and role of the NRENs in supporting science Dr. ByungKyu Kim, Executive Officer/TEIN4 Project Manager.
Skills: none Concepts: introduction to and history of speech (with and without text) and music processing, audio file formats, the audio processing workflow,
Enlightening minds. Enriching lives. Tamil Digital Industry Badri Seshadri K.S.Nagarajan New Horizon Media.
Information Society Innovation Fund (ISIF) Grants Program Paul Wilson APNIC 29.
Connecting with South Asian Customers: Developing Cultural Awareness _______________________________ OLA Super Conference – 2008 Session 1003 Sarala Uttangi.
Research Component on Technology Concluding Thoughts Sarmad Hussain Center for Research in Urdu Language Processing National University of Computer and.
1 Computational Linguistics Ling 200 Spring 2006.
1 Dieter Schwela Stockholm Environment Institute, York Presentation at the CAI-Asia Internal Secretariat Coordinating Meeting Bangkok, 10 July 2005.
NLP Related Activities in Thailand Virach Sornlertlamvanich Information Research and Development Division National Electronics and Computer Technology.
Sustainability of the work and PANL10n network: Vision beyond 2010 Regional Conference on Localized ICT Development & Dissemination Across Asia PAN Localization.
Summary Report Survey on Research and Development of Machine Translation in Asian Countries Virach Sornlertlamvanich Information Research and Development.
Licensing and Distribution of Resources and Software PAN L10n Perspective Sarmad Hussain Center for Research in Urdu Language Processing National University.
Module 5 A system where in its parts perform a unified job of receiving inputs, processes the information and transforms the information into a new kind.
Information Society Innovation Fund (ISIF) Grants Program Paul Wilson APNIC 27.
21st September 2004localisation and the digital divide1 and the Development and the Information Society Economic divides Language divides Cultural divides.
LTI Education Committee Report Alon Lavie LTI Retreat March 2, 2012.
Third Conference December, Basm 28 Years >450,000 Terminologies >250 Scientific field.
* Property of STI Page 1 of 18 Software: Systems and Applications Basic Computer Concepts Software  Software: can be divided into:  systems software.
Virtual Platform for Education Cooperation in the Americas Webinar Technical Secretariat of the Inter-American Committee on Education-CIE Department of.
Huda Sarfraz Center for Research in Urdu Language Processing, National University of Computer and Emerging Sciences cases of local language content development.
Introduction to Computing Muhammad Saeed. Topics Course Description Overview of Areas Contact Information.
ADD and SNLP in Thailand Virach Sornlertlamvanich Thai Computational Linguistics Lab. (TCL), NICT Asia Research Center, Thailand
General IT Knowledge Topic: NiDA Presentation by: Eat Sarith.
For Regional Integration ESCWA Information & Communication Technology Division 1 19 December 2006M. Farah 1 FOSS: Needs and Opprtunities Mansour Farah.
Information Assurance – A Technology Transfer Success Story Deidre W. Evans, Edward L. Jones, Christy L. Chatmon Computer and Information Sciences Department.
T h e A A A H Asia-Pacific Action Alliance on Human Resources for Health.
ARTIFICIAL INTELLIGENCE FOR SPEECH RECOGNITION. Introduction What is Speech Recognition?  also known as automatic speech recognition or computer speech.
Role of Policy in Local Language Computing ثناء گل مرکز تحقیقات اردو پاکستان ، ۲۰۰۵ Sana GUL Pakistan, 2005.
Utkal University We Work On Image Processing Speech Processing Knowledge Management.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
PAKISTAN Sana SHAMS.  Technology been the focus: Scope for maturing upon process\methods: ◦ End-User Training, Content development  This year to prototype.
ICT Developments in Lao PDR Mr. Snith XAPHAKDY Director Telecom Division Ministry of Communication, Transport, Post and Construction. Lao P.D.R.
Computational Linguistics Courses Experiment Test.
Cases of Local Language Content Development and Dissemination across Developing Asia: Examples from PAN Laos L10n project By Valaxay DALALOY National Authority.
Hitoshi ISAHARA National Institute of Information and Communications Technology (NICT) Sustainability of the work and PAN L10n network: Vision Beyond 2010.
Basic Element of Electronics Data Processing Hardware Hardware Software Software Networking Networking Person involved in Computer Fields Person involved.
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
Prepared by: Shammur Absar Chowdhury, CRBLP
Knowledge Networking for Rural Development with ENRAP
SPEECH TECHNOLOGY An Overview Gopala Krishna. A
Gender Affairs Programme
Objectives and Plan of Action
Introduction to Computer Science
HLT Research and Development for Baltic Languages in Tilde
Sana Shams PAN Localization project
Technology Development
Natural Language Processing
Sana Shams PAN Localization project
Sarmad Hussain Internationalized Domain Names (IDN) Programs Director
Presentation transcript:

Center for Research in Urdu Language Processing PAN Localization Project A Regional Initiative to Develop Local Language Computing Capacity in Asia ثناء گل مرکز تحقیقات اردو پاکستان ، ۲۰۰۵ SANA GUL Center for Research in Urdu Language Processing Pakistan, 2005

Center for Research in Urdu Language Processing Introduction to Center for Research in Urdu Language Processing Introduction to PAN Localization Project Scope of Localization & Introduction to this Training Presentation Highlights

Center for Research in Urdu Language Processing

CRULP Objectives ► To conduct linguistic research for Urdu and regional languages ► To participate in standardization efforts in Urdu and regional languages ► To evolve computational models of Urdu and regional languages ► Promote content development in Urdu and regional languages

Center for Research in Urdu Language Processing CRULP Research ► Linguistics ► Script Processing ► Language Processing ► Speech Processing

Center for Research in Urdu Language Processing CRULP Resources ► Team  4 Full-time Faculty Members  Adjunct Faculty  12 Graduate Students  45 Undergraduate Students  25 Full-time staff

Center for Research in Urdu Language Processing CRULP Coursework ► Phonetics and Phonology ► Morphology and Syntax ► Digital Signal Processing ► Random Variables and Stochastic Processes ► Speech Processing ► Computational Linguistics ► Image Processing ► Calligraphy and Font Development

Center for Research in Urdu Language Processing CRULP Research - Linguistics ► Areas  Acoustic Phonetics  Phonology  Morphology  Syntax

Center for Research in Urdu Language Processing CRULP Research - Script ► Font Development: Nafees Font Family  Nafees Nasta’leeq, Nafees Naskh, Nafees Pakistani Naskh (Urdu, Punjabi, Pashto, Sindhi, Balochi, Siraiki)  Freely downloadable from  Supported mainly by UNDP/IDRC/APNIC Small Grants Program and partially by Microsoft, Pakistan ► Optical Character Recognition  Naskh (segmentation based)  Nasta’leeq (Ligature based)

Center for Research in Urdu Language Processing

Nasta ’ leeq Kufi Sulus Diwani Riqa Naskh وَ سخرَالشَمسَ وَالقمرَ

Center for Research in Urdu Language Processing CRULP Research - Language ► Corpus Development ► Computational Linguistic Applications  Spell Checker  Grammar Checker  Lexicon  English to Urdu Machine Translation

Center for Research in Urdu Language Processing CRULP Research - Speech ► Text to Speech Synthesis ► Automatic Speech Recognition

Center for Research in Urdu Language Processing Projects ► Nafees Font Family ► Urdu Localization Project ► Microsoft Spell Checker ► PAN Localization

Center for Research in Urdu Language Processing PAN Localization Project

Center for Research in Urdu Language Processing PAN Localization Project Partnership  PAN program of IDRC  CRULP at NUCES Objectives  Develop localization technology for Asian languages  Develop human resource to develop and use localized computing  Research into policy framework to develop local language computing Timelines  January 2004 till December 2006

Center for Research in Urdu Language Processing PAN L10n Project Collaborations 1. BRAC University, Bangladesh 2. Department of IT, Ministry of Information and Communications, Bhutan 3. Khmer Computerization Committee, National ICT Development Agency, Cambodia 4. Science Technology and Environment Agency, Laos 5. Madan Puraskar Pustakalaya & Tribhuvan University Nepal 6. University of Colombo School of Computing, Sri Lanka 7. …

Center for Research in Urdu Language Processing Salient PAN L10n Project Outputs Localization Technology Asian Localization Peer Support Network Bibliography of Asian Localization Who’s Who of Asian Localization Multi-lingual Website: Asian Localization Handbook

Center for Research in Urdu Language Processing Country-wise Project Outputs

Center for Research in Urdu Language Processing Scope of Localization

Center for Research in Urdu Language Processing Localization “enabling computing experience according to linguistic culture of the user”

Center for Research in Urdu Language Processing Localization Requirements Standards Basic Applications Intermediate Applications Advanced Applications Soft Issues

Center for Research in Urdu Language Processing Standards Character Set Keyboard/Keypad layout Locale Collation Sequence Terminology Translation Fonts (?) …

Center for Research in Urdu Language Processing Basic Applications Character set encoding(s) Utility for converting among various encodings Keyboard/Keypad drivers Collation algorithm Local language interface Fonts for various devices …

Center for Research in Urdu Language Processing Intermediate Applications Find/Replace utility Natural language processor/Bidirectional processor Lexicon Spell checker …

Center for Research in Urdu Language Processing Advanced Applications Grammar checker Automatic speech recognition Text to speech system Automatic machine translation Optical character recognition Handwriting recognition Speech to speech translation …

Center for Research in Urdu Language Processing Introduction to Training Objectives  Overview scope of localization  Study in detail basic issues regarding localization standards and development  Develop Asian peer support network

Center for Research in Urdu Language Processing Summary of Topics Encoding Standards Font Development Localization on Microsoft Platform Localization on Linux Platform Defining Normalization and Collation Overview Advanced Applications Overview Software Engineering

Center for Research in Urdu Language Processing شکر یہ SANA GUL Regional Research Officer PAN Localization project (