A Multilingual Internet for South Asia

Slides:



Advertisements
Similar presentations
REGIONAL NEWS CHANNELS
Advertisements

By : Swaran Lata Country Manager,W3C India Office 6,CGO complex, Electronics Niketan New Delhi
Text #ICANN50. Text #ICANN50 IDN Variant TLD Program GNSO Update Saturday 21 June 2014.
Books and Bibliographic Information In and Out of South Asia Ramesh K. Mittal Director D.K. Agencies (P) Ltd. International Booksellers, Publishers & Subscription.
Ideal Lingua Translations Ideal Lingua Translations is a leading Translation Services Provider which offers:  Highest Quality Language Solutions 
CALTS, UNIV. OF HYDERABAD. SAP, LANGUAGE TECHNOLOGY CALTS has been in NLP for over a decade. It has participated in the following major projects: 1. NLP-TTP,
Knowledge Sharing Platform Empowering Communities through regional Content and Services C. Kathiresan C-DAC, Hyderabad, India Session V : e-Content & ICT.
Media: Text “Words and symbols in any form, spoken or written, are the most common system of communication.” ~ unknown.
Languages of Asia Part 2: South Asia
Language, Ethnicity, and Disparities in Contemporary India
1 Updated as of 1 July 2014 Issues of the day at ICANN Internationalized Domain Names (IDNs) KISA-ICANN Language Localisation Project Module 2.3.
IDN & EAI in India: Universal Acceptance of IDN Important Note: This document was prepared for a private audience. Copyrights for images included.
Unicode & W3C Jataayu Software C. Kumar January 2007.
Internationalized Domain Names (IDNs) Yale A2K2 Conference New Haven, USA April 27, 2007 Ram Mohan Building a Sustainable Framework.
CcTLD IDN TF Report ccTLD Meeting, Rio de Janero Mar. 25, 2003 Young-Eum Chair, ccTLD IDN TF.
1 CS 502: Computing Methods for Digital Libraries Lecture 4 Text.
IFRE: Delhi, India Nandini Raghuraman MS-2. India  Largest democracy in the world  Population: 1,147,995,898 (July, 2008)  Life expectancy at birth:
Modular InfoTech’s Modular Infotech is proud to offer Tools and Components enabled with Indian language so as to address each & every client located across.
India Jai Hind!. Cuisine Places Culture Languages Dresses Traditions.
Building digital libraries in Indian languages: case studies with Hindi and Kannada B.S. Shivaram Trainee ( ) National Center for Science Information.
Let ’ s GeoParty! Stump Trivia Edition India: Test #3.
Internationalized Domain Names Dr. Cary Karp MUSENIC Project Manager Second MUSENIC Project Workshop Stockholm, March 2004 MUSENIC – The Museum Network.
21st September 2004localisation and the digital divide1 and the Development and the Information Society Economic divides Language divides Cultural divides.
National Internet Exchange of India.IN Registry Operation Update.
Issues in IDN APTLD Meeting in Taipai Feb. 24, 2003 Young-Eum Lee.
India. Homework. Due next lesson. Complete your presentation. If you choose to do a PowerPoint it should be e mailed to your Geography teacher by 08:00.
UNICODE & Indic Scripts
Legal Informatics & E-Governance as tools for the Knowledge Society LEFIS Seminar, Reykjavik (Iceland), July 12-13, 2007 Oleksandr Pastukhov MPhil (Koretsky.
An ISO 9001:2008 Company With all the tools you need to compute in Indian Languages.
© 2015 albert-learning.com Indian languages Indian Languages.
Interesting Facts about. In India there are 17 major languages and 844 other spoken language. Some of them are Telugu, Hindi, Tamil, Gujarati, Malayalam.
ALR 2013 Some observations Pushpak Bhattacharyya, ALR Chair.
Securing Future Growth: Getting Ready for IPv6 NOW! ccTLD Workshop, 8 th April 2011 Noumea, New Caledonia Miwa Fujii, Senior IPv6 Program Specialist, APNIC.
ICANN Regional Outreach Meeting, Dubai 1–3 April Toward a Global Internet Paul Twomey President and CEO 1 April 2008 ICANN Regional Meeting 1–3.
Mauryan Dynasty Drove out foreigners made capital in Ganges Valley
Bal Mukunda Bhandari Tribhuvan University Kathmandu, Nepal.
India “We can do no great things; only small things with great love.” -Mother Teresa of Calcutta.
Topic: Classification of World Languages
INDIA - Where is it? - It is in Asia.. What is the population? What is the population? ► -There are people.
INDIA’S APPEARANCE TILL 59 th INDEPEDANCE DAY 1906.
Crescendo Transcriptions Pvt. Ltd. Translation Manual.
The next 10 years of web globalization John Yunker Byte Level Research.
Introduction to the Internet
SSLTC EDUCATIONAL ACADEMY PRIVATE LIMITED
Chapter 5: Languages.
Kit Purpose To raise awareness about multilingual education by providing insights into the value of Mother Tongue-Based Multilingual Education which respects.
Language-in-education policies in Southeast Asia
Director of Data Communications
Instructor Materials Chapter 5 Providing Network Services
Language-in-education policies in Southeast Asia
MULTILINGUALISM ON THE INTERNET
We Translate… You Market!!
Presented by Nalini Sankar
IDN Variant TLDs Program Update
Multi-stakeholder Governance
Testing Challenges in Indic Languages
Text.
Mauryan Dynasty Ashoka ( B.C.)
Global History and Geography / 10th Grade Boys and Girls Br. Siraj
Unit 1.4 Wired and Wireless Networks Lesson 3
Evolution, Need, Role and Relevance
India Geography and Languages
Lets Look at UASG / IDN / EAI
DECEMBER, 18th, 1973 UN Arabic Language Day is observed annually on December 18. The event was established by the UN  (UNESCO) in 2010 seeking "to celebrate multilingualism.
India Geography and Languages
India Geography and Languages
Indian Languages Market: The Complex Script
Internet and the world wide web (www)
Sarmad Hussain Internationalized Domain Names (IDN) Programs Director
Satellite Alert Channel
Presentation transcript:

A Multilingual Internet for South Asia 24 May, 2017 The Radisson Hotel, Kathmandu The Internet Corporation for Assigned Names and Numbers  An ICANN Event A Multilingual Internet for South Asia Udaya Narayana Singh E-mail: unsciil@yahoo.com Chair-Professor, Amity Centre for Linguistic Studies (ACLiS), Amity University Haryana Pachgaon, Gurgaon (India) Neo-Brahmi Panel

The Internet Revolution Managing the World of Web We’re aware how ICANN makes our life simpler on the web where machines are known by a set of numbers? From the World Wide Web in 1989, there have been many milestones – DARPA in 1969 to Electronic Mails ‘71 to the Protocols (TCP/IP) in ‘74, and launch of Netscape & Yahoo in 1994? IE (’95) & Hotmail (‘96), & Google (‘98) changed the game. The Internet Revolution Managing the World of Web Community efforts like Wikipedia (2001) and Social networking sites Facebook (2004) & Twitter (2006) have changed our daily lives. We are here to discuss how ICANN, in its 10th year (2018) could emerge as managing a truly multilingual and multi-scriptal web which will mean even larger traffic flow.

Next Big Net Revolution What will the next big revolution? With 277 million Internet users or 26% population in India and 17.7% in Nepal, Devanagari-based domain names will open a multilingual gateway ensuring a mass participation of users on the web. We are receiving information on how other languages/scripts are advancing in a big way – from Arabic to Chinese. Ideally, South Asia must not lag behind. These web-based services based on our writing systems will alter the ‘Big Picture’ in education, healthcare, finance and services sectors. Since most of our scripts have emerged from Brāhmī, the Generation Panel is rightly called the Neo- Brāhmī Panel. A 4th-5th Century Pallava Inscription from Bujong Valley, Malaysia – showing how Brāhmī had spread far and wide

SPREAD OF BRAHMI IN ASIA

Brāhmī had an Early Divergence Source: Peter T. Daniels & William Bright (1996) The World's Writing Systems, OUP.

Cf. Mahesh Kulkarni’s 2008-Report on ‘Development of Open Type Fonts’ C-DAC Complications Each one of writing system has numerous difficulties because of the way their syllabic structures are captured by the graphics – protruding below, above, and on the aisles as well as through hanging letters. Script Grammar for Indic systems must capture their aesthetics. Only Sindhi uses system – Naskh (linear-seamless) but Kashmiri & Urdu use Nastaliq (Right-to-left plus Top-to-bottom): Nastaaliq Naskh

The big picture of languages is complicated in India

PLOTTING THE 123-ODD LANGUAGES OF NEPAL

The Big Picture Four Major Language Families & a few minor ones India has 22 Constitutional languages, 29 spoken by million plus (E.g Bhili, Gondi, Khasi, Tulu, Mundari, Ho etc), 24 Akademi awards 35 of them publish 3954 newspapers as early as in 1971 122 big languages (10k+ speakers) & 146 in radio networks. Nepal too has many large languages with Nepali as official lg The Big Picture ROM SAN MAN Four Major Language Families & a few minor ones 14 major writing systems in use but 66 scripts in all. 97.7% speak only 20-odd Indo-Aryan & Dravidian languages 2.32% of Indians speak abt 97% of other languages 1576 rationalized mother-tongues & 1796 not yet classified

Multi-scriptality - a reality! One Script for Many Languages Devanagari is used by official languages : Hindi, Marathi, Maithili, Konkani, Nepali, Sanskrit, Sindhi, Bodo, Dogri and Santhali. Many Sindhi & Kashmiri also use Devanagari. The Eastern Brahmi Base (close to Bengali) is used by Assamese, Bangla, & Manipuri. Modi was used primarily by Marathi & Gujarati until 19th century and also by other language speakers. Many Scripts for One Language In contrast, there could also be languages like Konkani written in Roman, Devanagari, Kannada, and Malayalam scripts. Earlier Santali written in Roman, Ol Chiki, Oriya and Bangla. Or, the Bodo used Roman, Devanagari, & Assamese. Modi Script sample

Scripts that Neo-Brahmi Generation Panel is working upon ISO 15924 Code Language ISO 639-3 Code Bengali Beng Assamese – অসমীয়া Asm Bengali – বাংলা Ben Manipuri – মনিপুরি Mni Devanāgarī Deva Bodo – बड़ो Brx Dogri – डोगरी Dgo Hindi – हिन्दी Hin Kashmiri – कॉशुर, Kạ̄šur, Koshur Kas Konkani – कोंकणी, Knn Maithili – मैथिली, মৈথিলী Mai Marathi – मराठी Mar Nepali – नेपाली Npi Sanskrit – संस्कृतम्, संस्कृतावाक् San Santali/Santhali – संथाली Sat Sindhi – सिंधी Snd Gujarati Gujr Gujarati – ગુજરાતી Guj Gurumukhi Guru Punjabi – ਪੰਜਾਬੀ Pan Kannada Knda Kannada – ಕನ್ನಡ Kan Malayalam Mlym Malayalam – മലയാളം Mal Oriya(Odia) Orya Odia – ଓଡ଼ିଆ Ory Tamil Taml Tamil – தமிழ், Tamizh Tam Telugu Telu Telugu – తెలుగు Tel Scripts that Neo-Brahmi Generation Panel is working upon

Internationalized Domain Names (IDN) An IDN could have letters with diacritics, as in German/French. One could choose characters from the Unicode repertoire for registration of IDNs. We are a part of the six ICANN groups working for Arabic, Chinese, Cyrillic, Greek, Latin & Devanagari - and we need to take decisions that are of long-term implications. Since DNS understands only Latin characters, hyphens and digits, the IDN is converted to Punycode (sequence of Latin characters, digits and hyphen) before the DNS processing. Let us therefore list out our challenges and concerns here in this meeting. Problems come because the visual variations possible are enormous. Even in Latin, a phished Website – paypa1.com could keep popping up for paypal.com This will be like www.मुद्रा.भारत being phished as www. मुद्गा.भारत So we will have to take all precautions.

Challenges Due To Variations & Complications – e.g. অংক vs অঙ্ক vs অঙক The .भारत Initiative The Government of India had already taken a step forward by acquiring an ccTLD IDN .bharat to make it part of government’s Digital India project. Dr Data will speak more about it An ideal IDN policy would spell out how conveniently an IDN second level name in local languages could be registered and maintained in a hassle free manner – hence the involvement of authorities such as NIXI. That is why the Guiding Principles of ICANN that Dr Sarmad has mentioned There are still many challenges to meet; e.g. Challenges Due To Variations & Complications – e.g. অংক vs অঙ্ক vs অঙক এত [æt̪o] "so much", এ্যাকাডেমী [ækademi] "academy", অ্যামিবা [æmiba] "amoeba" The hardware-software requirements, security issues, 100% uptime reliable services, maintenance of interoperable technical standards, and making it scalable and market-friendly

The Future of Multilingual Web in South Asia Hindi (422m), Bengali (83m), Telugu (75m), Marathi (71m), Tamil (60m), and Nepali (12 m) etc being the most spoken tongues in South Asia, and proficiency in English being negligible, a Web that is only English-based will always be a remote dream. The elites may prefer English, but if more than half of our so many million netizens cannot read the info on the medicines on the web, or cannot see the warnings on the net, or can’t understand government documents, or bank transaction menus, these services would not be reaching them – leaving a large gap. The IDN-enablement will lead to an explosion of Indigeneous languages content. Our research, our creative texts, and our products would then be original and not derivative.  

There are obviously many more issues that need to be resolved when all Brāhmī-based Scripts are to be used for Internationalized Domain Name Registration. Thank you