Unicode from a distance…

Presentation transcript:

Unicode from a distance… Mark Davis Chief Software Globalization Architect, IBM President, Unicode Consortium

Starting back a bit before Unicode…

1850: Where? When?
Longitude non-standard: Paris meridian, Greenwich meridian, Berlin meridian.
Time non-standard: 7:16 Boston, 6:52 DC, 4:06 LA, 3:51 SF.
That had to change…
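The clock readings on this slide are consistent with local mean solar time, which shifts by 4 minutes per degree of longitude (24 hours spread over 360 degrees). A minimal sketch of that arithmetic follows; the longitudes are approximate values assumed for illustration and are not taken from the slides, and the function name is hypothetical.

```python
# Approximate longitudes in degrees (west is negative).
# These are assumed, rounded values for illustration only.
LONGITUDE_DEG = {
    "Boston": -71.06,
    "Washington DC": -77.04,
    "Los Angeles": -118.24,
    "San Francisco": -122.42,
}

MINUTES_PER_DEGREE = 4.0  # 24 h * 60 min / 360 degrees


def local_time_offset_minutes(city_a: str, city_b: str) -> float:
    """Minutes by which city_a's local mean solar time is ahead of city_b's."""
    return (LONGITUDE_DEG[city_a] - LONGITUDE_DEG[city_b]) * MINUTES_PER_DEGREE


if __name__ == "__main__":
    for city in ("Washington DC", "Los Angeles", "San Francisco"):
        minutes = local_time_offset_minutes("Boston", city)
        print(f"Boston is about {minutes:.0f} min ahead of {city}")
```

Under these assumed longitudes, Boston comes out roughly 24 minutes ahead of Washington, which matches the 7:16 versus 6:52 readings above.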

That had to change…
Telegraph → exact longitudes
Railway → time zones
Shipping → Prime Meridian
Washington, 1884
France delays until 1914…

Uniformity Winning
Of course, the French gave us all the metric system.
Portuguese mile, Roman mile, Hamburg mile, US mile
But we didn’t get metric time. Still Babylonian…
Why one and not the other?

Fast forward a few years

1985: Characters not Standardized – Data Exchange Limited
✗ Vladimir Jelicačačić, Игорь Лукашев, 徐順宏, ก๊กเฮงแซ่แต้, Bjørn Vestergård

That had to change…

No longer data “islands”
Customers could be from any country.
Companies have heterogeneous systems.
People can’t tolerate it when text is lost or corrupted in transmission, or when lookups fail.
English / European languages are only part of the world market…

GDP-PPP – 1975..2002

GDP-PPP – 2003..2010

Silicon Valley, 1991 - Unicode
Vladimir Jelicačačić, Игорь Лукашев, 徐順宏, ก๊กเฮงแซ่แต้, Bjørn Vestergård
The Unicode Standard provides:
a unique code for every character in the world;
a model and architecture for every script;
properties and behavior, isolating programmers from details.
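To make "a unique code for every character" and "properties" concrete, here is a small sketch using Python's standard `unicodedata` module. The helper name `describe` is hypothetical, and the property values returned depend on the Unicode version bundled with the interpreter.

```python
import unicodedata


def describe(text: str) -> list[tuple[str, str, str]]:
    """Return (code point, character name, general category) for each character."""
    return [
        (
            f"U+{ord(ch):04X}",                  # the character's unique code point
            unicodedata.name(ch, "<unnamed>"),   # its formal name, if it has one
            unicodedata.category(ch),            # a property, e.g. Lu = uppercase letter
        )
        for ch in text
    ]


if __name__ == "__main__":
    # Sample characters drawn from the names on the slide.
    for entry in describe("Иø徐"):
        print(entry)
```

The same lookup works for Cyrillic, Latin, and Han characters alike, which is the point of the slide: code that asks for properties does not need per-script special cases.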

2004 – Unicode, the “Prime Meridian” of computing
96,000+ characters (V4.0)
Wide-ranging specifications for uniform cross-product behavior
Used:
in every major operating system;
in all major office software;
as the core definition of text in XML, HTML, …;
as the core of Java, C#, C (with ICU), …

Website Globalization
Websites present both static and composed data, the latter frequently backed by one or more databases.
Unicode makes the entire architecture vastly simpler, from back-end databases to the pages served to the client.
People used to convert to legacy character sets on output; this is less often needed now, except in special circumstances.
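As a rough illustration of the difference between serving Unicode and converting to a legacy set on output, the sketch below encodes page text with Python's built-in `str.encode`. The function name `encode_for_output` is hypothetical, and falling back to numeric character references is an assumption that only makes sense for HTML or XML output.

```python
def encode_for_output(text: str, charset: str = "utf-8") -> bytes:
    """Encode page text for the wire.

    UTF-8 can represent every character. For a legacy charset, characters
    outside its repertoire are written as numeric character references
    (assumed acceptable here because the output is HTML/XML).
    """
    return text.encode(charset, errors="xmlcharrefreplace")


if __name__ == "__main__":
    names = ["Bjørn Vestergård", "Игорь Лукашев", "徐順宏"]
    for name in names:
        utf8 = encode_for_output(name)                  # lossless for every name
        legacy = encode_for_output(name, "iso-8859-1")  # only Latin-1 survives as-is
        print(name, len(utf8), legacy)
```

With UTF-8 end to end, the fallback path is never exercised, which is one sense in which the architecture becomes simpler.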

Unicode Consortium
Development of Key SW Globalization Standards
Unicode Standard
Other Specs: Sorting, Int’l Regular Expressions, Matching (case-insensitive), Line-breaking, Identifiers, …
New Projects: Common Locale Data Repository. Uniform date/time/number formatting, sorting, … across programs/platforms
Open to new Members: Corporate, Associate, Specialist
http://www.unicode.org/consortium/why_join.html
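Case-insensitive matching is one of the behaviors listed above. A minimal sketch with the Python standard library follows: it applies case folding between two rounds of canonical decomposition, which follows the Unicode Standard's definition of canonical caseless matching as far as I know. The names `caseless_key` and `caseless_equal` are hypothetical, and locale-specific tailorings (such as Turkish dotted and dotless i) are not handled.

```python
import unicodedata


def caseless_key(s: str) -> str:
    """Build a comparison key: decompose, case-fold, then decompose again."""
    decomposed = unicodedata.normalize("NFD", s)
    return unicodedata.normalize("NFD", decomposed.casefold())


def caseless_equal(a: str, b: str) -> bool:
    """True if a and b match when case and canonical-equivalence differences are ignored."""
    return caseless_key(a) == caseless_key(b)


if __name__ == "__main__":
    # Precomposed å versus a + combining ring above, in different case.
    print(caseless_equal("Vestergård", "VESTERGA\u030aRD"))
    # Full case folding expands ß to ss.
    print(caseless_equal("Straße", "STRASSE"))
```

A plain `lower()` comparison would miss both of these cases, which is why lookups that "fail" on equivalent text were a concern in the earlier slides.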

References
ICU
Longitude
The Unicode Standard
UTN #13: GDP by Language
Einstein’s Clocks, Poincaré’s Maps
More about Unicode: March 31 - April 2!