Download presentation
Presentation is loading. Please wait.
Published byMervyn Henry Modified over 8 years ago
1
PRESENTED BY : TAPAN MAJHI (U07CO277) PRASHANT LOHIA (U07CO280)
2
A website is a collection of related web pages, images, videos or other digital assets that are addressed relative to a common Uniform Resource Locator (URL)web pagesimagesvideosUniform Resource Locator
3
A Multilingual Website is the one which is represented in more than one language.
4
Extremely effective marketing tool. Direct communication with the target audience in your own language Creates more awareness of the website.
5
Agreement regarding the meaning of words. Example: “Please put the trunk in the boot”
6
Agreement on terminology particularly for words of a technical nature and agreed actions regarding slang and abbreviation. Example: In English ‘No.’ and ‘#’ may refer to Number but not in any other languages
7
Phrases and meaning which cannot be translated. some words have no equivalent in other languages Example : in japanese “itadakimasu” (ee-tah-dak–kee-masu), an expression of gratitude. Which means “thanks for the meal”
8
Direction of text Example English is written horizontally from left to right Arabic is written horizontally from right to left Chinese is written vertically from flowing right to left
9
Formats of such things as date time and names Example In britain and australia date format is day/month/year In america its month/day/year Where as in chinese its year/month/day
10
Spelling convention Example Britain and australia spells ‘colour’ while in american it spelt ‘color’
11
Considering the web presentation of an organization, Three broad types of websites have been found 1. Single home sites 2. Multi-home sites 3. Separate sites
12
Small parts translated into another language. Translate only a limited amount of text into another language. Some company uses third party services also E.g Babel fish( altavista ) to automatically translate block or entire pages of text.
14
Located under a single domain name. With a splash page presenting a choice of language. Each language sub-sites has the same layout and design.
17
Some websites merely present a gateway to generic content, rather than a customized presentation
20
I18n is planning and implementing products and services so that they can easily be localized for specific languages and cultures
21
Creating illustrations for documents in which the text can easily be changed to another language and allowing expansion room for this purpose Allowing space in user interfaces (for example, hardware labels, help pages, and online menus) for translation into languages that require more space Creating print or web site graphic images so that their text labels can be translated inexpensively
22
Leaving enough space in a brochure to drop in different length languages Separating the language elements from the graphic elements, or abstracting content from markup in a web application and software Using written examples that have global meaning
23
Insuring that the tools and product can support international character sets For software, ensuring data space so that messages can be translated from languages with single-byte character codes (such as English) into languages requiring multiple- byte character codes (such as Japanese Kanji)
24
One Universal Code for every character no matter what the platform, no matter what the program, no matter what the language. Unicode is not just a bunch of code points Initially it was a 2 byte code, that can support over 65,000 characters Unicode Standard, Version 4.0 provides codes for 96,447 characters Adopted by ISO as ISO 10464
25
Character HTML char ref Unicode name What your browser displays U+0041A or A LatinLatin capital letter A A LatinAA U+00DFß or ß Latin small letter Sharp S Sharp SSharp Sß U+00FEþ or þ Latin small letter Thorn Thorn þ U+0394Δ or Δ GreekGreek capital letter Delta Delta GreekDeltaΔ U+017DŽ or Ž Latin capital letter Z with caron (used in Central Europe) Ž U+0419Й or Й CyrillicCyrillic capital letter Short I Short I CyrillicShort IЙ U+05E7ק or ק HebrewHebrew letter Qof Qof HebrewQofק U+0645م or م ArabicArabic letter Meem Meem ArabicMeemم U+0E57๗ or ๗ ThaiThai digit 7 digit7 Thaidigit7๗ U+1250ቐ or ቐ Ge'ezGe'ez syllable Qha Qha Ge'ezQhaU+3042あ or あ HiraganaHiragana letter A (Japanese) Hiraganaあ U+53F6叶 or 叶 CJK Unified IdeographCJK Unified Ideograph-53F6 (Simplified Chinese "Leaf") Simplified Chinese CJK Unified IdeographSimplified Chinese叶 U+8449葉 or 葉 CJK Unified IdeographCJK Unified Ideograph-8449 (Traditional Chinese "Leaf") Traditional Chinese CJK Unified IdeographTraditional Chinese葉 U+B5AB떫 or 떫 HangulHangul syllable Tteolp (Korean "Ssangtikeut Eo Rieulbieup") syllable Hangulsyllable떫 U+16A0ᚠ or ᚠ RunicRunic letter Fehu Fehu RunicFehu To display all of the characters above, you may need to install one or more large multilingual fonts, like Code2000. Code2000
26
Universality Efficiency Characters, not glyphs Semantics Plain text Logical order Unification Convertibility Accurate
27
European scripts Latin, Greek, Cyrillic, Armenian, Georgian, IPA Bidirectional (Middle Eastern) scripts Hebrew, Arabic, Syriac, Thaana Indic (Indian and Southeast Asian) scripts Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Tamil, zTelugu, Kannada, Malayalam, Sinhala, Thai, Lao, Khmer, Myanmar, Tibetan, Philippine East Asian scripts Chinese (Han) characters, Japanese (Hiragana and Katakana), Korean (Hangul), Yi Other modern scripts Mongolian, Ethiopic, Cherokee, Canadian Aboriginal Historical scripts Runic, Ogham, Old Italic, Gothic, Deseret Punctuation and symbols Numerals, math symbols, scientific symbols, arrows, blocks, geometric shapes, Braille, musical notation, etc.
28
A glyph is an element of writing. More precisely, a glyph is an individual mark on a written medium that contributes to the meaning of what is written. For example: ’ I ’ & ‘ i ’ glyphs in Turkish represent different meaning(where as in english it means same).
29
Its is a plain text Its is a formatted text
30
Unicode standard avoid duplicate encoding of characters within Scripts across languages. In Chinese, Japanese and Korean many ideographs are common. The character code U+0057 "Y" is same in English, German and French
31
The Unicode standard includes an extensive database that specifies a large number of character properties, including: Name Type (e.g., letter, digit, punctuation mark) Decomposition Case and case mappings (for cased letters) Numeric value (for digits and numerals) Combining class (for combining characters) Directionality Line-breaking behavior Cursive joining behavior For Chinese characters, mappings to various other standards and many other properties
32
Step-1 Get the languages of the page Every time a page is loaded the script will detect what languages are used on the page by recursively detecting all the elements within the DOM-tree that are equipped with a lang attribute and build a list of all the different lang values that are used on the page.
33
Step 2: Get information from cookie Each time the select box is set to a another value, this value will be stored in a cookie on the visitor's system.
34
If a cookie is available, the script will look whether this preferred language is available on the page. If so, the select box will be set to that language and all other languages will be filtered out of the DOM-tree.
35
Step 3: Sniffing the client's preferred language Browsers declare the user's language first and second the preferred local dialect using iso 639 codes of 2 or 3 digits each.iso 639 E.g.: eng-“English” & GU-"Gujarati"
36
Next the script will check if the browser language is available on the page. If it is, then the select box will be set to that language and all other languages will be filtered out of the DOM-tree.
37
Step 4: Get first language used on page. If the browser language is not available on the page, then the script will make a decision of it's own, based on the value of the very first lang attribute that's found on the page.
38
All the actual filtering takes place within the client. Advantages : 1. Live switching 2. Search engine friendliness 3. No need for complex server side technology.
39
The application of Unicode permits you to mix every possible character system in one page. Unicode Server side filtering can also be done.
40
For the preparation of this presentation help from the following sources was taken. http://www.cornae.org/ http://www.cornae.org/ http://adelaide.academia.edu/documents/00 77/9020/ECRA-Hillier-2003.pdf http://adelaide.academia.edu/documents/00 77/9020/ECRA-Hillier-2003.pdf http://www.wikipedia.org/ http://www.wikipedia.org/ http://www.google.com/ http://www.google.com/
41
Thank You
42
function translator(pattern) { Var dest= 'http://www.google.com/translate?u=http://sitepoint.com&langpair=' + pattern + '&hl=en&ie=UTF8'; { window.open(dest,'subwindow','toolbar=yes,lo cation=yes, directories=yes,status=yes,scrollbars=yes,menubar= yes,resizable=yes,left=0,top=0'); } }
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.