Essential Skills for Computing Fonts


Essential Skills for Computing Fonts André Schappo A.Schappo@lboro.ac.uk

Aims
To gain some understanding of:
- Fonts
- The extensive range of characters available
- Unicode
At the end of this lecture students should be able to:
- Create a simple multi-language web page

What is this? 好

Could be described as: picture, drawing, Chinese, Japanese, script, letter, character, glyph, symbol
Terminology depends on context/discipline

Computer Font
A (Computer) Font:
- is a collection/grouping of characters (Character Set)
- its characters are (often) related
- each character has an associated glyph

A (Computer) Font character:
- is more a concept than an actuality
- a character has a consensus-agreed label/function/meaning
- a character can be in many fonts
- “Exclamation Mark” is a character
- “Exclamation Mark” is a punctuation character
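
As a rough illustration of the idea that a character is a labelled concept rather than a shape, the sketch below asks Python's standard `unicodedata` module for the agreed name and general category of a character. The helper name `describe` is made up for this example and is not part of the lecture; the names come from the Unicode character database.

```python
import unicodedata


def describe(ch: str) -> tuple:
    """Return (code point label, agreed name, general category) for one character.

    `describe` is a hypothetical helper written for this illustration.
    """
    return (f"U+{ord(ch):04X}", unicodedata.name(ch), unicodedata.category(ch))


if __name__ == "__main__":
    # "!" is the Exclamation Mark character; category "Po" means
    # "Punctuation, other", which matches "is a punctuation character".
    print(describe("!"))   # ('U+0021', 'EXCLAMATION MARK', 'Po')
    print(describe("好"))
```

None of this depends on which font is installed: the label and category belong to the character, and the glyph is supplied separately by whatever font displays it.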

Glyphs
- a glyph is the visual representation/manifestation of a character
- the “!” is the glyph for the “Exclamation Mark” character
- the glyphs of a font have a style (as designed by the Font Author)
- additional transformation (styling) can be applied by using applications (e.g. GIMP)

Font Naming
The character set and/or style is (sometimes) indicated by the font name, e.g.:
- Apple Braille
- Apple Symbol
- Hiragino Kaku Gothic Pro (ヒラギノ角ゴ)
- Edwardian Script ITC
Otherwise it is “Look and See”
⾒ Use “Font Book” in Mac OS X to look at these fonts ⾒

One Character, Many Fonts
The character 好 is in fonts:
- GungSeo
- Apple LiGothic
- Hiragino Kaku
- Osaka
- Arial Unicode MS
- .... and more .....
⾒ Use “Character Palette” in Mac OS X to see which fonts contain a character and to see the different styles. ⾒

Problem
- There are many fonts, each with its own character set and glyphs
- What coverage of characters does a computer have? It will certainly not be complete
Problem
- Automatic Font Substitution, i.e. which font to choose for a character

The Ideal
A “Universal Character Set” that encompasses all characters, including:
- all natural languages
- all symbols
- Braille
- International Phonetic Alphabet (IPA)
- ...etc...
A font implementation of a “Universal Character Set”

Available Character Sets
There are many Character Sets, each with different character coverage:
- ⾒ ISO-8859-1 ⾒ (Western European)
- ⾒ ISO-2022-JP ⾒ (Japanese)
- ⾒ GB2312 ⾒ (Simplified Chinese)
- ...etc...
There are mappings between each Character Set and the font characters
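
The difference in coverage between these character sets can be checked directly. The sketch below is an illustration only: `can_encode` is a hypothetical helper, and it relies on the codecs that ship with Python's standard library, so the results reflect Python's mapping tables rather than any particular font or operating system.

```python
def can_encode(text: str, charset: str) -> bool:
    """Report whether every character in `text` exists in `charset`.

    Hypothetical helper for this illustration: encoding raises
    UnicodeEncodeError when a character is outside the set's coverage.
    """
    try:
        text.encode(charset)
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # The three character sets named on the slide.
    for charset in ("iso-8859-1", "iso-2022-jp", "gb2312"):
        print(charset, can_encode("好", charset))
```

ISO-8859-1 has no slot for 好, whereas the Japanese and Simplified Chinese sets both include it.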

Unicode is a character set that has extensive coverage, including:
- Basic Latin
- Hebrew
- Tibetan
- Mathematical Operators
- Geometric Shapes
- Hiragana
- Unihan (hanzi, hanja, kanji)
- Musical Symbols
- Braille
- and much much more

Unicode Info
⾒ Use “Character Palette” in Mac OS X to see the Unicode character coverage ⾒
⾒ http://en.wikipedia.org/wiki/Unicode ⾒
⾒ http://www.unicode.org/ ⾒

Unicode Fonts
Currently there is NO font that contains all the Unicode characters.
Each font contains a subset of Unicode
There are two fonts (that I know of) that contain a major part of Unicode:
- Arial Unicode MS ⾒ http://en.wikipedia.org/wiki/Arial_Unicode_MS ⾒
- Code2000 ⾒ http://code2000.net/ ⾒

Unicode Encodings
The Unicode Codepoint for 好 is U+597D
Different systems use different encodings for 好. Encodings include:
- Unicode UTF-8: E5A5BD (Hexadecimal)
- Unicode UTF-16: 597D (Hexadecimal)
- HTML entity: &#22909; (decimal numeric character reference)
⾒ Use UnicodeChecker to see character encodings ⾒
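
The values on this slide can be reproduced with a few lines of Python. This is a minimal sketch using only built-in string methods; the function name `encodings_of` is invented for the example, and UTF-16 is shown in big-endian byte order without a byte order mark so that it matches the slide.

```python
def encodings_of(ch: str) -> dict:
    """Return several representations of a single character.

    `encodings_of` is a hypothetical helper for this illustration.
    """
    return {
        "codepoint": f"U+{ord(ch):04X}",
        "utf8_hex": ch.encode("utf-8").hex().upper(),
        # Big-endian, no byte order mark (an assumption made to match the slide).
        "utf16_hex": ch.encode("utf-16-be").hex().upper(),
        # Decimal numeric character reference, terminated by a semicolon.
        "html_entity": f"&#{ord(ch)};",
    }


if __name__ == "__main__":
    for label, value in encodings_of("好").items():
        print(f"{label}: {value}")
```

The code point is the same in every case; only the byte sequence used to store or transmit it changes with the encoding.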

UTF-8
A UTF-8 capable application/system can display/manipulate/transmit any character from the Unicode character set.
This means that such an application/system is multi-language capable
Unicode UTF-8 is becoming a standard.

Email Systems generally allow for choice of Character Set when sending
LU web mail is such a system
Generally one should set the default as UTF-8
What happens when an incorrect Character Set is chosen?
⾒ I will demonstrate ⾒
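
One way to see what an incorrect Character Set does, without an email system, is to encode text with one character set and decode the bytes with another. The sketch below is only a simulation of that mismatch; `misread` is a hypothetical helper, and real mail clients may behave differently (for example by guessing the encoding).

```python
def misread(text: str, sent_as: str, read_as: str) -> str:
    """Encode `text` with one character set and decode it with another.

    Hypothetical helper for this illustration. Bytes that are invalid in
    `read_as` are shown as U+FFFD, the Unicode replacement character.
    """
    return text.encode(sent_as).decode(read_as, errors="replace")


if __name__ == "__main__":
    # UTF-8 bytes E5 A5 BD read as ISO-8859-1 become three unrelated characters.
    print(misread("好", "utf-8", "iso-8859-1"))   # å¥½
    # An ISO-8859-1 byte read as UTF-8 is not valid and gets replaced.
    print(misread("é", "iso-8859-1", "utf-8"))
    # When sender and reader agree, the text survives.
    print(misread("好", "utf-8", "utf-8"))
```

The bytes themselves are never damaged in this simulation; only their interpretation changes, which is why agreeing on UTF-8 at both ends avoids the problem.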

Web Pages - Character Sets
1. ⾒ http://lboro.ac.uk/ ⾒ uses ISO-8859-1
2. ⾒ http://リコー.jp/ ⾒ uses ISO-2022-JP
3. ⾒ UTF-8 Test Page ⾒ uses UTF-8
1 & 2 have constrained character sets
3 can use the full Unicode character set
Use Unicode UTF-8

Web Safe Fonts
⾒ web safe fonts ⾒
Use fonts that the client machine is likely to have
Use Generic Font Families, such as:
- sans-serif
- cursive
- serif
Arial Unicode MS is a good candidate for a “Web Safe Font”

Glyph Safe Machine
Ensure a comprehensive set of fonts that gives complete character coverage
Then you will no longer see ▯▯▯▯▯▯ or ��� in your browser/email/word processor ...etc...

Multi-language/symbol Webpages
Setting up a Unicode Webpage:
- Save page as Unicode UTF-8 encoding
- In the header of your html page include the statement: <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
You will now be able to put any Unicode Characters into your html page
Whether or not they display correctly is dependent on the client machine.
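
The two steps above (save as UTF-8, declare UTF-8 in the header) can be sketched as a small script. This is an illustration under stated assumptions: `build_page`, the page title, the sample paragraphs and the file name `multilang.html` are all invented for the example, and the demo writes into the system temporary directory rather than a web server.

```python
import html
import tempfile
from pathlib import Path

# The statement from the slide, placed in the page header.
META = '<meta http-equiv="Content-Type" content="text/html; charset=utf-8">'


def build_page(title: str, paragraphs: list) -> bytes:
    """Return a minimal HTML page as UTF-8 bytes (hypothetical helper)."""
    body = "\n".join(f"<p>{html.escape(p)}</p>" for p in paragraphs)
    page = (
        "<!DOCTYPE html>\n"
        "<html>\n<head>\n"
        f"{META}\n"
        f"<title>{html.escape(title)}</title>\n"
        "</head>\n<body>\n"
        f"{body}\n"
        "</body>\n</html>\n"
    )
    # Encoding explicitly is the "save as UTF-8" step.
    return page.encode("utf-8")


if __name__ == "__main__":
    data = build_page("Multi-language test", ["good", "好", "よい"])
    target = Path(tempfile.gettempdir()) / "multilang.html"
    target.write_bytes(data)
    print(f"wrote {len(data)} bytes to {target}")
```

Opening the resulting file in a browser shows whether the client machine has fonts for each paragraph; the page itself is correct either way.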

The End
There are font exercises for you to complete. These exercises will address the issues of:
- Character Sets
- Client Machine Setup