LING 001 Introduction to Linguistics Spring 2010 Chinese writing system Reading Mar. 31 Writing systems II.

Slides:



Advertisements
Similar presentations
Review of HTML Ch. 1.
Advertisements

EDRD 6600 Trudie Hughes, Ph.D..
Chapter 2 phonology. The phonic medium of language Speech is more basic than writing. Reasons? Linguists studies the speech sounds.
Evaluating the Effect of Neighborhood Size on Chinese Word Naming and Lexical Decision Meng-Feng Li 1, Jei-Tun WU 1*, Wei-Chun Lin 1 and Fu-Ling Yang 1.
Digital Fundamentals Floyd Chapter 2 Tenth Edition
Learning to Read a Non- alphabetic Script - Chinese Or: “I have to learn how many characters?”
Chapter 8_2 Bits and the "Why" of Bytes: Representing Information Digitally.
Introduction to Computers and Programming. Some definitions Algorithm: Algorithm: A procedure for solving a problem A procedure for solving a problem.
Data Representation (in computer system) Computer Fundamental CIM2460 Bavy LI.
1 12/08/03SW Abingdon and Witney College Binary Converting to and from decimal.
Linguistic Phonics Co-ordinator Support Pack Linguistic Phonics.
CCE-EDUSAT SESSION FOR COMPUTER FUNDAMENTALS Date: Session III Topic: Number Systems Faculty: Anita Kanavalli Department of CSE M S Ramaiah.
COMPUTER FUNDAMENTALS David Samuel Bhatti
CHARACTERS Data Representation. Using binary to represent characters Computers can only process binary numbers (1’s and 0’s) so a system was developed.
Liu, Perfetti, & Wang (2006) as summarized by Scott Hajek.
Dale & Lewis Chapter 3 Data Representation
Introduction to Human Language Technologies Tomaž Erjavec Karl-Franzens-Universität Graz Tomaž Erjavec Lecture: Character sets
CCE-EDUSAT SESSION FOR COMPUTER FUNDAMENTALS Faculty: Anita Kanavalli Department of CSE M S Ramaiah Institute of Technology Bangalore E mail-
© 2009 Pearson Education, Upper Saddle River, NJ All Rights ReservedFloyd, Digital Fundamentals, 10 th ed Digital Fundamentals Tenth Edition Floyd.
Chapter 3 Representing Numbers and Text in Binary Information Technology in Theory By Pelin Aksoy and Laura DeNardis.
Representing text Each of different symbol on the text (alphabet letter) is assigned a unique bit patterns the text is then representing as.
Week 4 Number Systems.
Computer System Basics 1 Number Systems & Text Representation Computer Forensics BACS 371.
CMPT 120 How computers run programs Summer 2012 Instructor: Hassan Khosravi.
CSC 101 Introduction to Computing Lecture 9 Dr. Iftikhar Azim Niaz 1.
Data Representation S2. This unit covers how the computer represents- Numbers Text Graphics Control.
Chapter 2 Computer Hardware
INFOCODING BASICS & EXAMPLES OF CURRENT USE Introduction to Computer Science Using Ruby (c) 2010 Gideon Frieder.
Binary Code.
Character Encoding, F onts. Overview Why do character encoding and fonts matter to linguists? How can you identify problems? Why do these problems arise?
1 The Implicit and Explicit Learning of Orthographic Structure and Function of a New Writing System 指導教授: Chen Ming-Puu 報 告 者 : Chen Hsiu-Ju 報告日期:
1 3 Computing System Fundamentals 3.5 Data Representation.
Computer System Basics 1 Number Systems & Text Representation Computer Forensics BACS 371.
Globalisation & Computer systems Week 5/6 Character representation ACII and code pages UNICODE.
Mental Organs. Phrenology was an important part of popular culture in Victorian England and in Europe during the 19th century.
Representing Characters in a computer Pressing a key on the computer a code is generated that the computer can convert into a symbol for displaying or.
Big Ideas in Reading: Phonemic Awareness
Dyslexia What is it all about???. Where is the problem? The deficit lies in the language system, NOT in the visual system -- NOT an overall language problem…
1 Wilson Reading System “What is Intervention”. 2 The Gift of Learning to Read When we teach a child to read we change her life’s trajectory.
Reading and Language Arts Chapter 6. What Does the Lack of Phonemic Awareness Look Like?  Children lacking PA skills cannot: group words with similar.
How to teach Reading ( Phonics )
Representation of Characters
Lecture 3 Speech Sounds and Their Systems. Phonetics 语音学.
© 2009 Pearson Education, Upper Saddle River, NJ All Rights ReservedFloyd, Digital Fundamentals, 10 th ed Digital Logic Design Dr. Oliver Faust.
Characters CS240.
DATA REPRESENTATION 4 Y. Colette Lemard February 2009.
ASCII AND EBCDIC CODES By : madam aisha.
Representing Characters in a Computer System Representation of Data in Computer Systems.
Information Coding Schemes Group Member : Yvonne Tiffany Jurifah bt Junaidi Clara Jane George.
How Phonological and Language Deficits Impact Literacy Proficiency Sherry Comerchero ASHA Certified Speech-Language Pathologist April 4, 2007.
Chapter 11 Language. Some Questions to Consider How do we understand individual words, and how are words combined to create sentences? How can we understand.
A+ Computer Repair Lesson 3: Number System. Objectives Define binary, decimal, octal, and hexadecimal numbering systems. Define binary, decimal, octal,
Phonological Awareness Phonemic Awareness Phonics.
1.4 Representation of data in computer systems Character.
Lecture Coding Schemes. Representing Data English language uses 26 symbols to represent an idea Different sets of bit patterns have been designed to represent.
DATA REPRESENTATION - TEXT
Binary 1 Basic conversions.
Representing Information as bit patterns
Phnom Penh International University (PPIU)
Data Encoding Characters.
TOPICS Information Representation Characters and Images
Data Representation ASCII.
Representing Characters
Digital Representation
Fundamentals of Data Representation
Presenting information as bit patterns
INFOCODING BASICS & EXAMPLES OF CURRENT USE
Digital Representation of Data
ASCII and Unicode.
Presentation transcript:

LING 001 Introduction to Linguistics Spring 2010 Chinese writing system Reading Mar. 31 Writing systems II

LING 001 Introduction to Linguistics, Spring Origins of Chinese characters Legend has it that Cangjie, a historian-official who lived in the time of Huangdi (the Yellow Emperor), created Chinese characters at the inspiration of such natural objects as the sun, the moon, the stars, and footprints of animals and birds. Historical records and archeological finds reveal that Cangjie might, in fact, have been the first person to study and index the Chinese characters.

LING 001 Introduction to Linguistics, Spring Secrets discovered in Medicine In 1899, Chinese scholar Wang Yirong discovered from “dragon bones” - an ingredient of traditional Chinese medicine - some peculiar symbols. The symbols were Jiaguwen, or script written on oracle bones (tortoise shells and animal bones), and these particular bones were found to date back 3,000 years.

LING 001 Introduction to Linguistics, Spring Evolution of Chinese characters Oracle bone script Bronze script Small seal script Clerical script Standard script Grass script Running script Simplified script

LING 001 Introduction to Linguistics, Spring Structure of Chinese characters Pictograms: 日 ‘sun, day’, 人 ‘person’ Indicatives: 上 ‘up’ 下 ‘down’ 凹 ‘concave’ 凸 ‘convex’ Semantic-semantic compounds: 休 ‘rest’ = 人 + 木 (person + tree) Semantic-phonetic compounds: 蝗 (虫 + 皇) ‘locust’ = ‘INSECT + huang2’

LING 001 Introduction to Linguistics, Spring Structure of Chinese characters Most semantic-phonetic compounds (also called phonetic compounds) have a left-right structure, having their semantic radicals on the the left and phonetic radicals on the right.

LING 001 Introduction to Linguistics, Spring 來 lái = wheat lái = come 來 = wheat/come 麥 = wheat 來 = come Chinese Rebus: Phonetic Loans I cansee you

LING 001 Introduction to Linguistics, Spring Chinese characters Note: The 3,500 most frequently used characters in Modern Chinese would cover 99.48% of a 2 million character corpus, where the first 2,500 characters accounted for 97.97%.

LING 001 Introduction to Linguistics, Spring Computer coding of characters Decimal (base 10): 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 e.g., 107 = 1* * *10 0 Binary (base 2): 0, 1 e.g., = 1* * * * *2 0 (107) Bit: the basic unit in computer, represents either 1 or 0, on or off. Byte: A sequence of eight bits. It is used as a fundamental unit in modern computers. Also called octet. Characters are represented by binary numbers in computer

LING 001 Introduction to Linguistics, Spring Computer coding of characters American Standard Code for Information Interchange (ASCII) a 7-bit character set: range from control codes and formatting: Escape, Tab, Space, – punctuations, numbers, and English letters : ! " # $ % & ' ( ) * +, -. / : ; A B C D E F G H I J K L M N O P Q R S T U V W X Y Z [ \ ] ^ _ ` a b c d e f g h i j k l m n o p q r s t u v w x y z { | } ~ e.g., “A” (65) ISO It uses 8 bits, and contains ASCII as a subset. In addition to the ASCII characters, ISO contains various accented characters and other letters needed for writing languages of Western Europe, and some special characters.

LING 001 Introduction to Linguistics, Spring Computer coding of characters Some languages (Chinese, Japanese, Korean, etc.) have more than 256 characters. Encoding standards for these languages use sequences of bytes for characters. Because different standards use different numbers of bytes, the computer can’t tell whether a given byte is a whole character or part of a character; corruption of one byte can corrupt the whole data stream. Unicode: 21-bit encoding space allows for 1,114,112 characters; 95,156 code point values assigned to characters in Unicode 3.2; 879,626 code point values reserved for future character assignments.

LING 001 Introduction to Linguistics, Spring UTF-8 For ASCII characters, the 21-bit value is truncated to 8 bits: For other characters, the 21-bit value is turned into a sequence of two, three, or four 8-bit values:

LING 001 Introduction to Linguistics, Spring Phonology in reading development Phonological processes are generally considered to be important for developing word reading skills. “The best predictor of reading difficulty in kindergarten or first grade is the inability to segment words and syllables into constituent sound units (phonemic awareness).” (Lyon, 1995) “Reading and phonemic awareness are mutually reinforcing: Phonemic awareness is necessary for reading, and reading, in turn, improves phonemic awareness still further.” (Shaywitz, 2003)

LING 001 Introduction to Linguistics, Spring Dyslexia Phonological deficits are the most significant and consistent cognitive marker of dyslexic children. Auditory Analysis Test: it asks a child to segment words into their underlying phonological units and then to delete specific phonemes from the words, e.g., say ‘block’ without ‘buh’. Even in high school students, phonological awareness was the best indicator of reading ability.

LING 001 Introduction to Linguistics, Spring Phonological activation in reading Skilled readers activate phonological representations in reading. Lexical decision: In a lexical decision paradigm, subjects are presented with a written stimulus, and are asked to answer whether or not the stimulus in question is a word of their language. A typical finding is that participants take more time to reject pseudohomophones foils than controls foils. Naming: In the naming paradigm, subjects are again presented with a written stimulus, but this time they are asked to pronounce the stimulus aloud. — to “name” the word that is on the screen. A typical finding is that participants take shorter time to name the word if a homophone (prime) is presented before the target word.

LING 001 Introduction to Linguistics, Spring Phonological activation in reading Eye movement: Target words were read faster (shorter fixation duration) when a phonologically similar word, e.g., homophone, was presented briefly at the onset of fixation on the target region (Rayner et al. 1995). The prime for a given target (e.g., beach) was either identical to the target (beach), a phonologically similar word (the homophone beech), a visually similar nonhomophone (bench), or a dissimilar word (noise). Comparing fixation times on the target when it was preceded by the homophone versus the visually similar word.

LING 001 Introduction to Linguistics, Spring Phonological activation in reading ERP (event-related potential): Target words had smaller ERPs (averaged electrical activity in the brain) when a phonologically similar word or syllable was presented before the target word. (Ashby 2010).

LING 001 Introduction to Linguistics, Spring Phonological activation in reading The morphemic nature of Chinese writing leads easily to the assumption of a close connection between graphic form and meaning. characters are not alphabetic. On average, 11characters share a single pronunciation if disregarding tone, about four homophones for each character if tone is considered. 石室诗士施氏, 嗜狮, 誓食十狮。 氏时时适市视狮。 十时, 适十狮适市。 是时, 适施氏适市。 氏视是十狮, 恃矢势, 使是十狮逝世。 氏拾是十狮尸, 适石室。 石室湿, 氏使侍拭石室。 石室拭, 氏始试食是十狮。 食时, 始识是十狮, 实十石狮尸。 试释是事。

LING 001 Introduction to Linguistics, Spring Phonological activation in reading Like in English, participants in Chinese take shorter time to name the target word if a homophone (prime) is presented before it (Perfetti & Tan 1998). Graphic information begins the identification process and it the first to show a priming effect Phonological information precedes semantic information in primed naming

LING 001 Introduction to Linguistics, Spring Phonological activation in reading Analysis of 19 published brain mapping studies (fMRI) of phonological processing in reading, six with Chinese and 13 with alphabetic languages, found significant differences between languages (Tan et al. 2005) The left middle frontal gyrus is responsible for addressed phonology in Chinese. Left temporoparietal regions mediate assembled phonology in alphabetic languages. More on language and brain later.