ITEC 1011 Introduction to Information Technologies 2. Data Formats Chapt. 3.

Slides:



Advertisements
Similar presentations
KFUPM COE 202: Digital Logic Design Number Systems Part 3 Courtesy of Dr. Ahmad Almulhem.
Advertisements

Technology ICT Option: Data Representation. Data Representation In our everyday lives, we communicate with each other using analogue data. This data takes.
TK 2123 COMPUTER ORGANISATION & ARCHITECTURE Lecture 4: Data in The Computer Dr. Masri Ayob.
Data Representation (in computer system) Computer Fundamental CIM2460 Bavy LI.
Number Systems & Logic Gates Day 1
© BYU 02 NUMBERS Page 1 ECEn 224 Binary Number Systems and Codes.
CCE-EDUSAT SESSION FOR COMPUTER FUNDAMENTALS Date: Session III Topic: Number Systems Faculty: Anita Kanavalli Department of CSE M S Ramaiah.
COMPUTER FUNDAMENTALS David Samuel Bhatti
CODING SYSTEMS CODING SYSTEMS CODING SYSTEMS. CHARACTERS CHARACTERS digits: 0 – 9 (numeric characters) letters: alphabetic characters punctuation marks:
9/15/09 - L3 CodesCopyright Joanne DeGroat, ECE, OSU1 Codes.
CHARACTERS Data Representation. Using binary to represent characters Computers can only process binary numbers (1’s and 0’s) so a system was developed.
1 JCM 106 Computer Application for Journalism Lecture 1 – Introduction to Computing.
CCE-EDUSAT SESSION FOR COMPUTER FUNDAMENTALS Faculty: Anita Kanavalli Department of CSE M S Ramaiah Institute of Technology Bangalore E mail-
©Brooks/Cole, 2003 Chapter 2 Data Representation.
Chapter 2 Data Representation. Define data types. Visualize how data are stored inside a computer. Understand the differences between text, numbers, images,
Sem 1 v2 Chapter 14: Layer 6 - The Presentation layer.
Chapter 3 Representing Numbers and Text in Binary Information Technology in Theory By Pelin Aksoy and Laura DeNardis.
Representing text Each of different symbol on the text (alphabet letter) is assigned a unique bit patterns the text is then representing as.
Binary Numbers and ASCII and EDCDIC Mrs. Cueni. Data Representation  Human speech is analog because it uses continuous signals (waves) that vary in strength.
Numbers and Number Systems
Data Usually computing systems are complex devices, dealing with a vast array of information categories.
Lecture 2 Character Codes and Low-Structure Text Document Formats.
Bits & Bytes: How Computers Represent Data
Institute of Technology Sligo - Dept of Computing Sem 1 Chapter 14: Layer 6 - The Presentation layer.
Data Representation S2. This unit covers how the computer represents- Numbers Text Graphics Control.
1 Inside “The Box”. 2 CRT Display Keyboard Mouse “The Box” CD-ROM Drive Floppy Disk Drive Computing Devices – Old and New.
Chapter 2 Computer Hardware
Binary Arithmetic & Data representation
Cis303a_chapt03-2a.ppt Range Overflow Fixed length of bits to hold numeric data Can hold a maximum positive number (unsigned) X X X X X X X X X X X X X.
1 INFORMATION IN DIGITAL DEVICES. 2 Digital Devices Most computers today are composed of digital devices. –Process electrical signals. –Can only have.
ITEC 1000 “Introduction to Information Technology”
CS151 Introduction to Digital Design
Compsci Today’s topics l Binary Numbers  Brookshear l Slides from Prof. Marti Hearst of UC Berkeley SIMS l Upcoming  Networks Interactive.
Document Formats How to Build a Digital Library Ian H. Witten and David Bainbridge.
Chapter 1 Data Storage © 2007 Pearson Addison-Wesley. All rights reserved.
Data Representation Conversion 24/04/2017.
Data Representation, Number Systems and Base Conversions
Data Representation (in computer system). Data Representation How do computers represent data? b The computers are digital Recognize.
Chap 14 Presentation Layer Andres, Wen-Yuan Liao Department of Computer Science and Engineering De Lin Institute of Technology
Data Representation. What is data? Data is information that has been translated into a form that is more convenient to process As information take different.
MULTIMEDIA Multimedia is the field concerned with the computer- controlled integration of text, graphics, drawings, still and moving images (Video), animation,
Chapter 1 Data Storage © 2007 Pearson Addison-Wesley. All rights reserved.
M204 - Data Representation
ECE – III SEM BINARY CODES
DATA REPRESENTATION 4 Y. Colette Lemard February 2009.
ASCII AND EBCDIC CODES By : madam aisha.
Information Coding Schemes Group Member : Yvonne Tiffany Jurifah bt Junaidi Clara Jane George.
Layer 6 Presentation Layer. Overview Now that you have learned about Layer 5 of the OSI model, it is time to look at Layer 6, the presentation layer.
2. Data Formats. Introduction Examples pp Real World Data Computer Data Input device Dear Mom: Keyboard … Digital camera …
Software Design and Development Storing Data Part 2 Text, sound and video Computing Science.
Dr. Nermin Hamza. Agenda Signed Numbers Properties of Switching Algebra.
Lecture Coding Schemes. Representing Data English language uses 26 symbols to represent an idea Different sets of bit patterns have been designed to represent.
Chapter 1: Data Storage.
Binary Representation in Text
Binary Representation in Text
Computer Science: An Overview Eleventh Edition
4th Edition, Irv Englander
Bits & Bytes How Computers Represent Data
Representing Information as bit patterns
Phnom Penh International University (PPIU)
TOPICS Information Representation Characters and Images
Ch2: Data Representation
Computer Data Types Basics of Computing.
Chapter 2 Data Representation.
Chapter 1 Number System RGGP, Narwana.
2. Data Formats Chapt. 3.
Learning Intention I will learn how computers store text.
ASCII and Unicode.
Presentation transcript:

ITEC 1011 Introduction to Information Technologies 2. Data Formats Chapt. 3

ITEC 1011 Introduction to Information Technologies Introduction Examples pp Real World Data Computer Data Input device Dear Mom: Keyboard … Digital camera …

ITEC 1011 Introduction to Information Technologies Format must be appropriate The internal representation must be appropriate for the type of processing to take place (e.g., text, images, sound)

ITEC 1011 Introduction to Information Technologies Rules/Conventions Proprietary formats –Unique to a product or company –E.g., Microsoft Word, Corel Word Perfect, IBM Lotus Notes Standards –Evolve two ways: Proprietary formats become de facto standards (e.g., Adobe PostScript, Apple Quick Time) Committee is struck to solve a problem (Motion Pictures Experts Group, MPEG) pp

ITEC 1011 Introduction to Information Technologies Standards Organizations ISO – International Standards Organization CSA – Canadian Standards Association ANSI – American National Standards Institute IEEE – Institute for Electrical and Electronics Engineers Etc.

ITEC 1011 Introduction to Information Technologies Examples of Standards Type of DataStandards AlphanumericASCII, EBCDIC, Unicode ImageJPEG, GIF, PCX, TIFF Motion pictureMPEG-2, Quick Time SoundSound Blaster, WAV, AU Outline graphics/fontsPostScript, TrueType, PDF

ITEC 1011 Introduction to Information Technologies Why Standards? Standard are “arbitrary” They exist because they are –Convenient –Efficient –Flexible –Appropriate –Etc.

ITEC 1011 Introduction to Information Technologies Alphanumeric Data Problem: Distinguishing between the number 123 (one hundred and twenty-three) and the characters “123” (one, two, three) Four standards for representing letters (alpha) and numbers –BCD – Binary-coded decimal –ASCII – American standard code for information interchange –EBCDIC – Extended binary-coded decimal interchange code –Unicode pp

ITEC 1011 Introduction to Information Technologies Next 2 slides Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode

ITEC 1011 Introduction to Information Technologies Binary-Coded Decimal (BCD) Four bits per digit DigitBit pattern Note: the following bit patterns are not used:

ITEC 1011 Introduction to Information Technologies Example = ? (in BCD)

ITEC 1011 Introduction to Information Technologies Next 22 slides Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode

ITEC 1011 Introduction to Information Technologies The Problem Representing text strings, such as “ Hello, world”, in a computer

ITEC 1011 Introduction to Information Technologies Codes and Characters Each character is coded as a byte Most common coding system is ASCII (Pronounced ass-key) ASCII = American National Standard Code for Information Interchange Defined in ANSI document X

ITEC 1011 Introduction to Information Technologies ASCII Features 7-bit code 8 th bit is unused (or used for a parity bit) 2 7 = 128 codes Two general types of codes: –95 are “Graphic” codes (displayable on a console) –33 are “Control” codes (control features of the console or communications channel)

ITEC 1011 Introduction to Information Technologies ASCII Chart

ITEC 1011 Introduction to Information Technologies

ITEC 1011 Introduction to Information Technologies Most significant bit Least significant bit

ITEC 1011 Introduction to Information Technologies e.g., ‘a’ =

ITEC 1011 Introduction to Information Technologies 95 Graphic codes

ITEC 1011 Introduction to Information Technologies 33 Control codes

ITEC 1011 Introduction to Information Technologies Alphabetic codes

ITEC 1011 Introduction to Information Technologies Numeric codes

ITEC 1011 Introduction to Information Technologies Punctuation, etc.

ITEC 1011 Introduction to Information Technologies “Hello, world” Example ======================== Binary Hexadecimal C 6F 2C C 64 Decimal Hello, worldHello, world ======================== ========================

ITEC 1011 Introduction to Information Technologies Common Control Codes CR0Dcarriage return LF0Aline feed HT09horizontal tab DEL7Fdelete NULL00null Hexadecimal code

ITEC 1011 Introduction to Information Technologies

ITEC 1011 Introduction to Information Technologies Terminology Learn the names of the special symbols –[ ] brackets –{ }braces –( )parentheses ‘at’ sign –& ampersand –~tilde

ITEC 1011 Introduction to Information Technologies

ITEC 1011 Introduction to Information Technologies Escape Sequences Extend the capability of the ASCII code set For controlling terminals and formatting output Defined by ANSI in documents X and X The escape code is ESC = 1B 16 An escape sequence begins with two codes: ESC [ 1B 16 5B 16

ITEC 1011 Introduction to Information Technologies Examples Erase display:ESC [ 2 J Erase line:ESC [ K

ITEC 1011 Introduction to Information Technologies Next 1 slides Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode

ITEC 1011 Introduction to Information Technologies EBCDIC Extended BCD Interchange Code (pronounced ebb’-se-dick) 8-bit code Developed by IBM Rarely used today IBM mainframes only

ITEC 1011 Introduction to Information Technologies Next 2 slides Standard Alphanumeric Formats BCD ASCII EBCDIC Unicode

ITEC 1011 Introduction to Information Technologies Unicode 16-bit standard Developed by a consortia Intended to supercede older 7- and 8-bit codes

ITEC 1011 Introduction to Information Technologies Unicode Version Improves on version 2.0 Includes the Euro sign (20AC 16 = ) From the standard: …contains 38,887 distinct coded characters derived from the supported scripts. These characters cover the principal written languages of the Americas, Europe, the Middle East, Africa, India, Asia, and Pacifica.

ITEC 1011 Introduction to Information Technologies Keyboard Input Key (“scan”) codes are converted to ASCII ASCII code sent to host computer Received by the host as a “stream” of data Stored in buffer Processed Etc. pp. 69

ITEC 1011 Introduction to Information Technologies Shift Key inhibits bit 5 in the ASCII code Key(s) ASCII code Character aAaA a aShift

ITEC 1011 Introduction to Information Technologies Control Key inhibits bits 5 & 6 in the ASCII code Key(s) ASCII code Character c ETX c cCtrl Control code

ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp

ITEC 1011 Introduction to Information Technologies OCR Hello, world Page of text Optical scan … Computer file

ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp

ITEC 1011 Introduction to Information Technologies Bar Codes An automatic identification (Auto ID) technology that streamlines identification and data collection See

ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp

ITEC 1011 Introduction to Information Technologies Voice/audio Input Input device: microphone Audio input is “digitized” and stored Processed in two ways –As is (no recognition) –Recognized and converted to alphanumeric data (ASCII) Digitize …

ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp

ITEC 1011 Introduction to Information Technologies Punched Cards Invented by Herman Hollerith (founder of IBM) Each card holds 80 characters

ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp

ITEC 1011 Introduction to Information Technologies Images Typically images are pictures that are optically scanned and saved as a “bit map” or in some other format Many formats –gif, jpeg, …

ITEC 1011 Introduction to Information Technologies Typical “Save As” Dialog

ITEC 1011 Introduction to Information Technologies Objects Images made of geometrically definable shapes Offer efficiency, flexibility, small size, etc.

ITEC 1011 Introduction to Information Technologies Other Input OCR – optical character recognition Bar code readers Voice/audio input Punched cards Images / objects Pointing devices pp

ITEC 1011 Introduction to Information Technologies Pointing Devices Originally used for specifying coordinates (x, y) for graphical input Today used as general purpose device for “graphical user interfaces” (GUIs)

ITEC 1011 Introduction to Information Technologies Thank you