Source Encoding and Compression

Slides:



Advertisements
Similar presentations
Data Compression CS 147 Minh Nguyen.
Advertisements

Entropy and Information Theory
An introduction to Data Compression
T h e U n i v e r s i t y o f B r i t i s h C o l u m b i a Bi-Level Image Compression EECE 545: Data Compression by Dave Tompkins.
Compression Techniques. Digital Compression Concepts ● Compression techniques are used to replace a file with another that is smaller ● Decompression.
Fundamental limits in Information Theory Chapter 10 :
CS336: Intelligent Information Retrieval
Information Theory Eighteenth Meeting. A Communication Model Messages are produced by a source transmitted over a channel to the destination. encoded.
Data Representation CS105. Data Representation Types of data: – Numbers – Text – Audio – Images & Graphics – Video.
1 Chapter 1 Introduction. 2 Outline 1.1 A Very Abstract Summary 1.2 History 1.3 Model of the Signaling System 1.4 Information Source 1.5 Encoding a Source.
Klara Nahrstedt Spring 2014
Lecture 4 Source Coding and Compression Dr.-Ing. Khaled Shawky Hassan
Data Compression Basics & Huffman Coding
1 Lossless Compression Multimedia Systems (Module 2) r Lesson 1: m Minimum Redundancy Coding based on Information Theory: Shannon-Fano Coding Huffman Coding.
Data dan Teknologi Multimedia Sesi 08 Nofriyadi Nurdam.
ELG5126 Source Coding and Data Compression
Spring 2015 Mathematics in Management Science Binary Linear Codes Two Examples.
Lecture 10 Data Compression.
Lecture 1 Contemporary issues in IT Lecture 1 Monday Lecture 10:00 – 12:00, Room 3.27 Lab 13:00 – 15:00, Lab 6.12 and 6.20 Lecturer: Dr Abir Hussain Room.
Noiseless Coding. Introduction Noiseless Coding Compression without distortion Basic Concept Symbols with lower probabilities are represented by the binary.
Text Compression Spring 2007 CSE, POSTECH. 2 2 Data Compression Deals with reducing the size of data – Reduce storage space and hence storage cost Compression.
Computer Vision – Compression(2) Hanyang University Jong-Il Park.
Survey on Improving Dynamic Web Performance Guide:- Dr. G. ShanmungaSundaram (M.Tech, Ph.D), Assistant Professor, Dept of IT, SMVEC. Aswini. S M.Tech CSE.
Multimedia Specification Design and Production 2012 / Semester 1 / L3 Lecturer: Dr. Nikos Gazepidis
Digital Image Processing, 2nd ed. © 2002 R. C. Gonzalez & R. E. Woods Chapter 8 – Image Compression.
Image Compression (Chapter 8) CSC 446 Lecturer: Nada ALZaben.
Compression.  Compression ratio: how much is the size reduced?  Symmetric/asymmetric: time difference to compress, decompress?  Lossless; lossy: any.
1 Source Coding and Compression Dr.-Ing. Khaled Shawky Hassan Room: C3-222, ext: 1204, Lecture 5.
1 Classification of Compression Methods. 2 Data Compression  A means of reducing the size of blocks of data by removing  Unused material: e.g.) silence.
Lossless Compression CIS 465 Multimedia. Compression Compression: the process of coding that will effectively reduce the total number of bits needed to.
Computer Vision – Compression(1) Hanyang University Jong-Il Park.
Huffman Code and Data Decomposition Pranav Shah CS157B.
Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp
CS Spring 2011 CS 414 – Multimedia Systems Design Lecture 6 – Basics of Compression (Part 1) Klara Nahrstedt Spring 2011.
Lecture 4: Lossless Compression(1) Hongli Luo Fall 2011.
CS654: Digital Image Analysis
CS654: Digital Image Analysis Lecture 34: Different Coding Techniques.
Digital Image Processing Lecture 22: Image Compression
Data Compression lecture1. References Khalid Sayood, Introduction to Data Compression, Morgan Kaufman Publ. 2nd ed. 2000, Mark Nelson, Jean Loop Gailly,
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 11 COMPRESSION.
Multi-media Data compression
Chapter 7 Lossless Compression Algorithms 7.1 Introduction 7.2 Basics of Information Theory 7.3 Run-Length Coding 7.4 Variable-Length Coding (VLC) 7.5.
SEAC-3 J.Teuhola Information-Theoretic Foundations Founder: Claude Shannon, 1940’s Gives bounds for:  Ultimate data compression  Ultimate transmission.
Computer Sciences Department1. 2 Data Compression and techniques.
Submitted To-: Submitted By-: Mrs.Sushma Rani (HOD) Aashish Kr. Goyal (IT-7th) Deepak Soni (IT-8 th )
Advanced Course on Databases (Tietokantojen jatkokurssi)
Component 1.9 Security and Data Management
IMAGE PROCESSING IMAGE COMPRESSION
Digital Image Processing Lecture 20: Image Compression May 16, 2005
Shannon Entropy Shannon worked at Bell Labs (part of AT&T)
3.3 Fundamentals of data representation
IMAGE COMPRESSION.
Data Compression.
Multimedia Outline Compression RTP Scheduling Spring 2000 CS 461.
Introduction to Information theory
Introduction to Information Theory- Entropy
Image Compression The still image and motion images can be compressed by lossless coding or lossy coding. Principle of compression: - reduce the redundant.
Data Compression.
Huffman Coding, Arithmetic Coding, and JBIG2
Data Compression CS 147 Minh Nguyen.
Information Theory Michael J. Watts
Context-based Data Compression
Digital Multimedia Coding
Why Compress? To reduce the volume of data to be transmitted (text, fax, images) To reduce the bandwidth required for transmission and to reduce storage.
UNIT IV.
CSE 589 Applied Algorithms Spring 1999
Image Transforms for Robust Coding
Foundation of Video Coding Part II: Scalar and Vector Quantization
COMS 161 Introduction to Computing
Lecture 1 Introduction and preliminaries (Chapter 0)
Presentation transcript:

Source Encoding and Compression Jukka Teuhola University of Turku Dept. of Information Technology Spring 2014 SEAC-1 J.Teuhola 2014

General Self-study course, starting lecture: 14.1.2014 Extent: 5 sp (3 cu) Level: Advanced Preliminary knowledge: Data structures and algorithms I, basics of probability calculus Material: Lecture notes and Powerpoint slides available via the course homepage. No textbook is needed. Homework: 10 small exercise tasks will be given. Solutions must be submitted to the lecturer before taking the examination. Minimum: 5 solutions acceptably solved. Examinations: Three attempts; March, April, May 2014 SEAC-1 J.Teuhola 2014

Optional literature T. C. Bell, J. G. Cleary, I. H. Witten: Text Compression, 1990. R. W. Hamming: Coding and Information Theory, 2nd ed., Prentice-Hall, 1986. K. Sayood: Introduction to Data Compression, 3rd ed., Morgan Kaufmann, 2006. K. Sayood: Lossless Compression Handbook, Academic Press, 2003. I. H. Witten, A. Moffat, T. C. Bell: Managing Gigabytes: compressing and indexing documents and images, Morgan Kaufmann, 1999. Miscellaneous articles SEAC-1 J.Teuhola 2014

Contents Basic concepts Coding-theoretic foundations Information-theoretic foundations Basic source coding methods Predictive models for text compression Dictionary models for text compression Compression of digital images SEAC-1 J.Teuhola 2014

1. Basic concepts Data compression: Purposes: Basic approaches: Minimize the size of information representation. Reduce the redundancy of the original representation. Purposes: Save storage space. Reduce transmission time. Basic approaches: Lossless compression: decompression into exactly the original form (typical for text). Lossy compression: decompression into approximately the original form (typical for signals and images). SEAC-1 J.Teuhola 2014

Basic concepts (cont.) Fields of coding theory: Source coding: purpose to minimize the size Channel coding: detection and correction of transmission errors. Also: cryptography: Encryption of private/secret information Source encoding Channel decoding Model Sink Communication channel Errors SEAC-1 J.Teuhola 2014

Basic concepts (cont.) Phases of data compression: Other viewpoints: Modelling of the source Source encoding (called also entropy coding), using the model Other viewpoints: Speed of compression / decompression Size of the model Classification by lengths of coding units: Fixed-to-fixed coding Variable-to-fixed coding Fixed-to-variable coding Variable-to-variable coding SEAC-1 J.Teuhola 2014

Examples of models 1. Character distribution Char Prob A 0.10 B 0.05 C 0.08 D 0.06 E 0.15 ….. ….. 2. Successor distribution Char Succ Prob A A 0.01 A B 0.20 A C 0.10 A D 0.25 ….. ….. …… B A 0.15 B B 0.02 B C 0.01 B D 0.01 ….. ….. ….. 3. Dictionary Word Prob ALL 0.02 ALWAYS 0.01 ARE 0.05 AS 0.03 AT 0.02 BASIC 0.01 BEGIN 0.01 ….. ….. SEAC-1 J.Teuhola 2014

Basic concepts (cont.) Main classes of text compression methods: Dictionary methods Statistical methods Classification based on availability of the source: Off-line methods On-line methods Classification based on the status of the model: Static methods Semiadaptive methods Adaptive methods Measurement of compression efficiency: Compression ratio: Source size / compressed size Bits per source symbol (character, pixel, etc.) SEAC-1 J.Teuhola 2014

Illustration of a static method Background knowledge of the source data types Model Source message Encoder Decoder Decoded message Derived once Send Use Read Write SEAC-1 J.Teuhola 2014

Illustration of a semiadaptive method Model Source message Encoder Decoder Decoded message 3. Send 4. Send Use 5. Use 2. Read 6. Write 1. Build SEAC-1 J.Teuhola 2014

Illustration of an adaptive method Models are updated dynamically, based on the already processed part of the source, known to both encoder and decoder. Model Source message Encoder Decoder Decoded message Send Use Read Write Dynamic update Initial model fixed Processed part SEAC-1 J.Teuhola 2014