Lecture 7 Data Compression

Presentation transcript:

Lecture 7 Data Compression File Structures

Previous Lecture

Lecture 6: Fundamental File Structure Concepts; Managing Files of Records
Field and record organization
Sequential search and direct access
Seeking

Today's Lecture

Overview: In this lecture, we continue to focus on file organization, but with a different motivation. This time we look at ways to organize or reorganize files in order to improve performance.

Outline Data Compression: how to make files smaller

Data Compression: An Overview
Data compression: encoding the information in a file in such a way that it takes less space.
Question: Why do we want to make files smaller?
Answer:
To use less storage, i.e., to save costs.
To transmit files faster, decreasing access time, or to keep the same access time with a lower and cheaper bandwidth.
To process the file sequentially faster.

Using Compact Notation

Run-Length Encoding

Run-length Encoding Example
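
The body of this slide is not part of the transcript, so the following is only a minimal sketch of the general idea, not the scheme shown in the lecture. It assumes the simplest variant, in which every run of identical bytes is replaced by a (value, run length) pair; the function names `rle_encode` and `rle_decode` are hypothetical.

```python
from itertools import groupby


def rle_encode(data: bytes) -> list[tuple[int, int]]:
    """Collapse each run of identical bytes into a (value, run_length) pair."""
    # groupby yields one group per maximal run of equal consecutive bytes.
    return [(value, len(list(run))) for value, run in groupby(data)]


def rle_decode(pairs: list[tuple[int, int]]) -> bytes:
    """Expand (value, run_length) pairs back into the original bytes."""
    return b"".join(bytes([value]) * count for value, count in pairs)


if __name__ == "__main__":
    pairs = rle_encode(b"aaabbc")
    print(pairs)
    print(rle_decode(pairs))
```

This variant only pays off when runs are long: an input with no repeated bytes produces one pair per byte and therefore grows. Practical schemes usually mark runs with a special indicator value and leave other bytes untouched.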

Variable-Length Codes

Huffman Code

Huffman Code Example

Huffman Tree
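
The tree drawn on this slide is not part of the transcript. As a hedged illustration, the sketch below builds a Huffman tree with the standard greedy procedure (repeatedly merge the two lowest-weight subtrees) and reads the codes off the tree. The function name `huffman_codes`, the tie-breaking order, and the choice of "0" for left and "1" for right are assumptions, so the exact bit patterns may differ from the lecture's example even when the code lengths agree.

```python
import heapq
from collections import Counter


def huffman_codes(text: str) -> dict[str, str]:
    """Return a prefix-free binary code for each distinct character of text."""
    freq = Counter(text)
    if not freq:
        return {}
    if len(freq) == 1:
        # A single distinct symbol still needs one bit per occurrence.
        return {next(iter(freq)): "0"}

    # Heap entries are (weight, tie_breaker, tree). A tree is either a
    # single character (leaf) or a (left, right) tuple (internal node).
    # The unique tie_breaker keeps the heap from ever comparing trees.
    heap = [(weight, i, sym) for i, (sym, weight) in enumerate(freq.items())]
    heapq.heapify(heap)
    next_id = len(heap)

    # Greedy step: merge the two lightest subtrees until one tree remains.
    while len(heap) > 1:
        w1, _, t1 = heapq.heappop(heap)
        w2, _, t2 = heapq.heappop(heap)
        heapq.heappush(heap, (w1 + w2, next_id, (t1, t2)))
        next_id += 1

    codes: dict[str, str] = {}

    def walk(tree, prefix: str) -> None:
        if isinstance(tree, tuple):
            walk(tree[0], prefix + "0")  # left branch
            walk(tree[1], prefix + "1")  # right branch
        else:
            codes[tree] = prefix

    walk(heap[0][2], "")
    return codes


if __name__ == "__main__":
    for symbol, code in huffman_codes("aaaabbc").items():
        print(repr(symbol), code)
```

For the input "aaaabbc" the most frequent character receives a 1-bit code and the two rarer ones receive 2-bit codes, so the seven characters encode in 10 bits.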

Irreversible Compression

Lempel-Ziv Codes

Lempel-Ziv Codes Example

Lempel-Ziv Codes Example

Lempel-Ziv Codes Example
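
The example slides themselves are not part of the transcript, and "Lempel-Ziv" covers several variants (LZ77, LZ78, LZW). The sketch below assumes the LZ78 variant, since the assignment in this lecture asks for a digital tree, which matches LZ78's phrase dictionary. The names `lz78_encode` and `lz78_decode` and the (phrase index, next character) output format are assumptions for illustration.

```python
def lz78_encode(text: str) -> list[tuple[int, str]]:
    """Encode text as (index of longest known phrase, next character) pairs."""
    # Maps each known phrase to its index; index 0 is the empty phrase,
    # i.e. the root of the digital tree.
    dictionary = {"": 0}
    output = []
    phrase = ""
    for ch in text:
        if phrase + ch in dictionary:
            # Keep extending the match while it is still a known phrase.
            phrase += ch
        else:
            # Emit the known prefix plus the new character, then add the
            # extended phrase as a new node of the tree.
            output.append((dictionary[phrase], ch))
            dictionary[phrase + ch] = len(dictionary)
            phrase = ""
    if phrase:
        # Input ended inside a known phrase: emit it as prefix + last char.
        output.append((dictionary[phrase[:-1]], phrase[-1]))
    return output


def lz78_decode(pairs: list[tuple[int, str]]) -> str:
    """Rebuild the text by replaying the dictionary construction."""
    phrases = [""]
    pieces = []
    for index, ch in pairs:
        phrase = phrases[index] + ch
        phrases.append(phrase)
        pieces.append(phrase)
    return "".join(pieces)


if __name__ == "__main__":
    pairs = lz78_encode("abababab")
    print(pairs)
    print(lz78_decode(pairs))
```

Each emitted pair corresponds to one edge added to the digital tree: the index names the parent node and the character labels the new edge.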

Assignment: Using Lempel-Ziv, encode the files containing the following characters, drawing the corresponding digital tree for each:
"aaabbcbcdddeab"
"I AM SAM, SAM I AM"

Next Lecture

Organizing Files for Performance
Reclaiming space in files
Sorting and binary searching

Questions?