GP-ZIP Genetic Programing File Compression By: Dj Gerena.

Slides:



Advertisements
Similar presentations
Information Retrieval in Practice
Advertisements

T.Sharon-A.Frank 1 Multimedia Compression Basics.
Digital Color 24-bit Color Indexed Color Image file compression
Michael Alves, Patrick Dugan, Robert Daniels, Carlos Vicuna
PrasadL07IndexCompression1 Index Compression Adapted from Lectures by Prabhakar Raghavan (Yahoo, Stanford) and Christopher Manning.
Image Compression, Transform Coding & the Haar Transform 4c8 – Dr. David Corrigan.
Compression & Huffman Codes
School of Computing Science Simon Fraser University
Compression Techniques. Digital Compression Concepts ● Compression techniques are used to replace a file with another that is smaller ● Decompression.
CSc 461/561 CSc 461/561 Multimedia Systems Part B: 1. Lossless Compression.
Compression JPG compression, Source: Original 10:1 Compression 45:1 Compression.
A Data Compression Algorithm: Huffman Compression
Compression & Huffman Codes Fawzi Emad Chau-Wen Tseng Department of Computer Science University of Maryland, College Park.
CS 206 Introduction to Computer Science II 04 / 29 / 2009 Instructor: Michael Eckmann.
T.Sharon-A.Frank 1 Multimedia Image Compression 2 T.Sharon-A.Frank Coding Techniques – Hybrid.
 Wisegeek.com defines Data Compression as:  “Data compression is a general term for a group of technologies that encode large files in order to shrink.
Data dan Teknologi Multimedia Sesi 08 Nofriyadi Nurdam.
Data Compression Gabriel Laden CS146 – Dr. Sin-Min Lee Spring 2004.
Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.
CSE Lectures 22 – Huffman codes
CS559-Computer Graphics Copyright Stephen Chenney Image File Formats How big is the image? –All files in some way store width and height How is the image.
Cosc 2150: Computer Organization Chapter 2a Data compression.
Data Compression For Images. Data compression or source coding is the process of encoding information using fewer bits (or other information-bearing units)
JPEG C OMPRESSION A LGORITHM I N CUDA Group Members: Pranit Patel Manisha Tatikonda Jeff Wong Jarek Marczewski Date: April 14, 2009.
Data Compression. How Is This Possible? Entire King James Bible : 4,834,757 bytes Zip Archive Containing It: 1,339,843 bytes.
Video Basics. Agenda Digital Video Compressing Video Audio Video Encoding in tools.
: Chapter 12: Image Compression 1 Montri Karnjanadecha ac.th/~montri Image Processing.
Fundamental Structures of Computer Science Feb. 24, 2005 Ananda Guna Lempel-Ziv Compression.
Images 01/29/04 Resources: Yale Web Style Guide The GIF Controversy Unisys - lzw.
Image Compression (Chapter 8) CSC 446 Lecturer: Nada ALZaben.
Multimedia Data Introduction to Lossless Data Compression Dr Sandra I. Woolley Electronic, Electrical.
The LZ family LZ77 LZ78 LZR LZSS LZB LZH – used by zip and unzip
Compression No. 1  Seattle Pacific University Data Compression Kevin Bolding Electrical Engineering Seattle Pacific University.
File Compression Techniques Alex Robertson. Outline History Lossless vs Lossy Basics Huffman Coding Getting Advanced Lossy Explained Limitations Future.
Homework #5 New York University Computer Science Department Data Structures Fall 2008 Eugene Weinstein.
Addressing Image Compression Techniques on current Internet Technologies By: Eduardo J. Moreira & Onyeka Ezenwoye CIS-6931 Term Paper.
Lossless Compression CIS 465 Multimedia. Compression Compression: the process of coding that will effectively reduce the total number of bits needed to.
1 Image Formats. 2 Color representation An image = a collection of picture elements (pixels) Each pixel has a “color” Different types of pixels Binary.
COMPRESSION. Compression in General: Why Compress? So Many Bits, So Little Time (Space) CD audio rate: 2 * 2 * 8 * = 1,411,200 bps CD audio storage:
Huffman Code and Data Decomposition Pranav Shah CS157B.
CS430 © 2006 Ray S. Babcock LZW Coding Lempel-Ziv-Welch.
Lecture 4: Lossless Compression(1) Hongli Luo Fall 2011.
A New Operating Tool for Coding in Lossless Image Compression Radu Rădescu University POLITEHNICA of Bucharest, Faculty of Electronics, Telecommunications.
CS654: Digital Image Analysis Lecture 34: Different Coding Techniques.
Hanyang University Hyunok Oh Energy Optimal Bit Encoding for Flash Memory.
Digital Image Processing Lecture 22: Image Compression
Main Index Contents 11 Main Index Contents Complete Binary Tree Example Complete Binary Tree Example Maximum and Minimum Heaps Example Maximum and Minimum.
Sound (analogue signal). time Sound (analogue signal) time.
Multi-media Data compression
Huffman code and Lossless Decomposition Prof. Sin-Min Lee Department of Computer Science.
Computer Sciences Department1. 2 Data Compression and techniques.
IS502:M ULTIMEDIA D ESIGN FOR I NFORMATION S YSTEM M ULTIMEDIA OF D ATA C OMPRESSION Presenter Name: Mahmood A.Moneim Supervised By: Prof. Hesham A.Hefny.
Lossless Compression-Statistical Model Lossless Compression One important to note about entropy is that, unlike the thermodynamic measure of entropy,
Submitted To-: Submitted By-: Mrs.Sushma Rani (HOD) Aashish Kr. Goyal (IT-7th) Deepak Soni (IT-8 th )
Component 1.9 Security and Data Management
File Compression 3.3.
Design & Analysis of Algorithm Huffman Coding
Compression & Huffman Codes
Digital Image Processing Lecture 20: Image Compression May 16, 2005
Lesson Objectives Aims You should know about: 1.3.1:
File Compression 3.3.
Applied Algorithmics - week7
AP CSP: Bytes, File Sizes, and Text Compression
Data Compression.
Why Compress? To reduce the volume of data to be transmitted (text, fax, images) To reduce the bandwidth required for transmission and to reduce storage.
Data Compression.
UNIT IV.
Data Compression.
Chapter 8 – Compression Aims: Outline the objectives of compression.
Presentation transcript:

GP-ZIP Genetic Programing File Compression By: Dj Gerena

Road Map Compression Refresher Problems With Generic Compression What GP-Zip Is What GP-Zip Does Conclusion

Purpose Use fewer bits to represent data Reduce file size on disk Increase transmission speed

Lossy vs. Lossless Lossy Removes unnecessary bits Similar colors in a photograph Cannot recreate original file Lossless Does not delete bits Able to rebuild to original state

(Very) Basic Lossless Compression Algorithm Look for common words Create a dictionary When I do good, I feel good. When I do bad, I feel bad. That's my religion. (17 words) 1- When2- bad 4- I1- That’s 3- do1- my 2- good1- religion 2- feel (9 Words)

Problems With Generic Compression No algorithm can reduce every file No-free lunch Can’t guarantee never to increase file size Time may depended on type of file

GP-Zip Genetic Programming Best Application heterogeneous collections of data I.E. Large unsorted folders of data Developed by students of University of Essex, Wivenhoe, United Kindom Been in progress since 2008

Optimizes compression Breaks file into 5 KB blocks Analyzes file type Block passed to proper compression method “waiting area” Program will “predict” best compression method “Waiting area” can be 1600 B to 1 MB (increasing by 1600 B) Can not exceed original file size.

Actual Compression Utilizes Other Compression Methods Arithmetic Coding (AC) Lempel-Ziv-Welch LZW Prediction by Partial Matching (PPMD) Run Length Encoding (RLE) Boolean Minimization

Predicting type Attempt to predict compression ratio for each type of compression Byte Frequency Distribution (BFD) A histogram of the # of character appearances over total # of characters Byte-series Treat bytes as integers and applies a non-linear function to detect similarities between data types.

Predicting Type Decision tree Leaves represent classifications Branches represent conjunctions of features If certain features are present, proceed to leaf

Predicting Type Output of the tree is estimate compression ratio Can use either or both analyzer results to calculate final estimate

Files and Blocks

Finalizing A header is created after breaking down files Blue print for decompression Header will “glue” some blocks [PPMD][PPMD][LZW][LZW][LZW] [PPMD] [LZW] After all bits and blocks are analyzed All blocks are compressed as a labeled All compressed blocks are wrapped with the header

Conclusion Inefficiency of Standard Compression What Compression Is What GP-Zip Does How GP-Zip Works

Work Cited Evolutionary Synthesis of Lossless Compression Algorithms with GP- zip3, Proceedings of the IEEE World Congress on Computational Intelligence, IEEE Ahmed Kattan and Riccardo Poli, "Evolutionary lossless compression with GP-ZIP*," in Proceedings of the 10th annual conference on Genetic and evolutionary computation, Atlanta, Georgia, USA, 2008, 2008, pp Ahmad Kattan and Riccardo Poli, Evolutionary Lossless Compression with GP-ZIP, Proceedings of the IEEE World Congress on Computational Intelligence, IEEE 2008.