Wan Accelerators: Optimizing Network Traffic with Compression Introduction & Motivation Results for Trained Files Another Compression Method Approach &

Slides:



Advertisements
Similar presentations
Data compression. INTRODUCTION If you download many programs and files off the Internet, we have probably encountered.
Advertisements

T.Sharon-A.Frank 1 Multimedia Compression Basics.
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Data Compression CS 147 Minh Nguyen.
Michael Alves, Patrick Dugan, Robert Daniels, Carlos Vicuna
SIMS-201 Compressing Information. 2  Overview Chapter 7: Compression Introduction Entropy Huffman coding Universal coding.
Data Compression Michael J. Watts
Lecture04 Data Compression.
Compression & Huffman Codes
Lecture 6: Huffman Code Thinh Nguyen Oregon State University.
Compression Techniques. Digital Compression Concepts ● Compression techniques are used to replace a file with another that is smaller ● Decompression.
CSCI 3 Chapter 1.8 Data Compression. Chapter 1.8 Data Compression  For the purpose of storing or transferring data, it is often helpful to reduce the.
Text Operations: Coding / Compression Methods. Text Compression Motivation –finding ways to represent the text in fewer bits –reducing costs associated.
Compression & Huffman Codes Fawzi Emad Chau-Wen Tseng Department of Computer Science University of Maryland, College Park.
1 Accelerating Multi-Patterns Matching on Compressed HTTP Traffic Authors: Anat Bremler-Barr, Yaron Koral Presenter: Chia-Ming,Chang Date: Publisher/Conf.
Document and Query Forms Chapter 2. 2 Document & Query Forms Q 1. What is a document? A document is a stored data record in any form A document is a stored.
Lecture 4 Source Coding and Compression Dr.-Ing. Khaled Shawky Hassan
Fundamentals of Multimedia Chapter 7 Lossless Compression Algorithms Ze-Nian Li and Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈.
Data Compression Basics & Huffman Coding
September,2012 File Compression 8/6/ Compiled By:- Solomon W. Demissie.
1 Lossless Compression Multimedia Systems (Module 2) r Lesson 1: m Minimum Redundancy Coding based on Information Theory: Shannon-Fano Coding Huffman Coding.
Lossless Compression Multimedia Systems (Module 2 Lesson 3)
Data Compression Gabriel Laden CS146 – Dr. Sin-Min Lee Spring 2004.
8. Compression. 2 Video and Audio Compression Video and Audio files are very large. Unless we develop and maintain very high bandwidth networks (Gigabytes.
1 Lossless Compression Multimedia Systems (Module 2 Lesson 2) Summary:  Adaptive Coding  Adaptive Huffman Coding Sibling Property Update Algorithm 
Source Coding-Compression
Page 110/6/2015 CSE 40373/60373: Multimedia Systems So far  Audio (scalar values with time), image (2-D data) and video (2-D with time)  Higher fidelity.
 The amount of data we deal with is getting larger  Not only do larger files require more disk space, they take longer to transmit  Many times files.
Survey on Improving Dynamic Web Performance Guide:- Dr. G. ShanmungaSundaram (M.Tech, Ph.D), Assistant Professor, Dept of IT, SMVEC. Aswini. S M.Tech CSE.
Multimedia Specification Design and Production 2012 / Semester 1 / L3 Lecturer: Dr. Nikos Gazepidis
CS 111 – Sept. 10 Quiz Data compression –text –images –sounds Commitment: –Please read rest of chapter 1. –Department picnic next Wednesday.
Multimedia Data Introduction to Lossless Data Compression Dr Sandra I. Woolley Electronic, Electrical.
Compression.  Compression ratio: how much is the size reduced?  Symmetric/asymmetric: time difference to compress, decompress?  Lossless; lossy: any.
Huffman Coding and Decoding TAIABUL HAQUE NAEEMUL HASSAN.
1 Source Coding and Compression Dr.-Ing. Khaled Shawky Hassan Room: C3-222, ext: 1204, Lecture 5.
Compression No. 1  Seattle Pacific University Data Compression Kevin Bolding Electrical Engineering Seattle Pacific University.
Lossless Compression CIS 465 Multimedia. Compression Compression: the process of coding that will effectively reduce the total number of bits needed to.
Huffman Codes Juan A. Rodriguez CS 326 5/13/2003.
CS654: Digital Image Analysis Lecture 34: Different Coding Techniques.
Hanyang University Hyunok Oh Energy Optimal Bit Encoding for Flash Memory.
Digital Image Processing Lecture 22: Image Compression
Bahareh Sarrafzadeh 6111 Fall 2009
Chapter 3 Data Representation. 2 Compressing Files.
Lossless Decomposition and Huffman Codes Sophia Soohoo CS 157B.
Chapter 7 Lossless Compression Algorithms 7.1 Introduction 7.2 Basics of Information Theory 7.3 Run-Length Coding 7.4 Variable-Length Coding (VLC) 7.5.
Lecture 12 Huffman Algorithm. In computer science and information theory, a Huffman code is a particular type of optimal prefix code that is commonly.
Page 1KUT Graduate Course Data Compression Jun-Ki Min.
Huffman code and Lossless Decomposition Prof. Sin-Min Lee Department of Computer Science.
Lampel ZIV (LZ) code The Lempel-Ziv algorithm is a variable-to-fixed length code Basically, there are two versions of the algorithm LZ77 and LZ78 are the.
LZW (Lempel-Ziv-welch) compression method The LZW method to compress data is an evolution of the method originally created by Abraham Lempel and Jacob.
Submitted To-: Submitted By-: Mrs.Sushma Rani (HOD) Aashish Kr. Goyal (IT-7th) Deepak Soni (IT-8 th )
Data Compression Michael J. Watts
HUFFMAN CODES.
Data Coding Run Length Coding
Compression & Huffman Codes
Digital Image Processing Lecture 20: Image Compression May 16, 2005
Increasing Information per Bit
Data Compression.
Lesson Objectives Aims You should know about: 1.3.1:
Applied Algorithmics - week7
The Greedy Method and Text Compression
Information of the LO Subject: Information Theory
Lempel-Ziv-Welch (LZW) Compression Algorithm
Data Compression CS 147 Minh Nguyen.
The Huffman Algorithm We use Huffman algorithm to encode a long message as a long bit string - by assigning a bit string code to each symbol of the alphabet.
Why Compress? To reduce the volume of data to be transmitted (text, fax, images) To reduce the bandwidth required for transmission and to reduce storage.
Chapter 11 Data Compression
UNIT IV.
15 Data Compression Foundations of Computer Science ã Cengage Learning.
15 Data Compression Foundations of Computer Science ã Cengage Learning.
Presentation transcript:

Wan Accelerators: Optimizing Network Traffic with Compression Introduction & Motivation Results for Trained Files Another Compression Method Approach & Programs Bartosz Agas, Marvin Germar, & Christopher Tran Rutgers School of Engineering Dictionary: How It’s Built Citations Dipperstein, Michael. "Huffman Code Discussion and Implementation." Michael.dipperstein.com. Michael Dipperstein. Web. 2 Apr Nelson, Mark and Jean-Loup Gailly. The Data Compression Book, Second Edition. Cambridge, MA: IDG Books Worldwide, Inc., Schumacher III, James. "Lempel Ziv Compression." PlanetSourceCode. Exhedra Solutions, Inc., 18 Apr Web. 30 Mar Use a compression algorithm to create an evolving dictionary with an encoder and decoder Encoder reads in a file and updates the dictionary to reflect new probabilities of more frequent string of data. The compressed data is sent over a PTP connection. Decoder will read metadata bit by bit and decompress the files using the trained dictionary from the encoder. A WAN accelerator is an appliance that can maximize the services of a point-to-point(PTP) connection providing high quality performance using minimal resources. There are various methods that a WAN accelerator uses to optimize resources such as caching repetitive data, compression, and latency optimization. With such means, a WAN accelerator can analyze the packets of data being transferred over the PTP connection and only send the critical or compressed differentials. Consider two local networks where network_1 is connected to network_2 via a PTP connection and the networks frequently send each other similar data packets such as IP headers, s, and documents. A lot of these data packets contain repetitive data segments that do not need to utilize the resources of the PTP connection. Via the aforementioned methods, a WAN accelerator significantly improves the transfer process and minimizes monetary expenses. Investors can save money required for higher PTP connections. Huffman Coding Results for Specific File Types Results show that a company using a WAN accelerator will have reduced size file transmissions, ultimately saving money. The results shown here use a dictionary trained on a batch of files, which can reduce the network’s overall traffic activity A Huffman tree is created to develop a dictionary. This dictionary adapts to the data being transferred giving preferential treatment to data that is sent more frequently. A WAN accelerator’s dictionary learns from all data transfer and is updated unlike normal compression methods that deletes it dictionary after it outputs a file. By using an updateable dictionary, a company can use a WAN accelerator to optimize their network because the dictionary is trained specifically for the company’s common files and Huffman coding produces a dictionary that persist over numerous data transmissions. Given a 6 symbol alphabet with the following symbol probabilities: A = 1, B = 2, C = 4, D = 8, E = 16, F = 32 Step 1. Combine A and B into AB with a probability of 3. Step 2. Combine AB and C into ABC with a probability of 7. Step 3. Combine ABC and D into ABCD with a probability of 15. Step 4. Combine ABCD and E into ABCDE with a probability of 31. Step 5. Combine ABCDE and F into ABCDEF with a probability of 63. The Following tree results: ABCDEF / \ (0)F ABCDE / \ (10)E ABCD / \ (110)D ABC / \ (1110)C AB / \ (11110)A B(11111) To summarize this example, ‘F’ has the highest probability from the letters in this alphabet so it is represented with the shortest bit string possible and in this case just ‘0’. Huffman coding is a technique that uses lossless data compression to reduce the amount of bits required to represent a string of symbols. The algorithm accomplishes its goals by allowing symbols to vary in length. Shorter codes are assigned to the most frequently used symbols, and longer codes to the symbols which appear less frequently in the string. Huffman coding is tested on different file types to determine its effects. Certain file types compressed better than others so the data type being compressed matters. Text files and document files such as.doc or.xls compressed better. Other file types such as.docx..xlsx, and.png are already compressed and recompressing them does not make them smaller. Lempel-Ziv-Welch is another lossless data compression algorithm. It is more efficient than Huffman coding in optimizing network traffic. Similar to Huffman coding, the data type being compressed matters