COMS-E6998 Information Theory in Computer Science

Slides:



Advertisements
Similar presentations
CS B553: A LGORITHMS FOR O PTIMIZATION AND L EARNING aka “Neural and Genetic Approaches to Artificial Intelligence” Spring 2011 Kris Hauser.
Advertisements

Lecture 40 CSE 331 Dec 11, Announcements Solutions to HW 10 and graded HW 9 at end of the lecture Review session on Monday: see blog for details.
CSE 501 Research Overview Atri Rudra
Lecture 41 CSE 331 Dec 10, HW 10 due today Q1 in one pile and Q 3+4 in another I will not take any HW after 1:15pm.
eGovernance Under guidance of Dr. P.V. Kamesam IBM Research Lab New Delhi Ashish Gupta 3 rd Year B.Tech, Computer Science and Engg. IIT Delhi.
COMP 14 – 02: Introduction to Programming Andrew Leaver-Fay August 31, 2005 Monday/Wednesday 3-4:15 pm Peabody 217 Friday 3-3:50pm Peabody 217.
 MODERN DATABASE MANAGEMENT SYSTEMS OVERVIEW BY ENGINEER BILAL AHMAD
EECS 395/495 Algorithmic Techniques for Bioinformatics General Introduction 9/27/2012 Ming-Yang Kao 19/27/2012.
ISE420 Algorithmic Operations Research Asst.Prof.Dr. Arslan M. Örnek Industrial Systems Engineering.
Winter 2015 COMP 2130 Introduction to Computer Systems Computing Science Thompson Rivers University Introduction and Overview.
Advanced Topics in Distributed Systems Fall 2011 Instructor: Costin Raiciu.
Estimating Entropy for Data Streams Khanh Do Ba, Dartmouth College Advisor: S. Muthu Muthukrishnan.
Introduction and Overview Summer 2014 COMP 2130 Introduction to Computer Systems Computing Science Thompson Rivers University.
RESOURCES, TRADE-OFFS, AND LIMITATIONS Group 5 8/27/2014.
Introduction to Science Informatics Lecture 1. What Is Science? a dependence on external verification; an expectation of reproducible results; a focus.
Algorithmic, Game-theoretic and Logical Foundations
11 Algorithmic Techniques for Massive Data (COMS ) Alex Andoni.
&d1 Algorithms and networks Period 3, 2010/2011. &d2 Today Graphs and networks and algorithms: what and why? This course: organization Case introduction:
Management of Broadband Media Assets on Wide Area Networks Lars-Olof Burchard.
Week-6 (Lecture-1) Publishing and Browsing the Web: Publishing: 1. upload the following items on the web Google documents Spreadsheets Presentations drawings.
Lecture 00: Introduction
RAIK 283 Data Structures and Algorithms
Advanced Algorithms Analysis and Design
Accelerating Peer-to-Peer Networks for Video Streaming
Welcome to CS 4390/CS5381: Introduction to Formal Methods
Graduate Seminar In Machine Learning
Jacob R. Lorch Microsoft Research
Improving searches through community clustering of information
LWIP TCP/IP Stack 김백규.
Zhu Han University of Houston Thanks for Professor Dan Wang’s slides
Lower bounds for approximate membership dynamic data structures
Vocabulary Prototype: A preliminary sketch of an idea or model for something new. It’s the original drawing from which something real might be built or.
Principles of Network Applications
Software Design and Architecture
Grid Computing.
Introduction of ECE665 Computer Algorithms
Vocabulary Prototype: A preliminary sketch of an idea or model for something new. It’s the original drawing from which something real might be built or.
Cloud Computing By P.Mahesh
Computer Science 102 Data Structures CSCI-UA
Software Architecture in Practice
CS410: Text Information Systems (Spring 2018)
Add intelligence to Dynamics AX with Cortana Intelligence suite
EE 193: Parallel Computing
Andy Wang Operating Systems COP 4610 / CGS 5765
CS 179 Project Intro.
Objective of This Course
Predicting Traffic Dmitriy Bespalov.
Jeremy Bolton, PhD Assistant Teaching Professor
Andy Wang Operating Systems COP 4610 / CGS 5765
C.U.SHAH COLLEGE OF ENG. & TECH.
cse 521: design and analysis of algorithms
Andy Wang Operating Systems COP 4610 / CGS 5765
ECE 544 Project III Description and Timeline March 23, 2018
Lecture 00: Introduction
Source Encoding and Compression
ITEC 2620M Introduction to Data Structures
Jason Klaus, Duncan Elliott Confidential
Course Overview CS 4640 Programming Languages for Web Applications
CompSci 1: Principles of Computer Science Lecture 1 Course Overview
CS 2530 Intermediate Computing Dr. Schafer
Introduction to Cryptography
Lecture 5: Writing Page
Principles of Imperative Computation
Minwise Hashing and Efficient Search
Online Analytical Processing Stream Data: Is It Feasible?
Andy Wang Operating Systems COP 4610 / CGS 5765
Scalable light field coding using weighted binary images
Course Introduction Data Visualization & Exploration – COMPSCI 590
CS 232 Geometric Algorithms: Lecture 1
Welcome to the First-Year Experience!
Presentation transcript:

COMS-E6998 Information Theory in Computer Science Columbia , Fall 2017

Administrativia Instructor : Omri Weinstein TA : Zhenrui Liao Lectures: Tuesday 4:10-6:20 pm (10-15 min break) . Office Hours: Omri: Tue, 2:30-3:30 pm (or by apt). Zhenrui: Wed 2pm-4pm . Recitations: Optional, possibly a couple to bring up to speed . Original motivation: Netflix, Amazon Instant Video (+pics?) Abstraction of typical scenario: clients miss information during transmissions of network . So this problem models a scenario where clients miss some informatino and NW wishes to minimize re-transmission rate, Course Level : 6K (graduate) -- proof heavy but not technical bootcamp. No prerequisites beyond mathematical maturity + probability theory. Books/HW/reading list – on website (http://infotheorytcs.wikischolars.columbia.edu) .

Grading HW: Bi-weekly (~5-6 in total) : 40% . Final Project : 40% . Scribing 1 lecture : 20% (at most 13 lectures, up to 2 students a lecture) . HW policy: - 5-6 exercises per HW, each totals to 125 pts. - Collaborations allowed as long as you write your own solution separately! (new concepts takes time to sink in, hard to keep up without practice). Original motivation: Netflix, Amazon Instant Video (+pics?) Abstraction of typical scenario: clients miss information during transmissions of network . So this problem models a scenario where clients miss some informatino and NW wishes to minimize re-transmission rate,

Final Research Projects Research/Reading based : Choose a course-related paper, summarize project and prepare 10-min talk (for final presentations day). You’re expected to say “something new” about problem (connection/extension). “Meta project”: related to storage, compression and search in modern file systems (more on this soon). Project based on unexplored problem in dynamic data structures – both theory and applied aspects (many directions to choose from, more on this soon..) Implementation based (storage) : Focus on 1 sub-topic in storage project, implement algorithm on real data repositories (more soon). Small groups allowed. Original motivation: Netflix, Amazon Instant Video (+pics?) Abstraction of typical scenario: clients miss information during transmissions of network . So this problem models a scenario where clients miss some informatino and NW wishes to minimize re-transmission rate, - Consult + converge early on (e.g., during office hours or schedule meeting) - Project proposal deadline : October 10th.

Introduction

Motivating story for this course Engineering perspective: Traditional “home” of Information Theory (IT). IT underlies most modern digital technology (e.g., compression, coding, distributed storage…). Explore some of this theory, applications, extensions. Complexity perspective: Computational Complexity: “How many resources required to solve given computational problem?” (space, time, energy, neural layers, etc.) Holy grail : Unconditional lower bounds on such resources. Unfortunately, majority of Complexity built on conditional hardness results  Original motivation: Netflix, Amazon Instant Video (+pics?) Abstraction of typical scenario: clients miss information during transmissions of network . So this problem models a scenario where clients miss some informatino and NW wishes to minimize re-transmission rate, One of major achievements of TCS : understanding “information bottleneck” of computational models (as opposed to the efficiency of processing data).

The Complexity Theoretic View Course “birthright” : increasing role of information-theoretic mindset and methods in understanding computation (hopefully twist your minds as well). Most topics we’ll encounter are active research fields (e.g., streaming, LPs, circuits complexity, static/dynamic DS...). Many open problems… Original motivation: Netflix, Amazon Instant Video (+pics?) Abstraction of typical scenario: clients miss information during transmissions of network . So this problem models a scenario where clients miss some informatino and NW wishes to minimize re-transmission rate, What are we about to see in this course ? …

Data Compression and Interactive Compression Data compression, a.k.a Shannon’s “source coding” problem (today) : What is the essential amount of communication communication required to transmit/encode a random dataset (mention few natural and practical compression schemes, e.g., Huffman, JPEG). Later in the course: Does Shannon’s compression theory extend to interactive scenarios, i.e., “can we compress conversations”?

Streaming Real-time systems (financial databases, monitoring networks) store huge, rapidly changing data, which is too large to store explicitly. x1 x2 xn x +3 +17 +3 - 6 -15 +3 Long sequence of updates in [M] (e.g., IP addresses of packets passing through a router, DNA sequence, transactions in sensor networks…) Goal: Compute some function f(x) of the final stream (whp), with minimum space. Trivial solution: ~ n lg M = O(n) bits of space – too expensive  Relaxation: approximate f(x) (whp) using sublinear space. (ex: Fp := i |xi|p)

Data Structures : Time-Space tradeoffs Nearest Neighbor problem (cornerstone in most ML apps, optimization, DBs…) Rd Preprocess n pts into smallest possible memory, allowing fast answering of exp(d) >> n queries. ? Trivial solutions: Either exp(d) space (store all answers), or ~ n time (scan entire database) . Original motivation: Netflix, Amazon Instant Video (+pics?) Abstraction of typical scenario: clients miss information during transmissions of network . So this problem models a scenario where clients miss some informatino and NW wishes to minimize re-transmission rate, Are there better tradeoffs ? Some databases are dynamic – maintain them cheaply, online (like streaming)…

Compression and Storage “Dictionary” problem: Dropbox server needs to store file collection X = (X1, X2, …, Xn) » ¹ of multiple users, s.t individual files can be retrieved quickly (few memory accesses) . This field will bring us to a specific natural DS problem that arises in Storage systems. Realistic scenario – Files correlated (Info¹(X) ¿ i Info¹ (Xi) is space benchmark) . Tradeoffs b/w storage space and decoding time (# memory accesses) “locally decodable” compression schemes??

We’ll see later in the course: - Theoretical problem is mostly open (understanding it in general notoriously hard) - For some joint distributions, hopeless to have small space and fast (“local”)decoding… But, wait a minute – real life distributions aren’t like that. 2 natural and important data types (distributions) : (1) Visual ; (2) Genomic . (correlations are huge and dictionaries are important) Final project proposal: Theoretical and Applied. Suggested projects : (a) Define generative model for correlated file albums per data type, then try to design similarity search and compression algs (ideally dynamic). Work with real data repositories. (b) how to search ? (c) how to compress? (d) realistic problem is dynamic – can we handle updates? Original motivation: Netflix, Amazon Instant Video (+pics?) Abstraction of typical scenario: clients miss information during transmissions of network . So this problem models a scenario where clients miss some informatino and NW wishes to minimize re-transmission rate, Will host special session (40-min) on this project late September (doodle poll).

Interactive Compression Data compression (today’s topic). Does Shannon’s classical extend to interactive scenarios? Can we compress conversations ? Linear-Program LBs LPs = powerful algorithmic tool for solving (approximating) combinatorial optimization problems. How large (# vars/constraints) does an LP need to be in order to produce a good sln to resource allocation/packing problems? Original motivation: Netflix, Amazon Instant Video (+pics?) Abstraction of typical scenario: clients miss information during transmissions of network . So this problem models a scenario where clients miss some informatino and NW wishes to minimize re-transmission rate, Common denominator of all these applications? Communication and Information Theory (surprisingly, all implicitly embed some distributed problem over a channel…)

Rough plan (syllabus) Information Theory 101, Data Compression. Communication Complexity. Information Complexity and interactive compression. Applications to Streaming. Applications to Data Structures (static & dynamic). Original motivation: Netflix, Amazon Instant Video (+pics?) Abstraction of typical scenario: clients miss information during transmissions of network . So this problem models a scenario where clients miss some informatino and NW wishes to minimize re-transmission rate,