LightSync: Unsynchronized Visual Communication over Screen-Camera Links Wenjun Hu Hao Gu Qifan Pu.

Slides:



Advertisements
Similar presentations
Wireless Networks Should Spread Spectrum On Demand Ramki Gummadi (MIT) Joint work with Hari Balakrishnan.
Advertisements

Kagawa National College of Technology ShogoYamashita, Ayako Yano, Chisato Mouri, Hideki Tenzou, An AR Radiation Field Visualization System for Monitoring.
Production Process What events happen during the production process?
Data Compression CS 147 Minh Nguyen.
VSMC MIMO: A Spectral Efficient Scheme for Cooperative Relay in Cognitive Radio Networks 1.
STYROFOAM Robert LiKamWa, David Ramirez, Jason Holloway Advisors: Lin Zhong, Behnaam Aazhang, Ashok Veeraraghavan Rice University STYROFOAM – Mobicom.
NIKON COOLPIX S6200. PANORAMA ASSIST Panorama- An unbroken view of an entire surrounding area. Directions for using panorama assist- Use this setting.
Hand Gesture for Taking Self Portrait Shaowei Chu and Jiro Tanaka University of Tsukuba Japan 12th July 15 minutes talk.
Strata: Layered Coding for Scalable Visual Communication Wenjun Hu Jingshu Mao Zihui Huang Yiqing Xue Junfeng She Kaigui Bian Guobin (Jacky) Shen.
C HAPTER 18: E LECTRONIC S PECIAL E FFECTS. V OCABULARY : Chromakey: Type of key effect where a specific color can be blocked from the key camera’s input.
EyePhone: Activating Mobile Phones With Your Eyes Emiliano Miluzzo, Tianyu Wang, Andrew T. Campbell CS Department – Dartmouth College, Hanover, NH, USA.
Motivation Application driven -- VoD, Information on Demand (WWW), education, telemedicine, videoconference, videophone Storage capacity Large capacity.
Living with Interference in Unmanaged Wireless Environments David Wetherall, Daniel Halperin and Tom Anderson Intel Research & University of Washington.
Virtual Dart: An Augmented Reality Game on Mobile Device Supervisor: Professor Michael R. Lyu Prepared by: Lai Chung Sum Siu Ho Tung.
Video enhances, dramatizes, and gives impact to your multimedia application. Your audience will better understand the message of your application.
1 MURI review meeting 09/21/2004 Dynamic Scene Modeling Video and Image Processing Lab University of California, Berkeley Christian Frueh Avideh Zakhor.
Fractal Image Compression
CS4514 Networks1 Distributed Dynamic Channel Selection in Chaotic Wireless Networks By: Matthias Ihmig and Peter Steenkiste Presented by: James Cialdea.
Virtual Dart – An Augmented Reality Game on Mobile Device Supervised by Prof. Michael R. Lyu LYU0604Lai Chung Sum ( )Siu Ho Tung ( )
Enhancing Reliability to Boost the Throughput over Screen-Camera Links Anran Wang, Shuai Ma, Chunming Hu, Jinpeng Huai, Chunyi Peng, Guobin Shen.
Remote Surveillance Vehicle Design Review By: Bill Burgdorf Tom Fisher Eleni Binopolus-Rumayor.
EEL 6935 Embedded Systems Long Presentation 2 Group Member: Qin Chen, Xiang Mao 4/2/20101.
Local Asynchronous Communications. Bit-wise data transmission Data transmission requires: Encoding bits as energy Transmitting energy through medium Decoding.
Slotted Programming for Sensor Networks Roland Flury Roger Wattenhofer Distributed Computing Group ETH Zurich, Switzerland D ISTRIBUTED C OMPUTING.
+ Video Compression Rudina Alhamzi, Danielle Guir, Scott Hansen, Joe Jiang, Jason Ostroski.
Digital Cameras Caitlin Stamper. The first commercially available digital camera was the 1990 Dycam Model 1 (Logitech Fotoman). It used a CCD image sensor,
The Digital Image.
Digital Imaging Systems –I/O. Workflow of digital imaging Two Competing imaging format for motion pictures Film vs Digital Video( TV) Presentation of.
BAR CODE SCANNER A.ANUSHA (06N61A0402). What is bar code? A barcode is a machine readable representation of information. Barcode stores data in widths.
What’s Making That Sound ?
Digital Video and Multimedia If images can portray a powerful message then video (as a series of related images) is a serious consideration for any multimedia.
Input Devices Manual and Automatic By Laura and Gracie.
Enhancing Reliability to Boost the Throughput over Screen-Camera Links Anran Wang, Shuai Ma, Chunming Hu, Jinpeng Huai, Chunyi Peng, Guobin Shen.
By Qian Deng MobiUS: Enable Together-Viewing Video Experience across Two Mobile Devices.
CMPD273 Multimedia System Prepared by Nazrita Ibrahim © UNITEN2002 Multimedia System Characteristic Reference: F. Fluckiger: “Understanding networked multimedia,
2010 Olin Project Idea Keith Gendreau Jeff Livas
Video Video.
DIGITAL Video. Video Creation Video captures the real world therefore video cannot be created in the same sense that images can be created video must.
Context-Aware Interactive Content Adaptation Iqbal Mohomed, Jim Cai, Sina Chavoshi, Eyal de Lara Department of Computer Science University of Toronto MobiSys2006.
Video Compression: Performance evaluation of available codec software Sridhar Godavarthy.
Video Data Hiding using Forbidden Zone and Selective Embedding Submitted Under Team Members.
MULTIMEDIA Multimedia Industries WJEC GCSE ICT. STARTER  Multimedia is the combination of different elements:  Multimedia is used in many different.
December 9, 2014Computer Vision Lecture 23: Motion Analysis 1 Now we will talk about… Motion Analysis.
Compression of Real-Time Cardiac MRI Video Sequences EE 368B Final Project December 8, 2000 Neal K. Bangerter and Julie C. Sabataitis.
Doc.: IEEE a Submission March 2015 Hsin-Mu (Michael) Tsai, NTUSlide 1 Project: IEEE P Working Group for Wireless Personal Area.
Figure ground segregation in video via averaging and color distribution Introduction to Computational and Biological Vision 2013 Dror Zenati.
A Robust Method for Lane Tracking Using RANSAC James Ian Vaughn Daniel Gicklhorn CS664 Computer Vision Cornell University Spring 2008.
Barcodes, MMS, and the Internet’s Cheapest Prices Greg McGrath & Greg Maier Advisors: Professor Cotter, Professor Rudko ECE-499 March 01, 2008.
A CCELEROMETER. Accelerometers are devices that can measure acceleration, but in smartphones, they're able to detect changes in orientation and tell the.
Electrical and Computer Engineering Smart Goggles To Chong Ryan Offir Matt Ferrante James Kestyn Advisor: Dr. Tilman Wolf Team Wolf.
Submission Doc: IEEE a Slide 1 Project: IEEE Working Group for Wireless Personal Area Networks (WPANs) Submission Title: Display.
1 Basics of Video Multimedia Systems (Module 1 Lesson 3) Summary: r Types of Video r Analog vs. Digital Video r Digital Video m Chroma Sub-sampling m HDTV.
Industrial and Scientific Applications
Date of download: 6/9/2016 Copyright © 2016 SPIE. All rights reserved. Experimental setup. Two separate PCs were used to avoid potential losses when streaming.
Digital Video Representation Subject : Audio And Video Systems Name : Makwana Gaurav Er no.: : Class : Electronics & Communication.
ENTERFACE 08 Project 9 “ Tracking-dependent and interactive video projection ” Mid-term presentation August 19th, 2008.
Digital Image -M.V.Ramachandranwww.youtube.com/postmanchandru
NETWORK VIDEO SURVEILLANCE. CCTV Closed-Circuit Television (CCTV) is the use of video cameras to transmit signal to a specific place on a designated device.
1.1 Know the element in basic communication system
Data Compression.
Image Segmentation Classify pixels into groups having similar characteristics.
Data Compression.
Course Activity On-off keying.
Data Compression CS 147 Minh Nguyen.
Lab 1 Camera Communication.
Range Imaging Through Triangulation
COlor Barcode stReaming for smArtphones -Drew Carpenter
Kyoungwoo Lee, Minyoung Kim, Nikil Dutt, and Nalini Venkatasubramanian
Chapter 14 Shading Models.
Scalable light field coding using weighted binary images
Presentation transcript:

LightSync: Unsynchronized Visual Communication over Screen-Camera Links Wenjun Hu Hao Gu Qifan Pu

2D barcodes everywhere

(“Linking” to online information)

A video of barcodes instead “Transmitting” information (vs linking) – E.g., information on Gibraltarian attractions – E.g., movie show times for nearby theaters Receiver Transmitter

Visual communications on phones No need for additional connectivity Free, directional, lightweight links “Optical” software-defined radio – Run as apps on commodity devices

Many challenges Frame synchronization Poor image quality Limited computation power on phones Aesthetics…

In the rest of the talk… Frame synchronization challenges LightSync design Implementation and evaluation

FRAME SYNCHRONIZATION CHALLENGES

Frame rate mismatch Screen side (TX) 30 fps video playback Fairly stable Camera side (RX) 15 – 30 fps in practice Variable – Lighting conditions – Device capability – API/system factors High rate, steady Low rate, unsteady

Original frame Received frame patterns

Original frame Single frame 2-frame mix Mixing pattern varies by line Received frame patterns

2-frame mix Mixing pattern (& #components) varies by line Due to Rolling Shutter CMOS camera sensor scans one line (of pixels) at a time

Received frame patterns Original frame Single frame 2-frame mix 3-frame mix Consecutive frames indicating a missing frame Mixing pattern varies by line

Display frame rate C – Frame capture rate C/2 C 2C Occasional missed frames 4-frame mixes 3-frame mix Occasional missed frames

Synchronized communication Display frame rate C – Frame capture rate C/2 C 2C The only decodable pattern Frame synchronized Under-utilized and inefficient! Carries information But undecodable and dropped Cannot be used (of worst camera)

Unsynchronized communication Display frame rate (fps) 0 30 Works with any display and capture rates Higher camera capture rate, better performance Supported frame capture rate (fps)3015

Requirements Need to be able to decode mixed frames – In-frame color tracking per line Need to guard against lost frames/lines – Inter-frame erasure coding Essentially a temporal code across frames – Works with any (monochrome) per-frame code

Our solution: LightSync Need to be able to decode mixed frames – In-frame grayscale tracking per line Need to guard against lost frames/lines – Inter-frame erasure coding Essentially a temporal code across frames – Works with any (monochrome) per-frame code

LIGHTSYNC DEMO

LightSync Need to be able to decode mixed frames – In-frame grayscale tracking per line Need to guard against lost frames/lines – Inter-frame erasure coding

Per-line tracking: Mixing patterns 3 frames mixing → 3 blocks mixing Bl or w per block → 8 mixing possibilities ??? Frame 1 Frame 2 Frame 3 → RXTX

Per-line tracking: Mixing patterns RX TX Line 1 Line 2 Line 3 Frame 1 Frame 2 Frame 3

Per-line tracking: Mixing patterns Trivial cases RX TX Line 1 Line 2 Line 3 Frame 1 Frame 2 Frame 3

Per-line tracking: Mixing patterns Easy cases RX TX Line 1 Line 2 Line 3 Frame 1 Frame 2 Frame 3

Per-line tracking: Mixing patterns Tricky cases, to be tracked per line RX TX Line 1 Line 2 Line 3 Frame 1 Frame 2 Frame 3

Code area Per-line tracking: Reference blocks Corner Scanning direction Tracking bar orientation Tracking bars

Per-line tracking: Reference blocks Tx frame 1 Tx frame 2 Tx frame 3 Tx frame 4 Tx frame 5 (repeats) Tracking bar 1 Tracking bar 2 Tracking bar 3 Tracking bar 4 4 reference blocks per line

Per-line tracking: Reference blocks Tx frame 1 Tx frame 2 Tx frame 3 Tx frame 4 Tx frame 5 (repeats) Tracking bar 1 Tracking bar 2 Tracking bar 3 Tracking bar 4

Per-line tracking: Reference blocks Tx frame 1 Tx frame 2 Tx frame 3 Tx frame 4 Tx frame 5 (repeats) Tracking bar 1 Tracking bar 2 Tracking bar 3 Tracking bar 4 Decoding by comparison Reference colors of mixes

LightSync Need to be able to decode mixed frames – In-frame grayscale tracking per line Need to guard against lost frames/lines – Inter-frame erasure coding

Inter-frame coding Simple pairwise XOR Group of 3 original frames Coded frames

Inter-frame coding Looping display on screen Coded frames … … All original frames Camera stops frame capture after message decoded

PERFORMANCE

Implementation Current frame capture rate Inter-frame intervals

Performance Goal recap: Works with any display/capture rate combinations Metric: Effective frame capture rate – Actual throughput scales with per-frame capacity

Schematic performance For a 20fps phone camera

Schematic performance For a 20fps phone camera Ideal Peaks at the camera rate Level at higher display rates Ideal

Schematic performance Peaks at ~ half the camera rate For a 20fps phone camera Conventional Drops to 0 at higher display rates Ideal

Schematic performance Conventional LightSync For a 20fps phone camera Peaks at the camera rate Slight drops at higher display rates Ideal

LightSync performance Nokia Lumia 920 (~30 fps)

LightSync performance Nokia Lumia 920 (~30 fps) HTC One X (~20 fps)

LightSync performance Nokia Lumia 920 (~30 fps) HTC One X (~20 fps) Huawei Ascend C8812E (~15 fps) Works with any display and capture rates Higher camera capture rate, better performance

Conclusion and outlook LightSync: Unsynchronized visual communication – Primitive for similar scenarios Lots of potential in the space of visual communication

Thank you!

Related work Temporal barcode design (inter-frame) – Unsynchronized 4D Barcodes Basically slows down display, only for TX up to 10fps Spatial barcode design (per-frame layout) – COBRA: Optimized for phone-to-phone – PixNet: High-end camera, computationally intensive, offline processing Other work on visible light communications