Video Surveillance: Legally Blind? Peter Kovesi School of Computer Science & Software Engineering The University of Western Australia.

Slides:



Advertisements
Similar presentations
Low-Complexity Transform and Quantization in H.264/AVC
Advertisements

Fill in missing numbers or operations
Area of Triangles.
[1] AN ANALYSIS OF DIGITAL WATERMARKING IN FREQUENCY DOMAIN.
Multiplication X 1 1 x 1 = 1 2 x 1 = 2 3 x 1 = 3 4 x 1 = 4 5 x 1 = 5 6 x 1 = 6 7 x 1 = 7 8 x 1 = 8 9 x 1 = 9 10 x 1 = x 1 = x 1 = 12 X 2 1.
Division ÷ 1 1 ÷ 1 = 1 2 ÷ 1 = 2 3 ÷ 1 = 3 4 ÷ 1 = 4 5 ÷ 1 = 5 6 ÷ 1 = 6 7 ÷ 1 = 7 8 ÷ 1 = 8 9 ÷ 1 = 9 10 ÷ 1 = ÷ 1 = ÷ 1 = 12 ÷ 2 2 ÷ 2 =
CALENDAR.
1 1  1 =.
1  1 =.
Year 5 Term 3 Unit 6b Day 1.
The Physical Stimulus: Spatial pattern (This is a poorly-generated approximation to a sine wave)
The 5S numbers game..
1 OFDM Synchronization Speaker:. Wireless Access Tech. Lab. CCU Wireless Access Tech. Lab. 2 Outline OFDM System Description Synchronization What is Synchronization?
Factoring Quadratics — ax² + bx + c Topic
1 Photometric Stereo Reconstruction Dr. Maria E. Angelopoulou.
Matthias Wimmer, Bernd Radig, Michael Beetz Chair for Image Understanding Computer Science TU München, Germany A Person and Context.
Multimedia System Video
Computer vision: models, learning and inference
Introduction to Eye Tracking
#1UNIT C Describes a material which allows light to pass through easily.
Forensic Applications of Computer Vision
Shapelets Correlated with Surface Normals Produce Surfaces Peter Kovesi School of Computer Science & Software Engineering The University of Western Australia.
JPEG Compresses real images Standard set by the Joint Photographic Experts Group in 1991.
Md. Monjur –ul-Hasan Department of Computer Science & Engineering Chittagong University of Engineering & Technology Chittagong 4349
Computer Vision Lecture 7: The Fourier Transform
Wavelets Fast Multiresolution Image Querying Jacobs et.al. SIGGRAPH95.
Image Data Representations and Standards
Image Processing IB Paper 8 – Part A Ognjen Arandjelović Ognjen Arandjelović
School of Computing Science Simon Fraser University
JPEG.
5. 1 JPEG “ JPEG ” is Joint Photographic Experts Group. compresses pictures which don't have sharp changes e.g. landscape pictures. May lose some of the.
Facial Recognition Facial recognition software - based on the ability to recognize a face and then measure the various features of the face. Each human.
Video Surveillance is Useless Peter Kovesi School of Computer Science & Software Engineering The University of Western Australia.
Roger Cheng (JPEG slides courtesy of Brian Bailey) Spring 2007
1 JPEG Compression CSC361/661 Burg/Wong. 2 Fact about JPEG Compression JPEG stands for Joint Photographic Experts Group JPEG compression is used with.jpg.
Image Compression JPEG. Fact about JPEG Compression JPEG stands for Joint Photographic Experts Group JPEG compression is used with.jpg and can be embedded.
Image and Video Compression
Image Compression - JPEG. Video Compression MPEG –Audio compression Lossy / perceptually lossless / lossless 3 layers Models based on speech generation.
Trevor McCasland Arch Kelley.  Goal: reduce the size of stored files and data while retaining all necessary perceptual information  Used to create an.
CIS 601 Fall 2004 Introduction to Computer Vision and Intelligent Systems Longin Jan Latecki Parts are based on lectures of Rolf Lakaemper and David Young.
Page 18/30/2015 CSE 40373/60373: Multimedia Systems 4.2 Color Models in Images  Colors models and spaces used for stored, displayed, and printed images.
JPEG C OMPRESSION A LGORITHM I N CUDA Group Members: Pranit Patel Manisha Tatikonda Jeff Wong Jarek Marczewski Date: April 14, 2009.
Introduction to JPEG Alireza Shafaei ( ) Fall 2005.
CS Spring 2012 CS 414 – Multimedia Systems Design Lecture 8 – JPEG Compression (Part 3) Klara Nahrstedt Spring 2012.
ECE472/572 - Lecture 12 Image Compression – Lossy Compression Techniques 11/10/11.
Multimedia Data Video Compression The MPEG-1 Standard
DATA COMPRESSION LOSSY COMPRESSION METHODS What it is… A compression of information that is acceptable in pictures or videos, but not texts or programs.
DIGITAL Video. Video Creation Video captures the real world therefore video cannot be created in the same sense that images can be created video must.
Image Processing and Computer Vision: 91. Image and Video Coding Compressing data to a smaller volume without losing (too much) information.
Multimedia Data DCT Image Compression
8. 1 MPEG MPEG is Moving Picture Experts Group On 1992 MPEG-1 was the standard, but was replaced only a year after by MPEG-2. Nowadays, MPEG-2 is gradually.
Understanding JPEG MIT-CETI Xi’an ‘99 Lecture 10 Ben Walter, Lan Chen, Wei Hu.
Multimedia. What is a graphic?  A graphic can be a: Chart Drawing Painting Photograph Logo Navigation button Diagram.
JPEG - JPEG2000 Isabelle Marque JPEGJPEG2000. JPEG Joint Photographic Experts Group Committe created in 1986 by: International Organization for Standardization.
The task of compression consists of two components, an encoding algorithm that takes a file and generates a “compressed” representation (hopefully with.
Visual Computing Computer Vision 2 INFO410 & INFO350 S2 2015
HOW JEPG WORKS Presented by: Hao Zhong For 6111 Advanced Algorithm Course.
Data Compression Data Compression For Images. Acknowledgement Most of this lecture note has been taken from the lecture note on Multimedia and HCI course.
JPEG.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Robotics Chapter 6 – Machine Vision Dr. Amit Goradia.
Face Recognition Technology By Catherine jenni christy.M.sc.
Algorithms in the Real World
DCT IMAGE COMPRESSION.
A Simple Image Compression : JPEG
JPEG Image Coding Standard
Camera Selection and Testing
JPEG Still Image Data Compression Standard
Image Compression Techniques
Presentation transcript:

Video Surveillance: Legally Blind? Peter Kovesi School of Computer Science & Software Engineering The University of Western Australia

Questions What image quality do we need for identification? How do you measure image quality? What is the image quality from a video camera? What is the effect on image quality when you: record to video tape? use image compression?

Humans are very bad at recognizing unfamiliar faces Kemp, Towell and Pike (1997) tested the value of having photos on credit cards. When a user presented a card with a photograph of someone else that had some resemblance to the user, they were challenged less than 40% of the time. Bruce et al. (1999, 2001) have tested the ability of people to match good quality CCTV images of unfamiliar faces under a variety of scenarios. Correct recognition rates are typically only 70-80%.

Good quality photograph of target Array of 10 good quality CCTV images Bruce et al (1999). Is this person in the array? If they are present match the person.

Good quality photograph of target Array of 10 good quality CCTV images Bruce et al (1999). Is this person in the array? If they are present match the person.

Good quality photograph of target Array of 10 good quality CCTV images When target was present in the array. 12% picked wrong person and 18% said they were not present (overall only 70% correct). When target was not present in the array 70% still matched the target to someone in the array. Bruce et al (1999). Is this person in the array? If they are present match the person.

Face recognition performance by humans is poor. Face recognition performance by machine is becoming quite good - but only if the images are of good quality. Surveillance video rarely provides good quality images.

Face recognition performance by humans is poor. Face recognition performance by machine is becoming quite good - but only if the images are of good quality. Surveillance video rarely provides good quality images. What image quality is needed for face identification?

Image quality is defined by many attributes Minimum feature size that can be resolved Noise level Quality of luminance reproduction Quality of colour reproduction.

(Hayes, Morrone and Burr 1986) (Costen, Parker and Craw 1996) (Nasanen 1999) In humans it has been found that face recognition is tuned to a set of spatial frequencies ranging from about 20 cycles per face width down to about 5 cycles per face width. 20 cycles 10 cycles 5 cycles Human Face Recognition Maximum sensitivity is centred around 8 to 13 cycles/face width. To recognize with confidence you need to be able to resolve down to 20 cycles/face width

(Hayes, Morrone and Burr 1986) (Costen, Parker and Craw 1996) (Nasanen 1999) In humans it has been found that face recognition is tuned to a set of spatial frequencies ranging from about 20 cycles per face width down to about 5 cycles per face width. ~ 160mm 20 cycles 10 cycles 5 cycles Human Face Recognition 8mm 16mm Maximum sensitivity is centred around 8 to 13 cycles/face width. To recognize with confidence you need to be able to resolve down to 20 cycles/face width

1951 USAF Chart Groupings of 6 pairs of bars. Each successive set is half the size of the previous.

1951 USAF Chart Groupings of 6 pairs of bars. Each successive set is half the size of the previous. 16mm 8mm

Eye charts also provide a simple way of measuring the minimum feature size that can be resolved.

20/20 Vision… … or in metric, 6/6 vision Snellen fraction 6 6 Distance at which you should be able to read the line Distance at which you can read the line on the chart Minimum Angle of Resolution

Ian Bailey and Jan Lovie The logMAR chart

88mm 72mm 58mm 36mm 44mm 6/6 6/12 6/24 6/48 Snellen fraction Letter height Number plate letters 80mm Average eye spacing 65mm 9mm 18mm 6/60 (legally blind)

Tests conducted with Pulnix TM6CN 1/2” CCD camera positioned 6m from the target. Images were digitized directly from the camera using a Data Translation 3155 frame grabber C-mount lenses: 4mm 6mm 8.5mm 12.5mm 16mm

4mm lens

6mm lens

8.5mm lens

12.5mm lens

16mm lens

Camera image recorded to video, then played back and digitized. (Look at the USAF chart) Camera image digitized directly. Expect to lose quality when images are recorded to video (cropped images taken with 12.5mm lens)

Compression is problematic. Test targets survive compression well, but faces do not. JPEG image quality 0 (14kB)JPEG image quality 4 (24kB) Original PNG image (190kB) JPEG images compressed using Photoshop. Image ‘quality’ can range from

JPEG (14kB)JPEG (24kB)Original Faces do not survive compression well

What Does Compression Do? Image is divided into 8x8 blocks. Discrete Cosine Transform is applied to each block. The transform coefficients are quantized, many will be rounded to zero. When reconstructed, the amplitude and phase of the spatial frequencies within each 8x8 block will be altered. The 64 basis functions of an 8x8 Discrete Cosine Transform JPEG and MPEG

12.5mm lens at 6m No compression ~ 40 pixels

12.5mm lens at 6m 18:1 compression

12.5mm lens at 6m 18:1 compression

12.5mm lens at 6m 31:1 compression

12.5mm lens at 6m 31:1 compression 40 pixels across face = 5 DCT blocks Spatial frequencies from 5 cycles/face width upwards are all corrupted This is exactly the range that is most important for face recognition!

A Real Surveillance Camera Installation…

4.8 m

Image quality is defined by many attributes Minimum feature size that can be resolved Noise level Quality of luminance reproduction Quality of colour reproduction.

Original laser scanned faces Same shape, varying pigmentation Same pigmentation, varying shape (Russell et al 2007) Luminance and colour cues are at least as important as shape cues People perform about equally well using just shape information or just pigmentation cues.

Hue values as greyscale 16 x16 macro-blocks Image compression typically quantizes colour information very heavily…

Conclusions Surveillance video, as it is currently used, is almost useless for identification. Face recognition in low resolution images is badly affected by compression artifacts. Image quality standards are needed for surveillance camera installations.

Conan O’Brien US talk show host Tarja Halonen President of Finland