My Perspectives on Graduate Research Panya Chanawangsa Ubiquitous Multimedia Lab Advisor: Dr. Chang Wen Chen 10/14/2014.

About Myself SUNY Buffalo, Ubiquitous Multimedia Lab: 5th-year PhD student. Xerox Corporation, Rochester, New York: August 2012 – May 2013. AFT Computer Vision, Seattle, Washington: June 2013 – August 2013. AFT Computer Vision, Surveillance Camera Applications Group, Seattle, Washington: May 2014 – August 2014.

Ubiquitous Multimedia Lab

Agenda Overview of my group’s research area Overview of my research area My PhD research Exciting (and not so exciting) aspects of doing research What I wish I had known when I joined the program Q&A

Ubiquitous Multimedia Lab HTTP live streaming Video transmission over various networks Mobile video adaptation Quality of experience for multimedia consumers Multimedia in social media context Computer vision and image processing


My Research Overview Computer Vision for Intelligent Transportation Systems

Input image → Computer Vision System → Useful information (e.g., "Puppy, 0.94")

Wikipedia

Computer Vision and its Applications Face recognition Amazon Fire Phone face tracking Facebook facial detection/recognition

Computer Vision and its Applications Image search, Image retrieval Google Image Amazon Firefly

Computer Vision and its Applications Beauty recommendation systems Luoqi Liu et al., Wow! You are so beautiful today!, ACM International Conference on Multimedia, pp..

Beauty recommendation systems Recommendation System Synthesized result Recommendation results “You should do the following: -Have long hair with curls. -Use black eye shadow. -Use number 3 foundation.” Input image

Why Computer Vision is Hard Is there a human in the image?

Why Computer Vision is Hard Input image Features Classifier

“The new approach gives near-perfect separation on the original MIT pedestrian database, so we introduce a more challenging dataset containing over 1800 annotated human images with a large range of pose variations and backgrounds.” Navneet Dalal and Bill Triggs, Histograms of Oriented Gradients for Human Detection, CVPR 2005.
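The "Input image, Features, Classifier" pipeline can be made concrete with a rough sketch of the feature stage. The code below computes a magnitude-weighted histogram of gradient orientations for a single image cell, which is the basic building block of HOG. It is a simplified illustration, not the full Dalal and Triggs descriptor: block normalization and interpolated voting are omitted, and the function name and the 9-bin default are assumptions made for this example.

```python
import math


def orientation_histogram(cell, n_bins=9):
    """Magnitude-weighted histogram of unsigned gradient orientations.

    `cell` is a list of rows of grayscale values. Border pixels are skipped
    so that central differences are always defined. (Hypothetical helper,
    not taken from the slides.)
    """
    rows, cols = len(cell), len(cell[0])
    bin_width = 180.0 / n_bins
    hist = [0.0] * n_bins
    for y in range(1, rows - 1):
        for x in range(1, cols - 1):
            gx = (cell[y][x + 1] - cell[y][x - 1]) / 2.0
            gy = (cell[y + 1][x] - cell[y - 1][x]) / 2.0
            magnitude = math.hypot(gx, gy)
            if magnitude == 0.0:
                continue
            # Unsigned orientation in [0, 180) degrees.
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            index = min(int(angle // bin_width), n_bins - 1)
            hist[index] += magnitude
    # L2-normalize so the result does not depend on overall contrast.
    norm = math.sqrt(sum(v * v for v in hist))
    return [v / norm for v in hist] if norm > 0 else hist
```

In a detector, histograms like this one would be computed over a grid of cells, concatenated into one feature vector, and passed to a classifier.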

Why Computer Vision is Hard

Why Computer Vision is Hard Carl Vondrick, Aditya Khosla, Tomasz Malisiewicz, Antonio Torralba, HOGgles: Visualizing Object Detection Features, IEEE International Conference on Computer Vision

Intelligent Transportation Systems Red light cameras High-occupancy vehicle lane License plate number recognition

Intelligent Transportation Systems Smart parking Real-time traffic monitoring

My Research Overview Lane departure warning system Overtaking vehicle detection Smart parking Drunk-driving detection

Lane Departure Warning System

Research and Implementation Challenges Feature selection: color? edge? Feature detection: Resource constraint: energy, processing power Efficiency: can we meet the real-time requirement? Implementation: Android? iOS? Result validation: ground-truth generation
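To illustrate what an edge-based feature might look like for lane markings, here is a minimal sketch that scans one image row and reports bright runs bounded by a rising and a falling intensity step (dark road, bright paint, dark road). The slides do not describe the actual method, so the function name and the `min_step` threshold are assumptions chosen only for this example.

```python
def marking_candidates(row, min_step=40):
    """Return (start, end) column pairs of bright runs in one grayscale row.

    A run starts at a rising step of at least `min_step` and ends just
    before the next falling step of at least `min_step`. (Hypothetical
    helper; threshold chosen arbitrarily.)
    """
    candidates = []
    start = None
    for x in range(1, len(row)):
        step = row[x] - row[x - 1]
        if step >= min_step:
            start = x  # rising edge: road -> marking
        elif step <= -min_step and start is not None:
            candidates.append((start, x - 1))  # falling edge: marking -> road
            start = None
    return candidates
```

A scan like this touches each pixel once, which matters under the energy and real-time constraints listed above; a color-based feature would replace the intensity step with a test on color channels.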

Overtaking Vehicle Detection System

Research and Implementation Challenges Feature selection: HOG? Symmetry? Feature detection: highly dynamic scene Efficiency: can we meet the real-time requirement? Accuracy: how do we make an accurate prediction?
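The symmetry cue can be sketched as follows, on the assumption that the rear of a vehicle looks roughly the same when mirrored left to right. The function compares each pixel with its mirror partner and returns a score between 0 and 1. The name and the scoring formula are illustrative choices, not the method used in the talk.

```python
def mirror_symmetry_score(patch):
    """Score in [0, 1]; 1 means the patch equals its left-right mirror.

    `patch` is a list of equal-length rows of grayscale values in [0, 255].
    (Hypothetical helper for illustration.)
    """
    total_diff = 0.0
    count = 0
    for row in patch:
        width = len(row)
        for x in range(width // 2):
            # Compare each pixel with its mirror partner in the same row.
            total_diff += abs(row[x] - row[width - 1 - x])
            count += 1
    if count == 0:
        return 1.0
    return 1.0 - (total_diff / count) / 255.0
```

A candidate window with a high score would then be passed on to a stronger check, since many non-vehicle regions (for example, uniform road surface) are also symmetric.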

Drunk Driving Detection Is this driver drunk?

Basic Idea 1. Use NHTSA’s visual cues for police officers.

Basic Idea 2. What are some of the effects of alcohol on driving performance? User studies: in collaboration with Dr. Sean Wu from the IE department

Basic Idea 3. Approach the problem from the ground up.

Driving Parameters Ability to maintain lateral position Speed variability Stopping distance from stop signs and traffic lights Turning radius
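As a rough illustration of how such parameters could be computed from a tracked vehicle trajectory, the sketch below derives speeds from timestamped positions, measures variability as a population standard deviation, and estimates a turning radius as the radius of the circle through three trajectory points. All function names, the units (metres and seconds), and the choice of standard deviation as the variability measure are assumptions for this example.

```python
import math
import statistics


def speeds(positions, timestamps):
    """Speeds between consecutive (x, y) positions (metres) at given times (seconds)."""
    result = []
    for (x0, y0), (x1, y1), t0, t1 in zip(
        positions, positions[1:], timestamps, timestamps[1:]
    ):
        result.append(math.hypot(x1 - x0, y1 - y0) / (t1 - t0))
    return result


def variability(values):
    """Population standard deviation, used here for speed or lateral offset."""
    return statistics.pstdev(values)


def turning_radius(p1, p2, p3):
    """Radius of the circle through three points; infinite if they are collinear."""
    a = math.dist(p1, p2)
    b = math.dist(p2, p3)
    c = math.dist(p1, p3)
    # The cross product equals twice the signed area of the triangle.
    cross = (p2[0] - p1[0]) * (p3[1] - p1[1]) - (p2[1] - p1[1]) * (p3[0] - p1[0])
    if cross == 0:
        return math.inf
    # Circumradius R = abc / (4 * area) = abc / (2 * |cross|).
    return (a * b * c) / (2.0 * abs(cross))
```

For example, `variability(speeds(positions, timestamps))` gives one number for speed variability, and the same `variability` function applied to a list of lateral offsets from the lane center gives a lane-keeping measure.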

Data Acquisition Bumblebee XB3

Initial System Setup

3D camera IEEE 1394 cables Jib Weights Safety triangle Portable battery Laptop

Dataset Tracking of instrument vehicle Multiple vehicle tracking

Dataset Lane keeping

Dataset Turning radius

Dataset Stopping distance

3D Processing Vehicle mask Vehicle point cloud: front view, top view
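One common way to get from a vehicle mask to a vehicle point cloud with a stereo camera is the standard pinhole relation Z = f * B / d, where f is the focal length in pixels, B the stereo baseline, and d the disparity. The slides do not say how the point cloud was actually produced, so the sketch below is an assumption-laden illustration; the function name and all parameter names are hypothetical.

```python
def masked_point_cloud(disparity, mask, focal_px, baseline_m, cx, cy):
    """Back-project masked pixels with valid disparity to 3D camera coordinates.

    `disparity` and `mask` are lists of rows with the same shape. Returns a
    list of (x, y, z) tuples in metres. (Hypothetical helper; assumes a
    rectified stereo pair and a pinhole model with principal point (cx, cy).)
    """
    points = []
    for v, (disp_row, mask_row) in enumerate(zip(disparity, mask)):
        for u, (d, keep) in enumerate(zip(disp_row, mask_row)):
            if not keep or d <= 0:
                continue  # outside the vehicle mask or no valid stereo match
            z = focal_px * baseline_m / d      # depth from disparity
            x = (u - cx) * z / focal_px        # lateral offset
            y = (v - cy) * z / focal_px        # vertical offset
            points.append((x, y, z))
    return points
```

Under this convention, a front view plots the (x, y) coordinates of the points and a top view plots (x, z).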

Extracted 2D/3D Trajectories Trajectories of all the vehicles in data set 1

What I wish I had known way back Have many interests; focus on one. Four years is a short period of time. Treat your PhD like a full-time job. Prioritize your tasks. Make sure you are truly passionate about your research topic. Ask yourself what you really want to do in life. Do internships.

What gets me excited Freedom to pursue my academic curiosity Collaboration with top-notch researchers on funded projects High-impact and practical research Computer vision applications are everywhere. Lots of research challenges and extremely difficult problems: object recognition, action recognition, robotics

Academic vs. Industry Research Access to large datasets Shared codebase vs. implementing everything yourself Freedom to pursue your research interests Funding

Questions?