MobileASL: Making Cell Phones Accessible to the Deaf Community Anna Cavender Richard Ladner, Eve Riskin University of Washington.

Slides:



Advertisements
Similar presentations
1 P2P Layered Streaming for Heterogeneous Networks in PPSP K. Wu, Z. Lei, D. Chiu James Zhibin Lei 17/03/2010.
Advertisements

Tarun Bansal*, Karthik Sundaresan+,
Tanmoy Bhattacharya Coordinator Equal Opportunity Cell University of Delhi ICT for PwDs: with Special Reference to Indian Sign Language.
Video Phone for Deaf and Hard-Hearing Community By Hudson Asiema.
Google Confidential and Proprietary Video and Deaf+Hearing Community Naomi Black & Nathan Kester with Ben Rubin, NTID and Willie King, ZVRS.
LBVC: Towards Low-bandwidth Video Chats on Smartphones Xin Qi, Qing Yang, David T. Nguyen, Gang Zhou, Ge Peng College of William and Mary 1.
1 Outline  Introduction to JEPG2000  Why another image compression technique  Features  Discrete Wavelet Transform  Wavelet transform  Wavelet implementation.
Panoptes: A Scalable Architecture for Video Sensor Networking Applications Wu-chi Feng, Brian Code, Ed Kaiser, Mike Shea, Wu-chang Feng (OGI: The Oregon.
1 Adaptive resource management with dynamic reallocation for layered multimedia on wireless mobile communication net work Date : 2005/06/07 Student : Jia-Hao.
Recursive End-to-end Distortion Estimation with Model-based Cross-correlation Approximation Hua Yang, Kenneth Rose Signal Compression Lab University of.
A Layered Hybrid ARQ Scheme for Scalable Video Multicast over Wireless Networks Zhengye Liu, Joint work with Zhenyu Wu.
An Error-Resilient GOP Structure for Robust Video Transmission Tao Fang, Lap-Pui Chau Electrical and Electronic Engineering, Nanyan Techonological University.
The 3G signing revolution Gunnar Hellström, Omnitor TDI-2007.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
A Real-Time Video Multicast Architecture for Assured Forwarding Services Ashraf Matrawy, Ioannis Lambadaris IEEE TRANSACTIONS ON MULTIMEDIA, AUGUST 2005.
Real-time Transport Protocol Matt Boutell CS457: Computer Networks November 15, 2001.
Streaming Video Gabriel Nell UC Berkeley. Outline Scalable MPEG-4 video – Layered coding method – Integrated transport-decoder buffer model RAP streaming.
ATSC Digital Television
Virtual Meetings Increasing Collaboration While Reducing Costs and Ensuring Business Continuity Ram Narayanaswamy CTO 8x8, Inc.
The Common Core State Standards (CCSS) are a set of sequential benchmarks that identify what a child needs to have learned and be able to do by the end.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
Strictly private and confidential
Remote Surveillance System Presented by: Robarin Holdings Limited Telephone: Facsimile:
CS 1 with Robots CS1301 – Where it Fits Institute for Personal Robots in Education (IPRE)‏
1 CCTV SYSTEMS RESOLUTIONS USED IN CCTV. 2 CCTV SYSTEMS CCTV resolution is measured in vertical and horizontal pixel dimensions and typically limited.
MPEG-4 & Windows Media Dr. Jordi Ribas-Corbera Lead Program Manager, Codecs Digital Media Division Microsoft Corp
BlindAid Semester Final Presentation Sandra Mau, Nik Melchior, and Maxim Makatchev.
1 An Extensible Videoconference Tool for a Collaborative Computing Network Junjun He.
Business Data Communications, Stallings 1 Chapter 1: Introduction William Stallings Business Data Communications 6 th Edition.
HiTrack Introduction Shanghai Zhuo-Shi Software Technology Co., Ltd Shanghai Zhuo-Shi Software Technology Co., Ltd
1 Seminar Presentation Multimedia Audio / Video Communication Standards Instructor: Dr. Imran Ahmad By: Ju Wang November 7, 2003.
MobiQuitous 2004Kimaya Sanzgiri Leveraging Mobility to Improve Quality of Service in Mobile Networks Kimaya Sanzgiri and Elizabeth Belding-Royer Department.
Integrating Fine-Grained Application Adaptation with Global Adaptation for Saving Energy Vibhore Vardhan, Daniel G. Sachs, Wanghong Yuan, Albert F. Harris,
Introduction GAM 376 Robin Burke Winter Outline Introductions Syllabus.
School of Computer Science & Information Technology G6DPMM - Lecture 15 Media Design III – Video & Animation.
Teleconferencing. According to the Travel Industry Association, business travel rose 14% between 1994 and 1999, with nearly half of the 44 million travelers.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
R∙I∙T NTID Center on Access Technology Videoconference Communication Support Bill Clymer Christine Monikowski Benjamin Rubin February 27, A presentation.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Video Streaming over Cooperative Wireless Networks Mohamed Hefeeda (Joint.
6: Wireless and Mobile Networks6-1 Chapter 6 Wireless and Mobile Networks Computer Networking: A Top Down Approach Featuring the Internet, 3 rd edition.
RANI NALAMARU DEPARTMENT OF COMPUTER SCIENCE BALL STATE UNIVERSITY RANI NALAMARU DEPARTMENT OF COMPUTER SCIENCE BALL STATE UNIVERSITY Efficient Transmission.
FYP: LYU0001 Wireless-based Mobile E-Commerce on the Web Supervisor: Prof. Michael R. Lyu By: Tony, Wat Hong Fai Harris, Yan Wai Keung.
Distribution of Multimedia Data Over a Wireless Network (DMDoWN): An Introduction Presented By: Rafidah Md Noor Faculty of Computer Science & Information.
The NIProxy: a Flexible Proxy Server Supporting Client Bandwidth Management and Multimedia Service Provision Maarten Wijnants Wim Lamotte.
An Overlay Network Providing Application-Aware Multimedia Services Maarten Wijnants Bart Cornelissen Wim Lamotte Bart De Vleeschauwer.
Lisa B. Elliot, Ph.D. & Robb Dooling RIT/NTID Center on Access Technology NTID Foundation Board Meeting, May 4, 2012 Rochester, NY.
EDUCAUSE 2005 Annual Conference October 19, 2005.
CROSS-LAYER OPTIMIZATION PRESENTED BY M RAHMAN ID:
Aug 25, 2005 page1 Aug 25, 2005 Integration of Advanced Video/Speech Codecs into AccessGrid National Center for High Performance Computing Speaker: Barz.
SIMPLE: Stable Increased Throughput Multi-hop Link Efficient Protocol For WBANs Qaisar Nadeem Department of Electrical Engineering Comsats Institute of.
WIRELESS SYSTEMS Adnan Iqbal MCS-MIT 1 1.
Cooperative Layered Wireless Video Multicast Ozgu Alay, Thanasis Korakis, Yao Wang, Elza Erkip, Shivendra Panwar.
COM 597 Streaming Media Class 7 July 13, General Notes and Comments Travis Petershagen, Seattle Podcasting Network Pete Grondal, Mobile Devices,
AIMS’99 Workshop Heidelberg, May 1999 Assessing Audio Visual Quality P905 - AQUAVIT Assessment of Quality for audio-visual signals over Internet.
Performance Study of Live Video Streaming over Highway Vehicular Ad hoc Networks Author:Fei Xie, Kien A. Hua, Wenjing Wang, and Yao H. Ho 2007 IEEE Speaker:
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
Chapter Five Making Connections Efficient: Multiplexing and Compression Data Communications and Computer Networks: A Business User’s Approach Eighth Edition.
Objective This presentation covers the Generation of Telecom Network Evolution. Basically the presentation aims on the evolution from 1G to 4G and some.
A Comparison of RaDiO and CoDiO over IEEE WLANs May 25 th Jeonghun Noh Deepesh Jain A Comparison of RaDiO and CoDiO over IEEE WLANs.
August Video Management Software ViconNet Enterprise Video Management Software Hybrid DVR Kollector Strike Kollector Force Plug & Play NVR HDExpress.
SourceAmerica Design Challenge By: Nicholas Klofta.
William H. Bowers – Specification Techniques Torres 17.
Scientific Advisory Group Meeting May 7, Scientific Advisory Group Meeting May 7, 2004 Access to Technical Education Through Sign Language Interpreting:
MobileASL: Intelligibility of Sign Language as Constrained by Mobile Phone Technology Richard Ladner, Eve Riskin Dane Barney, Anna Cavender, Neva Cherniavsky,
Technical Seminar Presentation Presented by : SARAT KUMAR BEHERA NATIONAL INSTITUTE OF SCIENCE AND TECHNOLOGY [1] Presented By SARAT KUMAR BEHERA Roll.
CS 1 with Robots CS1301 – Where it Fits Institute for Personal Robots in Education (IPRE)‏
Mobile Speech-to-Text Captioning Services: An Accommodation in STEM Laboratory Courses Michael Stinson, Pamela Francis, and, Lisa Elliot National Technical.
Using Speech Recognition to Predict VoIP Quality
Brad Leon Vice president & Chief Sales Officer
Raja Kushalnagar Poorna Kushalnagar Fadi Haddad
Presentation transcript:

MobileASL: Making Cell Phones Accessible to the Deaf Community Anna Cavender Richard Ladner, Eve Riskin University of Washington

2 Two Themes MobileASL Cyber-Community for Advancing Deaf and Hard of Hearing in STEM (Science Technology Engineering and Math)

3 Challenges: Limited network bandwidth Limited processing power on cell phones Our goal: ASL communication using video cell phones over current U.S. cell phone network

4 Cell Phone Network Constraints Low bit rate goal –GPRS (General Packet Radio Service) Ranges from 30kbps to 80kbps (download) Perhaps half that for upload Unpredictable variation and packet loss 3G = 3 rd Generation –Special service –Not yet widespread –Will still have congestion Service providers more likely to offer services if throughput can be minimized.

5 MobileASL Network Goals Sign language presents a unique challenge: –Not just appearance of video, intelligibility too! –If it works for sign language, other video applications benefit too. MobileASL is about fair access to the current network –As soon as possible, no special accommodations –Not geographically limited –Lower bitrate + power savings = more accessible

6 Architecture Camera Encoder Transmitter Sender Player Decoder Receiver Cell Phone Network Cell phone Encoder

7 Codec Used: x264 Open source implementation of H.264 standard –Doubles compression ratio over MPEG2 –Replacing MPEG2 as industry standard x264 offers faster encoding Off-the-shelf H.264 decoder can be used –(speculation about H.264 on the iPhone)

8 Outline Motivation Introduction MobileASL Consumers Eyetracking Motivation Video Phone Study Compression Challenges Current Work Conclusions

9 Discussions with Consumers Open ended questions: –Physical Setup Camera, distance, … –Features Compatibility, text, … –Scenarios Lighting, driving, relay services, …

10 Consumer Response “I don’t foresee any limitations. I would use the phone anywhere: the grocery store, the bus, the car, a restaurant, … anywhere!” There is a need within the Deaf Community for mobile ASL conversations Existing video phone technology (with minor modifications) would be usable

11 Video Encoding for ASL Constraints of cell phone network create video compression challenges How do we compress ASL video to maximize intelligibility?

12 Outline Motivation Introduction MobileASL Consumers Eyetracking Motivation Video Phone Study Compression Challenges Current Work Conclusions

13 Eyetracking Studies Participants watched ASL videos while eye movements were tracked Important regions of the video could be encoded differently * Muir et al. (2005) and Agrafiotis et al. (2003)

14 Eyetracking Results 95% of eye movements within 2 degrees visual angle of the signer’s face (demo) Implications: Face region of video is most visually important –Detailed grammar in face requires foveal vision –Hands and arms can be viewed in peripheral vision * Muir et al. (2005) and Agrafiotis et al. (2003)

15 Outline Motivation Introduction MobileASL Consumers Eyetracking Motivation Video Phone Study Compression Challenges Current Work Conclusions

16 Mobile Video Phone Study 3 Region-of-Interest (ROI) values 2 Frame rates, frames per second (FPS) 3 different Bit rates –15 kbps, 20 kbps, 25 kbps 18 participants (7 women) –10 Deaf, 5 hearing, 3 CODA* –All fluent in ASL * CODA = (Hearing) Child of a Deaf Adult

17 Example of ROI Varied quality in fixed-sized region around the face (demo) 2x quality in face 4x quality in face

18 Examples of FPS Varied frame rate: 10 fps and 15 fps For a given bit rate: Fewer frames = more bits per frame (demo)

19 Questionnaire

20 User Preferences Results Bit RateFrame RateRegion of Interest

21 Implications of results A mid-range ROI was preferred –Optimal tradeoff between clarity in face and distortion in rest of “sign-box” Lower frame rate preferred –Optimal tradeoff between clarity of frames and number of frames per second Results independent of bit rate

22 Outline Motivation Introduction MobileASL Consumers Eyetracking Motivation Video Phone Study Compression Challenges Current Work Conclusions

23 Rate, distortion and complexity optimization H.264 encoder Input parameters Raw video Compressed video H.264 standard provides 50% bit savings over MPEG 2, but with higher complexity. Objective: Achieve best possible quality for least encoding time at a given bitrate

24 Time – Complexity Tradeoff Encoding Time MSE

25 Encoding/Decoding on the Cell Phone Implemented a command-line version of x264 on a cell phone using Windows Mobile Edition 5.0.

26 QVGA 320x240

27 Outline Motivation Introduction MobileASL Consumers Eyetracking Motivation Video Phone Study Compression Challenges Current Work Conclusions

28 Dynamic Region-of-Interest Skin detection algorithms Region-based metric for bit allocation –Automatically determine priority for face and hands based on currently available bitrate.

29 Activity Recognition Can save data and power by detecting: –Fingerspelling Increase frame rate for better intelligibility –Signing Sign language-specific encoding –“Just listening” Less processing and transmission needed (demo)

30 User Interface Leverages users’ prior experience with video conferencing interfaces (such as Sorenson, HOVRS, etc.) Optimized for small screen space Initial state user interface Incoming Video Stream Outgoing Video Stream Control Toolbar Toggle Privacy Mode Toggle Chat View Video Screen Layout Toggle Status Bar

31 Building the System

32 MobileASL Team Principal Investigators –Richard Ladner, Eve Riskin, and Sheila Hemami (Cornell) Graduate Students –Anna Cavender, Rahul Vanam, Neva Cherniavsky, Jaehong Chon, Dane Barney, Frank Ciaramello (Cornell) Undergraduate Students –Omari Dennis, Jessica DeWitt, Loren Merritt National Science Foundation

Cyber infrastructure for Advancing Deaf & Hard of Hearing in STEM Richard Ladner* Jorge Díaz-Herrera^ James J DeCaro^+ E William Clymer^+ Anna Cavender* University of Washington* Rochester Institute of Technology+ National Technical Institute for the Deaf^

34 Advancing Deaf and Hard of Hearing people in STEM fields through better access to education. Our Goal

35 Problems Deaf students pursuing STEM fields need skilled interpreters and captioners with specific domain knowledge. –The best interpreter may not be at the student’s locale. Deaf students face challenging classroom environments: multiple sources of information are all visual –“Deaf Whiplash” Sign language is growing to include STEM vocabulary –Community consensus is required.

36 Enabling Access to STEM Education

37 Enabling ASL to Grow in STEM

38 Summit to Create a Cyber-Community to Advance Deaf and Hard-of-Hearing Individuals in STEM (DHH Cyber-Community) Scheduled for June 2008 – RIT/NTID Discussion among the many stakeholders: 1.Deaf and hard of hearing students in STEM fields. 2.Faculty and administrators in colleges and universities with a commitment to deaf and hard of hearing students in STEM fields. 3.Interpreters and captioners. 4.Researchers who study sign vocabulary for STEM fields and interpreting and captioning for education. 5.Educational technology researchers. 6.Experts in multimedia and network services that use the national cyberinfrastructure (e.g., AccessGrid). 7.Companies already in the business of providing video relay interpreting (VRI) and real time captioning (RTC). 8.Leaders in organizations who have an interest in advancing deaf and hard of hearing students in STEM fields. Contact us if you have ideas for participation.

39 Questions? Thanks! MobileASL Webpage: Richard Ladner: Anna Cavender: