Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6,

Presentation transcript:

Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6, 2015

Overview
-Background
-Design Approach
-Economic Analysis
-Schedule
-Division of Labor
-Societal and Environmental Impacts

Background 3

Problem
Audio manipulation over a distance
Audio manipulation while multi-tasking
Interactivity with music

Problem Background
Similar products
-PlayStation EyeToy
-Xbox Kinect
Difference
-Audio interactivity
-Purely 2D image tracking
Image credits: Microsoft 2014, Sony 2008

Solution
Kinetis Tower
Visual and audio input
Programming based
-Hand tracking
-Dynamic Time Warping
-Tie audio effects to motions
Image credits: Freescale 2014, Leopard Imaging

Solution - Audio Manipulation Through Gesticulation
Audio manipulation over a distance
-Control while several feet away
Audio manipulation while multi-tasking
-Control with movement of a single hand
Interactivity with music
-Interact through motion
-Reduce repetitiveness of songs

Hand Tracking Design 8

Color Matching Through Zeroing 9

10

Color Matching Through Zeroing 11

Color Matching Through Zeroing 12

Color Thresholding 13

Color Thresholding 14

Color Thresholding 15

Color Thresholding 16

Motion Thresholding 17

Motion Thresholding 18

Motion Thresholding 19

Motion Thresholding 20

Motion Thresholding 21

Color and Motion Thresholding 22

Color and Motion Thresholding 23

Color and Motion Thresholding 24

Color and Motion Thresholding 25
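The thresholding slides above survive only as titles, so the following is a rough sketch of one way a color mask and a motion mask could be combined to locate a hand. It assumes frames are small nested lists of RGB tuples; the function names, the target color, and both threshold values are hypothetical and do not come from the presentation.

```python
def color_mask(frame, target, tol):
    """True where every RGB channel is within tol of the target color."""
    return [[all(abs(c - t) <= tol for c, t in zip(px, target)) for px in row]
            for row in frame]


def motion_mask(prev_frame, frame, min_change):
    """True where the summed per-channel change since prev_frame is large."""
    return [[sum(abs(c - p) for c, p in zip(px, prev_px)) >= min_change
             for px, prev_px in zip(row, prev_row)]
            for row, prev_row in zip(frame, prev_frame)]


def combine(mask_a, mask_b):
    """Keep only pixels that pass both masks."""
    return [[a and b for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(mask_a, mask_b)]


def centroid(mask):
    """Mean (row, column) of the True pixels, or None if there are none."""
    hits = [(r, c) for r, row in enumerate(mask)
            for c, on in enumerate(row) if on]
    if not hits:
        return None
    return (sum(r for r, _ in hits) / len(hits),
            sum(c for _, c in hits) / len(hits))


# Hypothetical 2x3 frames: a static skin-colored pixel at (0, 0) and a
# skin-colored pixel that newly appears at (1, 2).
SKIN = (200, 150, 120)
BLACK = (0, 0, 0)
prev_frame = [[SKIN, BLACK, BLACK], [BLACK, BLACK, BLACK]]
frame = [[SKIN, BLACK, BLACK], [BLACK, BLACK, SKIN]]

mask = combine(color_mask(frame, SKIN, 30),
               motion_mask(prev_frame, frame, 60))
hand_position = centroid(mask)
```

In this toy example the static skin-colored pixel passes the color test but not the motion test, so only the moving pixel contributes to `hand_position`.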

Search Limiting 26

Results 27

28

Dynamic Time Warping 29

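Calculate Difference
Each square of the difference matrix holds |A-B| for one sample of A and one sample of B

Example
Sequences A and B and their difference matrix |A-B|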

Calculate Cheapest Route To Bottom Right Corner
Figure labels: Difference, Cost, Search Area

Example (slides 33-37)
The |A-B| matrix is shown next to a matrix of the cheapest cost to get to each square, filled in one step per slide

Variations
Time distortion cost
-Non-diagonal movements cost more
Path killing
-Routes over a certain cost are removed
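The Dynamic Time Warping steps on the preceding slides (difference matrix, cheapest route to the bottom right corner, and the two variations) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the team's implementation: the function name `dtw_cost` and the parameters `distortion_penalty` and `kill_above` are hypothetical, and the "Search Area" limit from the slides is not modeled.

```python
import math


def dtw_cost(a, b, distortion_penalty=0.0, kill_above=math.inf):
    """Cheapest accumulated cost from the top-left to the bottom-right square.

    distortion_penalty: extra cost added to every non-diagonal move.
    kill_above: squares whose accumulated cost exceeds this are removed
                (set to infinity) so no route can continue through them.
    """
    if not a or not b:
        raise ValueError("both sequences must be non-empty")
    rows, cols = len(a), len(b)
    cost = [[math.inf] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            diff = abs(a[i] - b[j])  # the |A-B| square
            if i == 0 and j == 0:
                best = 0.0
            else:
                options = []
                if i > 0 and j > 0:
                    options.append(cost[i - 1][j - 1])  # diagonal move
                if i > 0:
                    options.append(cost[i - 1][j] + distortion_penalty)
                if j > 0:
                    options.append(cost[i][j - 1] + distortion_penalty)
                best = min(options)
            total = best + diff
            cost[i][j] = total if total <= kill_above else math.inf
    return cost[rows - 1][cols - 1]


# A gesture and a slower copy of it (one sample repeated).
template = [1, 2, 3]
observed = [1, 2, 2, 3]
plain = dtw_cost(template, observed)
penalized = dtw_cost(template, observed, distortion_penalty=0.5)
killed = dtw_cost(template, observed, distortion_penalty=0.5, kill_above=0.25)
```

Without a penalty the stretched gesture matches at zero cost. With the penalty, the one unavoidable non-diagonal move raises the cost, and a low enough kill limit removes every route, which shows up as an infinite result.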

Audio Processing 39

Audio Processing
Processing time
Analog-digital conversion
5 audio effects
Finalizing input and output audio

Low Pass Filtering
Passes frequencies below the cutoff
Image credit: Beausievers 2013
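As an illustration of passing frequencies below a cutoff, here is a minimal one-pole (RC-style) low-pass filter. The presentation does not say which filter design the project uses, so the design, the function name `low_pass`, and the example cutoff and sample rate are all assumptions.

```python
import math


def low_pass(samples, cutoff_hz, sample_rate_hz):
    """One-pole low-pass: each output moves a fraction alpha toward the input."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    dt = 1.0 / sample_rate_hz
    alpha = dt / (rc + dt)
    out = []
    prev = 0.0
    for x in samples:
        prev = prev + alpha * (x - prev)
        out.append(prev)
    return out


# A constant (0 Hz) input passes; a signal alternating every sample
# (the highest representable frequency) is strongly attenuated.
steady = low_pass([1.0] * 2000, 100.0, 8000.0)
alternating = low_pass([1.0 if n % 2 == 0 else -1.0 for n in range(2000)],
                       100.0, 8000.0)
```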

High Pass Filtering
Passes frequencies above the cutoff
Image credit: Beausievers 2013
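A matching sketch for passing frequencies above a cutoff, again using a one-pole (RC-style) design. The design, the function name `high_pass`, and the example cutoff and sample rate are assumptions for illustration only.

```python
import math


def high_pass(samples, cutoff_hz, sample_rate_hz):
    """One-pole high-pass: follows changes in the input, decays when it is flat."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    dt = 1.0 / sample_rate_hz
    alpha = rc / (rc + dt)
    out = []
    prev_in = 0.0
    prev_out = 0.0
    for x in samples:
        prev_out = alpha * (prev_out + x - prev_in)
        prev_in = x
        out.append(prev_out)
    return out


# A constant (0 Hz) input is blocked; a signal alternating every sample
# passes almost unchanged.
steady = high_pass([1.0] * 2000, 100.0, 8000.0)
alternating = high_pass([1.0 if n % 2 == 0 else -1.0 for n in range(2000)],
                        100.0, 8000.0)
```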

Distortion
Amplifies the audio signal up to a threshold, then clips it
Image credit: Wikimedia Commons 2011
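The amplify-then-clip behavior described on this slide can be sketched as hard clipping. The function name `distort` and the gain and threshold values are hypothetical.

```python
def distort(samples, gain, threshold):
    """Amplify each sample by gain, then clip it to [-threshold, +threshold]."""
    if threshold <= 0:
        raise ValueError("threshold must be positive")
    return [max(-threshold, min(threshold, gain * x)) for x in samples]


# Quiet samples are only amplified; loud ones are flattened at the threshold.
clipped = distort([0.1, 0.5, -0.8], gain=2.0, threshold=0.6)
```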

Chorus
Sounds like audio is produced by multiple sources
Image credit: Sound on Sound 2004
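One common way to get the "multiple sources" impression is to mix the signal with a copy whose delay slowly varies. The sketch below follows that idea with a single delayed voice; the slide does not describe the project's method, and the function name `chorus` and all parameter defaults are assumptions.

```python
import math


def chorus(samples, sample_rate_hz, base_delay_ms=20.0, depth_ms=5.0,
           rate_hz=0.5, mix=0.5):
    """Mix each sample with a copy delayed by a slowly varying amount."""
    out = []
    for n, dry in enumerate(samples):
        # The delay sweeps between base - depth and base + depth milliseconds.
        sweep = math.sin(2.0 * math.pi * rate_hz * n / sample_rate_hz)
        delay = (base_delay_ms + depth_ms * sweep) * sample_rate_hz / 1000.0
        pos = n - delay
        if pos < 0:
            wet = 0.0  # the delayed copy has not started yet
        else:
            i = int(pos)
            frac = pos - i
            nxt = samples[i + 1] if i + 1 < len(samples) else samples[i]
            wet = (1.0 - frac) * samples[i] + frac * nxt  # linear interpolation
        out.append((1.0 - mix) * dry + mix * wet)
    return out


constant = chorus([1.0] * 1000, 8000.0)
silence = chorus([0.0] * 100, 8000.0)
```

A fuller chorus would typically mix several delayed voices with different sweep rates; one voice keeps the sketch short.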

Reverb
Sound reflecting in a space
Image credit: Practical Musical Production 2012
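Reflections can be approximated very roughly with a feedback comb filter, where each output is the input plus a quieter copy of the output from a fixed number of samples earlier. This is a simplified stand-in, not the project's reverb; the function name `comb_reverb` and its parameters are hypothetical.

```python
def comb_reverb(samples, delay_samples, feedback):
    """Feedback comb filter: y[n] = x[n] + feedback * y[n - delay_samples]."""
    if delay_samples < 1:
        raise ValueError("delay_samples must be at least 1")
    if not 0.0 <= feedback < 1.0:
        raise ValueError("feedback must be in [0, 1) so the echoes die out")
    out = []
    for n, x in enumerate(samples):
        echo = out[n - delay_samples] if n >= delay_samples else 0.0
        out.append(x + feedback * echo)
    return out


# A single click produces a train of echoes, each half as loud as the last.
impulse = [1.0] + [0.0] * 9
echoes = comb_reverb(impulse, delay_samples=3, feedback=0.5)
```

Practical reverbs usually combine several such filters with different delays so the echoes blur together instead of repeating at one interval.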

Development and Testing 46

Development Tools
-Bradley Computers
-Kinetis Freescale Tower
-Leopard Imaging USB Camera
-Kinetis IDE
-MATLAB
Image credits: Freescale 2014, Leopard Imaging

Testing
Hand Tracking
-Must have an 80% success rate in the following conditions:
-Outdoor/Indoor lighting
-Different movement speeds (slow/medium/fast)
-At least 3 different hands
-Success is tracking a hand for 30 seconds or more
Image credits: Freescale 2014, Leopard Imaging

Testing
Dynamic Time Warping
-Test against already solved matrices
-Must match gestures successfully 90% of the time
-Gesture matching will be tested under the same conditions as the hand tracking
Image credits: Freescale 2014, Leopard Imaging

Testing
Audio processing
-5 audio effects
-Quick processing time
-Satisfactory auditory results
-No crackling or static
-Minimize lag to 100 ms or less
Image credits: Freescale 2014, Leopard Imaging

Economic Analysis 51

Development Costs
Software
-Kinetis IDE: $
-MATLAB: $0.00 (Provided By School)
Hardware
-Kinetis Freescale Tower: $
-Leopard Imaging USB Camera: $
Total: $386.19

Division of Labor 53

Schedule 54

Societal and Environmental Impacts
Is it right to alter an artist's music?
Liability from damage while gesturing
Liability of harm if used improperly
Disclaimer before use to protect
-Advise users to use caution
-Check their surroundings

Societal and Environmental Impacts
For avid music listeners
RoHS compliant

Conclusion
-Background
-Need for more interactivity with music
-Solution provides innovative experience with personal music
-Design Approach
-Dynamic Time Warping and hand recognition for gestures
-Program audio effects tied directly with gestures

Conclusion
-Feasible project to finish within schedule
-Environmentally safe
-Socially safe with disclaimer and caution

Audio Manipulation Through Gesticulation Garrett Fosdick, Jair Robinson José Sanchez Bradley University - Electrical & Computer Engineering October 6, 2015

Extra Slides 60

Division of Labor 61

Test Procedures
Camera Input
-Display Footage On A Monitor - PASS/FAIL
Hand Tracking
-Display Footage At The End Of Each Step - PASS/FAIL
-Tracks All Test Hands For A Full 30 Seconds
Gesture Recognition
-Light Up An LED When The Gesture Occurs
-Must Be Right 90% Of The Time

Test Procedures
Audio Input
-Receiving Without Lag And Distortion - PASS/FAIL
Audio Output
-Audio Is Audible At Normal Hearing Level - PASS/FAIL
-No Lag Above 100 ms Or Unintended Distortion
Audio Effects
-No Lag From Gesture Trigger Above 100 ms
-Must Trigger From Correct Gesture 100% Of The Time

Preliminary Test Results - Gesture Matching
Figure labels: Partial, Random, Success, Fail

Preliminary Test Results - Gesture 65

Block Diagram 66

Glass Block Diagram 67

Gantt Chart 68

Schedule 69

Schedule 70

Schedule 71