Aaron Ballew Aleksandar Kuzmanovic C. C. Lee Northwestern University Dept. of Electrical Engineering and Computer Science July 7 th 2011 Fusion of Live.

Slides:



Advertisements
Similar presentations
Using An Audio Version of the Course Textbooks to Support Learning The Open University of Israel The Center for Technology in Distance Education Shlomit.
Advertisements

AVQ Automatic Volume and eQqualization control Interactive White Paper v1.6.
Speech Enhancement through Noise Reduction By Yating & Kundan.
The Impact of Channel Estimation Errors on Space-Time Block Codes Presentation for Virginia Tech Symposium on Wireless Personal Communications M. C. Valenti.
Authors: David N.C. Tse, Ofer Zeitouni. Presented By Sai C. Chadalapaka.
 These 100 seniors make up one possible sample. All seniors in Howard County make up the population.  The sample mean ( ) is and the sample standard.
Variance reduction techniques. 2 Introduction Simulation models should be coded such that they are efficient. Efficiency in terms of programming ensures.
Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.
Audio Meets Image Retrieval Techniques Dave Kauchak Department of Computer Science University of California, San Diego
Digital transmission over a fading channel Narrowband system (introduction) Wideband TDMA (introduction) Wideband DS-CDMA (introduction) Rake receiver.
STAT 497 APPLIED TIME SERIES ANALYSIS
Diversity techniques for flat fading channels BER vs. SNR in a flat fading channel Different kinds of diversity techniques Selection diversity performance.
1/44 1. ZAHRA NAGHSH JULY 2009 BEAM-FORMING 2/44 2.
PSY 307 – Statistics for the Behavioral Sciences
Digital Data Transmission ECE 457 Spring Information Representation Communication systems convert information into a form suitable for transmission.
1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 10 Notes Class notes for ISE 201 San Jose State University.
A Quick Practical Guide to PCA and ICA Ted Brookings, UCSB Physics 11/13/06.
Direct Time Study Chapter 13 Sections: Direct Time Study Procedure
Wireless Communication Channels: Small-Scale Fading
PSY 307 – Statistics for the Behavioral Sciences Chapter 8 – The Normal Curve, Sample vs Population, and Probability.
Determining the Size of
1 Lecture 9: Diversity Chapter 7 – Equalization, Diversity, and Coding.
Principles of the Global Positioning System Lecture 11 Prof. Thomas Herring Room A;
Two and a half problems in homogenization of climate series concluding remarks to Daily Stew Ralf Lindau.
For 3-G Systems Tara Larzelere EE 497A Semester Project.
Making all the right connections Signal Flow 101.
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
1 Techniques to control noise and fading l Noise and fading are the primary sources of distortion in communication channels l Techniques to reduce noise.
3 SIGNALLING Analogue vs. digital signalling oRecap advantages and disadvantages of analogue and digital signalling oCalculate signal transmission rates.
ECE 530 – Analysis Techniques for Large-Scale Electrical Systems
Next. Smart Music For todays aspiring musician, access to professional-grade resources doesn’t necessarily require a rock star budget. Let’s explore some.
EEG Classification Using Maximum Noise Fractions and spectral classification Steve Grikschart and Hugo Shi EECS 559 Fall 2005.
1 Psych 5500/6500 t Test for Two Independent Means Fall, 2008.
1 Statistical Distribution Fitting Dr. Jason Merrick.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
1 Nonparametric Statistical Techniques Chapter 17.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
Chapter 4: Baseband Pulse Transmission Digital Communication Systems 2012 R.Sokullu1/46 CHAPTER 4 BASEBAND PULSE TRANSMISSION.
Hypothesis Testing An understanding of the method of hypothesis testing is essential for understanding how both the natural and social sciences advance.
A Semi-Blind Technique for MIMO Channel Matrix Estimation Aditya Jagannatham and Bhaskar D. Rao The proposed algorithm performs well compared to its training.
Inference: Probabilities and Distributions Feb , 2012.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
Chapter 13: Inferences about Comparing Two Populations Lecture 8b Date: 15 th November 2015 Instructor: Naveen Abedin.
Dongxu Yang, Meng Cao Supervisor: Prabin.  Review of the Beamformer  Realization of the Beamforming Data Independent Beamforming Statistically Optimum.
Lesson Use and Abuse of Tests. Knowledge Objectives Distinguish between statistical significance and practical importance Identify the advantages.
Smart Sleeping Policies for Wireless Sensor Networks Venu Veeravalli ECE Department & Coordinated Science Lab University of Illinois at Urbana-Champaign.
The Effect of Database Size Distribution on Resource Selection Algorithms Luo Si and Jamie Callan School of Computer Science Carnegie Mellon University.
BIOL 582 Lecture Set 2 Inferential Statistics, Hypotheses, and Resampling.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Design of a Guitar Tab Player in MATLAB Summary Lecture Module 1: Modeling a Guitar Signal.
Relevant Document Distribution Estimation Method for Resource Selection Luo Si and Jamie Callan School of Computer Science Carnegie Mellon University
UNIT-IV. Introduction Speech signal is generated from a system. Generation is via excitation of system. Speech travels through various media. Nature of.
True or False? It is possible to listen without hearing. It is possible to hear without listening.
PSY 626: Bayesian Statistics for Psychological Science
CS 591 S1 – Computational Audio
Techniques to control noise and fading
COOPERATIVE PRINCIPLE:
Adaptive Filters Common filter design methods assume that the characteristics of the signal remain constant in time. However, when the signal characteristics.
PSY 626: Bayesian Statistics for Psychological Science
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Chap. 7 Regularization for Deep Learning (7.8~7.12 )
Bayesian Nonparametric Matrix Factorization for Recorded Music
Principles of the Global Positioning System Lecture 11
NONLINEAR AND ADAPTIVE SIGNAL ESTIMATION
Chapter 10 Introduction to the Analysis of Variance
Recap In previous lessons we have looked at how numbers can be stored as binary. We have also seen how images are stored as binary. This lesson we are.
NONLINEAR AND ADAPTIVE SIGNAL ESTIMATION
Evaluation David Kauchak CS 158 – Fall 2019.
Presentation transcript:

Aaron Ballew Aleksandar Kuzmanovic C. C. Lee Northwestern University Dept. of Electrical Engineering and Computer Science July 7 th 2011 Fusion of Live Audio Recordings for Blind Noise Reduction

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Observation You Attend a Concert ● You’d like a recording of the show ● Live albums exist, but… ● You want the show you went to, back in San Jose CA on Feb 22 nd 2010 Bootleggers At the show, you remember cell phones and cameras in the air

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Observation, cont’d Seek it Out You find some of those recordings uploaded Not just one, but three, four, or five copies of your favorite songs Varying quality Online Database

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Opportunity Each song is an unknown source signal with receiver diversity There must be a way to take advantage of the diversity in these recordings to generate a new recording whose quality is better than any of the originals

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Opportunity, cont’d All the recordings have something in common – a sameness from the music that was generated They have something uncommon too – a differentness from noisy applause, screaming fans, wind, etc.

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Complications No reference (except in your mind) that defines which part is music rather than noise  Studio recording won’t work in general You don’t know the SNR of any signal There’s no pilot signal to imply the channel No opportunity to pre-code a digital waveform  It’s an Analog source  No M-ary QPSK, Matched-Filters Uncountably many sources and relatively few recordings, not a good fit for ICA

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Assumptions Recordings are mono  Stage speakers may be physically separated and multitrack  Relative to venue’s scale and listener’s perspective the multitracks arrive synchronized and recorded as mono by mic Recordings are not synchronized to each other  Different start/stop times and duration Receivers are distributed arbitrarily among audience Noise at one receiver is not the same noise at another  Not necessarily true if two receivers are close to each other  Not true out-of-context, such as a quiet auditorium  Sample vs. Sample  Noise vs. Noise

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Strategy We will never know the absolute SNR of any of the recordings However, if we could be confident their signal powers were equal, then the differences in their total powers would be due to the noise  Assumes the noise is (close to) uncorrelated  Does not assume we know what the signal power actually is If we could use the total power as a proxy for noise power (given bullet 2 above), we could:  Rank recordings by SNR  Apply a classic averaging technique to cancel noise  Measure whether noise power went up or down compared to any original recording

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Strategy, cont’d It would look like this:

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Step 1 – Internal Reference Similarity & Synchronization Cross-correlations show:  Which sample is most similar to all other samples  The time-shift (lag) between any sample pair No external reference, so pick internal one from the sample set

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Step 2 – Normalize In Absence of SNR, The effect of combining samples is unclear Need a way to isolate changes in signal or noise power It would be helpful if signal powers were already equal  Implies combining affects the noise

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Step 2 – Normalize, cont’d Use the Right Tool Use covariance, not r, to normalize signal powers You still don’t know the absolute signal powers You only know that the differences are due to noise Now, you can tell whether noise goes up or down after combining

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Step 3 – Fusion “Weighted” Average Find the average of the first M ranked samples, such that total power is minimized Why the first M?  A sample’s noise power may be so large it increases the composite’s noise *not to scale

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Benefits Identify a “best” quality recording without having to manually listen to each Generate a recording that exceeds the “best” in quality Encourage user-generated (crowd-sourced) content sharing Applicable to any context where the source signal is completely unknown

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Ongoing and Future Ongoing: Time-variability of noise  Shows up as “low-frequency” noise that downselects against such a recording  We window in time (and frequency) to take advantage of the high-quality parts of the recordings  Stitching the windows back together post-fusion requires some attention due to an audible discontinuity when adjacent windows generate a different composite Future: Maximal Ratio Combining  Well-known technique that requires channel knowledge  Gives optimal weighting of samples for maximal fusion gain  I believe we can adapt the inference technique to MRC, such that we get the “maximal” SNR gain, though I may not know exactly what the gain is!

Aaron BallewFusion of Live Audio Recordings for Blind Noise Reduction – Fusion 2011 Conclusion Thank You!

Aaron Ballew Aleksandar Kuzmanovic C. C. Lee Northwestern University Dept. of Electrical Engineering and Computer Science Fusion of Live Audio Recordings for Blind Noise Reduction