Playing in High Dimensions Bob Nichol ICG, Portsmouth Thanks to all my colleagues in SDSS, GRIST & PiCA Special thanks to Chris Miller, Alex Gray, Gordon.

Slides:



Advertisements
Similar presentations
How to do Bayes-Optimal Classification with Massive Datasets: Large-scale Quasar Discovery Alexander Gray Georgia Institute of Technology College of Computing.
Advertisements

Curse of Dimensionality Prof. Navneet Goyal Dept. Of Computer Science & Information Systems BITS - Pilani.
SVM - Support Vector Machines A new classification method for both linear and nonlinear data It uses a nonlinear mapping to transform the original training.
Unsupervised Learning
CS479/679 Pattern Recognition Dr. George Bebis
Biointelligence Laboratory, Seoul National University
“The Dark Side of the SDSS” Bob Nichol ICG, Portsmouth Chris Miller, David Wake, Brice Menard, Idit Zehavi, Ryan Scranton, Gordon Richards, Daniel Eisenstein,
Simulating the joint evolution of quasars, galaxies and their large-scale distribution Springel et al., 2005 Presented by Eve LoCastro October 1, 2009.
Model assessment and cross-validation - overview
4/15/2017 Using Gaussian Process Regression for Efficient Motion Planning in Environments with Deformable Objects Barbara Frank, Cyrill Stachniss, Nichola.
What is Statistical Modeling
Massive Science with VO & Grids Bob Nichol ICG, Portsmouth Thanks to all my colleagues in AG2, VOtech,& PiCA Special thanks to Chris Miller, Alex Gray,
Galaxy and Mass Power Spectra Shaun Cole ICC, University of Durham Main Contributors: Ariel Sanchez (Cordoba) Steve Wilkins (Cambridge) Imperial College.
Non-linear matter power spectrum to 1% accuracy between dynamical dark energy models Matt Francis University of Sydney Geraint Lewis (University of Sydney)
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Challenges Bayesian Estimation for Autonomous Object Manipulation Based on Tactile Perception Anna Petrovskaya, Oussama Khatib, Sebastian Thrun, Andrew.
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Lecture Notes for CMPUT 466/551 Nilanjan Ray
Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.
New Software Bob Nichol ICG, Portsmouth Thanks to all my colleagues in SDSS, GRIST & PiCA Special thanks to Chris Miller, Alex Gray, Gordon Richards, Brent.
Scalable Data Mining The Auton Lab, Carnegie Mellon University Brigham Anderson, Andrew Moore, Dan Pelleg, Alex Gray, Bob Nichols, Andy.
Statistical analysis and modeling of neural data Lecture 4 Bijan Pesaran 17 Sept, 2007.
Jeff Howbert Introduction to Machine Learning Winter Classification Bayesian Classifiers.
Binary Variables (1) Coin flipping: heads=1, tails=0 Bernoulli Distribution.
Ch. Eick: Support Vector Machines: The Main Ideas Reading Material Support Vector Machines: 1.Textbook 2. First 3 columns of Smola/Schönkopf article on.
Survey Science Group Workshop 박명구, 한두환 ( 경북대 )
Three data analysis problems Andreas Zezas University of Crete CfA.
Pattern Recognition: Baysian Decision Theory Charles Tappert Seidenberg School of CSIS, Pace University.
Bayesian Networks. Male brain wiring Female brain wiring.
Cosmological Tests using Redshift Space Clustering in BOSS DR11 (Y. -S. Song, C. G. Sabiu, T. Okumura, M. Oh, E. V. Linder) following Cosmological Constraints.
Different physical properties contribute to the density and temperature perturbation growth. In addition to the mutual gravity of the dark matter and baryons,
WFMOS Feasibility Study Value-added Science Bob Nichol, ICG Portsmouth.
Spatial Indexing of large astronomical databases László Dobos, István Csabai, Márton Trencséni ELTE, Hungary.
What can we learn from the luminosity function and color studies? THE SDSS GALAXIES AT REDSHIFT 0.1.
Objectives & Methodology: We aim to identify formerly unknown quasars from select databases, using previously known techniques that have been slightly.
Kernel Methods A B M Shawkat Ali 1 2 Data Mining ¤ DM or KDD (Knowledge Discovery in Databases) Extracting previously unknown, valid, and actionable.
Overview of Supervised Learning Overview of Supervised Learning2 Outline Linear Regression and Nearest Neighbors method Statistical Decision.
Clustering in the Sloan Digital Sky Survey Bob Nichol (ICG, Portsmouth) Many SDSS Colleagues.
How Standard are Cosmological Standard Candles? Mathew Smith and Collaborators (UCT, ICG, Munich, LCOGT and SDSS-II) SKA Bursary Conference 02/12/2010.
SDSS photo-z with model templates. Photo-z Estimate redshift (+ physical parameters) –Colors are special „projection” of spectra, like PCA.
PHY306 1 Modern cosmology 3: The Growth of Structure Growth of structure in an expanding universe The Jeans length Dark matter Large scale structure simulations.
Computational AstroStatistics Bob Nichol (Carnegie Mellon) Motivation & Goals Multi-Resolutional KD-trees (examples) Npt functions (application) Mixture.
Using Baryon Acoustic Oscillations to test Dark Energy Will Percival The University of Portsmouth (including work as part of 2dFGRS and SDSS collaborations)
Advanced Stellar Populations Advanced Stellar Populations Raul Jimenez
Céline Scheidt and Jef Caers SCRF Affiliate Meeting– April 30, 2009.
BAOs SDSS, DES, WFMOS teams (Bob Nichol, ICG Portsmouth)
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
Cosmological Model Selection David Parkinson (with Andrew Liddle & Pia Mukherjee)
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
3rd International Workshop on Dark Matter, Dark Energy and Matter-Antimatter Asymmetry NTHU & NTU, Dec 27—31, 2012 Likelihood of the Matter Power Spectrum.
Linear Methods for Classification Based on Chapter 4 of Hastie, Tibshirani, and Friedman David Madigan.
Robust Kernel Density Estimation by Scaling and Projection in Hilbert Space Presented by: Nacer Khalil.
Final Review Course web page: vision.cis.udel.edu/~cv May 21, 2003  Lecture 37.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
The XMM Cluster Survey: Project summary and Cosmology Forecasts Kathy Romer University of Sussex.
Constraining population synthesis (and binary black hole inspiral rates) using binary neutron stars Richard O’Shaughnessy GWDAW
Density Estimation in R Ha Le and Nikolaos Sarafianos COSC 7362 – Advanced Machine Learning Professor: Dr. Christoph F. Eick 1.
Bayesian Template-Based Approach to Classifying SDSS-II Supernovae from 3-Year Survey Brian Connolly Photometric Supernova ID Workshop 3/16/12.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Predictive Automatic Relevance Determination by Expectation Propagation Y. Qi T.P. Minka R.W. Picard Z. Ghahramani.
1 C.A.L. Bailer-Jones. Machine Learning. Data exploration and dimensionality reduction Machine learning, pattern recognition and statistical data modelling.
Bayesian analysis of joint strong gravitational lensing and dynamic galactic mass in SLACS: evidence of line-of-sight contamination Antonio C. C. Guimarães.
LECTURE 09: BAYESIAN ESTIMATION (Cont.)
Alan D. Mead, Talent Algorithms Inc. Jialin Huang, Amazon
Alan Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani
Non-Parametric Models
Course Outline MODEL INFORMATION COMPLETE INCOMPLETE
Demographics of SDSS Quasars in Two-Dimension
Ch 3. Linear Models for Regression (2/2) Pattern Recognition and Machine Learning, C. M. Bishop, Previously summarized by Yung-Kyun Noh Updated.
Presentation transcript:

Playing in High Dimensions Bob Nichol ICG, Portsmouth Thanks to all my colleagues in SDSS, GRIST & PiCA Special thanks to Chris Miller, Alex Gray, Gordon Richards, Brent Bryan, Chris Genovese, Ryan Scranton, Larry Wasserman, Jeff Schneider

SC4 DEVO Meeting Outline Cosmological data is exploding in both size and complexity. This drives us towards searching non-parametric techniques, and the need to model data in high dimensions. Two examples: 1.Detection of quasars in multicolor space 2.Detection of features in data

SC4 DEVO Meeting Hunting for quasars Quasi-stellar sources: by definition they look like stars! Traditional approaches have used UVX approach to finding quasars, i.e., quasars are “very blue” so can be isolated in color-color space using simple hyper-planes (see Richards et al. 2002). However, there is significant contamination (~40%), thus demanding spectroscopic follow-up which is very time- consuming.

SC4 DEVO Meeting Use a probabilistic approach Use Kernel Density Estimation (KDE) to map color-color space occupied by known stars and quasars (“training sets”) Use cross-validation to “optimal” smooth the 4-D SDSS color space and obtain PDFs Fast implementation via KD-trees (Gray & Moore) ~16,000 known quasars and ~ stars Using a non-parametric Bayes classifier (NBC)

SC4 DEVO Meeting additional cut 95% complete 95% pure Stars QSOs F stars

SC4 DEVO Meeting

Advancements Present prior for P(C 1 ) is the ratio of stars to galaxies in our dataset Add magnitude and redshift information, either via increasing dimensionality or through priors

SC4 DEVO Meeting Non-parametric Techniques The wealth of data demands non- parametric techniques, ie., can one describe phenomena using the less amount of assumptions? The challenges here are computational as well as psychological

SC4 DEVO Meeting CMB Power Spectrum Before WMAP WMAP data Are the 2 nd and 3 rd peaks detected?

SC4 DEVO Meeting In parametric models of the CMB power spectrum the answer is likely “yes” as all CMB models have multiple peaks. But that has not really answered our question! Can we answer the question non-parametrically e.g., Y i = f(X i ) + c i Where Y i is the observed data, f(X i ) is an orthogonal function (  i cos(i  X i )), c i is the covariance matrix. The challenge is to “shrink” f(X i ), we use Beran (2000) to strink f(X i ) to N terms equal to the number of data points - optimal for all smooth functions and provides valid confidence intervals Monotonic shrinkage of  i - specifically nested subset selection (NSS) See Genovese et al. (2004) astro-ph/

SC4 DEVO Meeting Results (optimal smoothing through bias-variance trade-off) Concordance Our f(X i ) Note, WMAP only fit is not same as concordance model

SC4 DEVO Meeting Testing models The main advantage of this method is that we can construct a “confidence ball” (in N dimensions) around f(X i ) and thus perform non-parametric interferences e.g. is the second peak detected? Not at 95% confidence!

SC4 DEVO Meeting Information Content f h (X i ) = f(X i ) + b*h Beyond here there is little information

SC4 DEVO Meeting Using CMBfast we can make parametric models (11 parameters) and test if they are within the “confidence ball”. Varying  b we get a range of to Gray are models in the 95% confidence ball

SC4 DEVO Meeting Testing in high D Now we can now jointly search all 11 parameters in the parametric models and determine which models fit in the confidence ball (at 95%). Traditionally this is done by marginalising over the other parameters to gain confidence intervals on each parameter separately. This is a problem in high- D where the likelihood function could be degenerate, ill-defined and under-identified This is computational intense as billions of models need to searched, each of which takes ~minute to run Use Kriging methods to predict where the surface of the confidence ball exists and test models there.

SC4 DEVO Meeting 7D parameter space samples Cyan : 0.5  Purple: 1  Blue : 1.5  Red : 2  Green : >2  Bimodal dist. for several parameters

SC4 DEVO Meeting Other applications Physics of CMB is well-understood and people counter that parametric analyses are better (including Bayesian methods) [however, concerns about CI] Other areas of astrophysics have similar data problems, but the physics is less developed –Galaxy and quasar spectra (models are still rudimentary) –Galaxy clustering (non-linear gravitational effects are not confidently modeled) –Galaxy properties (e.g. star-formation rate)

SC4 DEVO Meeting + 1 sigma Mean -1 sigma