Made with OpenOffice.org 1 The Velocity Distribution of Nearby Stars from Hipparcos Data: The Significance of the Moving Groups Jo Bovy Center for Cosmology.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Kriging.
Dark Matter in Dwarf Galaxies
Cost of surrogates In linear regression, the process of fitting involves solving a set of linear equations once. For moving least squares, we need to.
Model Assessment and Selection
What is Statistical Modeling
Rob Fergus Courant Institute of Mathematical Sciences New York University A Variational Approach to Blind Image Deconvolution.
An introduction to Principal Component Analysis (PCA)
Information Theoretic Image Thresholding Laura Frost Supervisors: Dr Peter Tischer Dr Sid Ray.
Estimating parameters in inversions for regional carbon fluxes Nir Y Krakauer 1*, Tapio Schneider 1, James T Randerson 2 1. California Institute of Technology.
Predictive Automatic Relevance Determination by Expectation Propagation Yuan (Alan) Qi Thomas P. Minka Rosalind W. Picard Zoubin Ghahramani.
Mass to light ratio of the Milky Way disc Chris Flynn, Johan Holmberg, Laura Portinari Tuorla Observatory Burkhard Fuchs, Hartmut Jahrei ß Burkhard Fuchs,
M ORPHOLOGY OF G ALACTIC O PEN C LUSTERS Thirty six open clusters were selected from the 2MASS database on the basis of the latest open cluster catalogue.
Proper-Motion Membership Determinations in Star Clusters Dana I. Dinescu (Yale U.)
Single Point of Contact Manipulation of Unknown Objects Stuart Anderson Advisor: Reid Simmons School of Computer Science Carnegie Mellon University.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Regression Eric Feigelson. Classical regression model ``The expectation (mean) of the dependent (response) variable Y for a given value of the independent.
沈致远. Test error(generalization error): the expected prediction error over an independent test sample Training error: the average loss over the training.
I N T R O D U C T I O N The mechanism of galaxy formation involves the cooling and condensation of baryons inside the gravitational potential well provided.
CPSC 502, Lecture 15Slide 1 Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 16 Nov, 3, 2011 Slide credit: C. Conati, S.
Detection rates for a new waveform background design adopted from The Persistence of Memory, Salvador Dali, 1931 Bence Kocsis, Merse E. Gáspár (Eötvös.
Thoughts on the GIA Issue in SNARF Jim Davis & Tom Herring Input from and discussions with Mark Tamisiea, Jerry Mitrovica, and Glenn Milne.
Principal Component Analysis: Preliminary Studies Émille E. O. Ishida IF - UFRJ First Rio-Saclay Meeting: Physics Beyond the Standard Model Rio de Janeiro.
Directional Statistics for WIMP direct detection Anne Green Astro-Particle Theory and Cosmology Group University of Sheffield Ben Morgan, AMG and Neil.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
Early science on exoplanets with Gaia A. Mora 1, L.M. Sarro 2, S. Els 3, R. Kohley 1 1 ESA-ESAC Gaia SOC. Madrid. Spain 2 UNED. Artificial Intelligence.
Chapter 7. Learning through Imitation and Exploration: Towards Humanoid Robots that Learn from Humans in Creating Brain-like Intelligence. Course: Robots.
Guest lecture: Feature Selection Alan Qi Dec 2, 2004.
Dynamic and Spatial Properties of Satellites in Isolated Galactic Systems Abel B. Diaz.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Gaussian Processes For Regression, Classification, and Prediction.
Most 1.6 Earth-Radius Planets are not Rocky Introduction Sub-Neptune, super-Earth-size exoplanets are a new planet class. They account for 0% of the solar.
A bestiary of QSO/host interactions Y. Letawe 1, P. Magain 1, G. Letawe 1, F. Courbin 2, D. Hutsemékers 1 1 Institut d’Astrophysique.
Bias and Variance of the Estimator PRML 3.2 Ethem Chp. 4.
Machine Learning 5. Parametric Methods.
Dark Matter in the Milky Way - how to find it using Gaia and other surveys Paul McMillan Surveys For All, 1st February 2016.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Parallax Luminosity and mass functions - a few basic facts Kinematics of the solar neighborhood Asymmetric drift Thin disk, thick disk Open and globular.
Making action-angle disc models for Gaia Paul McMillan Lund Observatory Collaborators: J. Binney, T. Piffl, J. Sanders.
Mapping the Mass with Galaxy Redshift-Distance Surveys Martin Hendry, Dept of Physics and Astronomy University of Glasgow, UK “Mapping the Mass”: Birmingham,
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
HD and its super-Earth Valerie Van Grootel (University of Liege, Belgium) M. Gillon (U. Liege), D. Valencia (U. Toronto), N. Madhusudhan (U. Cambridge),
Determining Ages of APOGEE Giants with Known Distances Diane Feuillet New Mexico State University Jon Holtzman, Jo Bovy, Leo Girardi The APOGEE Team.
1 The law of gravitation can be written in a vector notation (9.1) Although this law applies strictly to particles, it can be also used to real bodies.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
ETHEM ALPAYDIN © The MIT Press, Lecture Slides for.
Made with OpenOffice.org 1 Inferring the dynamics of the Milky Way Jo Bovy New York University MPE, Garching, 2009/08/07.
Inferring dynamics from kinematic snapshots Jo Bovy Distribution of Mass in the Milky Way Galaxy Leiden, 07/14/09 New York University The importance of.
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 1: INTRODUCTION.
1 C.A.L. Bailer-Jones. Machine Learning. Data exploration and dimensionality reduction Machine learning, pattern recognition and statistical data modelling.
Inference in action: the force law in the Solar System
(Collaborators: James Binney, Tilmann Piffl, Jason Sanders)
Principal Component Analysis
Probability Theory and Parameter Estimation I
Peculiar Velocity Moments
ICS 280 Learning in Graphical Models
The Stellar Kinematics in Our Milky Way
Clustering (3) Center-based algorithms Fuzzy k-means
Special Topics In Scientific Computing
Bias and Variance of the Estimator
Pre-Cursor Data Needed for JWST Transit and Eclipse Observations
CSCI 5822 Probabilistic Models of Human and Machine Learning
Estimating the Spatial Sensitivity Function of A Light Sensor N. K
Volume 6, Issue 5, Pages e13 (May 2018)
A Short Introduction to the Bayes Filter and Related Models
Pattern Recognition and Machine Learning
9. Gravitation 9.1. Newton’s law of gravitation
The spectral properties of Galactic X-ray sources at faint fluxes
Modeling the Extended Structure of Dwarf Spheroidals (Carina, Leo I)
Presentation transcript:

Made with OpenOffice.org 1 The Velocity Distribution of Nearby Stars from Hipparcos Data: The Significance of the Moving Groups Jo Bovy Center for Cosmology and Particle Physics New York University Bovy, Hogg, & Roweis (2009), ApJ, 700, 1794

Made with OpenOffice.org 2 Introduction >>3D distribution of velocities of stars near the Sun ( <~ 100 pc) >>Long history: >First account Mädler 1846: Stars moving together in the Pleiades

Made with OpenOffice.org 3 Introduction >>3D distribution of velocities of stars near the Sun ( <~ 100 pc) >>Long history: >First account Mädler 1846: Stars moving together in the Pleiades

Made with OpenOffice.org 4 Introduction >>3D distribution of velocities of stars near the Sun ( <~ 100 pc) >>Long history: >First account Mädler 1846: Stars moving together in the Pleiades >Proctor 1869: several groups of comoving stars (Hyades & Ursa Major) >20 th century: moving groups confirmed; new, but often spurious, groups discovered (e.g., Eggen) >Hipparcos: complete samples --> first detailed studies of f(v) (e.g., Chen et al. 1997; Figueras et al. 1997; Dehnen 1998; Chereul et al. 1998, 1999; Asiain et al. 1999; Bienayme 1999; Skuljan et al. 1999) >>Question remained: - How much of the structure is significant >>We answer this question by modeling the underlying f(v) with varying degrees of complexity and finding the ''best'' model

Made with OpenOffice.org 5 Relation to Gaia >>This is an exercise in model selection: What do we mean by ''best''? >>Gaia dynamical modeling questions: - How complex does the DF need to be - How complex does the potential need to be >>Can we the 3D-kinematics from 2D-observations? no RV at the faint end of Gaia

Made with OpenOffice.org 6 Modeling the underlying distribution function I >>We deconvolve the velocity distribution by fitting for an underlying distribution >>We model this underlying distribution as a mixture (sum) of K multivariate Gaussian components f(v) = Σa j N(v | m j, V j ) ; m = mean, V = variance tensor >>Fitting consists of comparing the model predictions with the data: Convolve the model with the observational uncertainties Likelihood = ∫ d v true f(v true ) p(v obs | v true ) obs. uncertainty = Σa j N(v obs | R t m j, R t V j R t T + S obs ) Projection onto tangential comp.

Made with OpenOffice.org 7 Modeling the underlying distribution function II >>Fitting = optimize the total likelihood = Π (individual likelihoods) >>We developed a general optimization technique to optimize this objective function, see Bovy, Hogg, & Roweis (2009) arXiv:0905:2979; GPLv2 code at >>We can incorporate prior information, e.g., regularization of the variance tensors

Made with OpenOffice.org 8 Number of Gaussian components <---- Increasing complexity Direction of Gal. rotation Towards the GC Variance regularization, <--- increasing complexity

Made with OpenOffice.org 9 More complexity, better fit But aren't we overfitting the data?

Made with OpenOffice.org 10 Choosing K (and w) >>Cross-validation

Made with OpenOffice.org 11 Choosing K (and w) >>Cross-validation >>Information-based criteria: minimum message length: “Best” model is the model that compresses the data most Could be part of dynamical modeling (e.g., made-to-measure)

Made with OpenOffice.org 12 Choosing K (and w) >>Cross-validation >>Information-based criteria: minimum message length: “Best” model is the model that compresses the data most Could be part of dynamical modeling (e.g., made-to-measure)

Made with OpenOffice.org 13 Choosing K (and w) >>Cross-validation >>Information-based criteria: minimum message length: “Best” model is the model that compresses the data most >>External validation: predict GCS radial velocities

Made with OpenOffice.org 14 Predicting the GCS radial velocities I 10 components w = 4 (km/s) 2

Made with OpenOffice.org 15 Predicting the GCS radial velocities II

Made with OpenOffice.org 16 Choosing K (and w) >>Cross-validation >>Information-based criteria: minimum message length: “Best” model is the model that compresses the data most >>External validation: predict GCS radial velocities: “Best” model is the model that predicts the RVs best

Made with OpenOffice.org 17 Choosing K (and w) >>Cross-validation >>Information-based criteria: minimum message length: “Best” model is the model that compresses the data most >>External validation: predict GCS radial velocities: “Best” model is the model that predicts the RVs best Most conservative --> K = 10 components

Made with OpenOffice.org 18 Resulting velocity distribution Toward the NGP Gal. rotation direction Toward the GC Toward the GC In the direction of Gal. rotation

Made with OpenOffice.org 19 Good fit? GCS probabilities:radial velocity distribution in patches of the sky:

Made with OpenOffice.org 20 Conclusions >>Significant structure in the local velocity distribution was all known before Hipparcos(but the velocity distribution is nevertheless very complex) >>We can reconstruct the local velocity distribution from tangential velocities alone >>Deconvolution (or, more generally, forward modeling) gets the most out of the data (and no more) + it handles arbitrary uncertainties, including missing data >>But, these approaches should mature to sampling the uncertainty in the DF, especially when modeling the DF in dynamical modeling, in order to marginalize over the DF when inferring the Galactic potential