Gaia DR1: a few warnings and caveats

Slides:



Advertisements
Similar presentations
Star catalogue comparison a tool to improve asteroidal prediction accuracy Dave Herald, TTSO4, April 2010, Canberra.
Advertisements

Week 11 Review: Statistical Model A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
Do NOT open CLEA until all the black boxes have finished loading.
The Calibration Process
Distances. Parallax Near objects appear to move more than far objects against a distant horizon. Trigonometric parallax is used to measure distance to.
1 PREDICTION In the previous sequence, we saw how to predict the price of a good or asset given the composition of its characteristics. In this sequence,
1 1 Slide Statistical Inference n We have used probability to model the uncertainty observed in real life situations. n We can also the tools of probability.
Common Probability Distributions in Finance. The Normal Distribution The normal distribution is a continuous, bell-shaped distribution that is completely.
GEO7600 Inverse Theory 09 Sep 2008 Inverse Theory: Goals are to (1) Solve for parameters from observational data; (2) Know something about the range of.
Chapter 15 Modeling of Data. Statistics of Data Mean (or average): Variance: Median: a value x j such that half of the data are bigger than it, and half.
Distortion in the WFC Jay Anderson Rice University
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
I Introductory Material A. Mathematical Concepts Scientific Notation and Significant Figures.
LECTURER PROF.Dr. DEMIR BAYKA AUTOMOTIVE ENGINEERING LABORATORY I.
Goal: To understand how we know distances to various objects (the distance scale) Objectives: 1)To learn about Radar (the sun) 2)To learn about Parallax.
Survey – extra credits (1.5pt)! Study investigating general patterns of college students’ understanding of astronomical topics There will be 3~4 surveys.
11 Confidence Intervals – Introduction A point estimate provides no information about the precision and reliability of estimation. For example, the sample.
Slide 1 of 48 Measurements and Their Uncertainty
Inference: Probabilities and Distributions Feb , 2012.
Cosmology and extragalactic astronomy Mat Page Mullard Space Science Lab, UCL 5. The cosmic distance ladder.
Announcements HW #1 will be returned on Wednesday Problem #7 – I wrote the distance incorrectly, should have been x1019 km. But don’t change your.
GAIA binary star simulations: Tests of sampling windows and observation strategies Staffan Söderhjelm Lund Observatory Sampling window-size Window allocation.
Sampling Design and Analysis MTH 494 Lecture-21 Ossam Chohan Assistant Professor CIIT Abbottabad.
Making careful measurements allows scientists to repeat experiments and compare results. Section 3: Measurement K What I Know W What I Want to Find Out.
What is a reading? It is the single determination of the value of an unknown quantity It is the actual reading taken in an experiment.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Precision, Error and Accuracy Physics 12. Measurement  When taking measurements, it is important to note that no measurement can be taken exactly  Therefore,
Uncertainty2 Types of Uncertainties Random Uncertainties: result from the randomness of measuring instruments. They can be dealt with by making repeated.
Pisa, 4 May 2009 Alessandro Spagna A new kinematic survey (from GSC-II and SDSS-DR7) to study the stellar populations of the Milky Way Alessandro Spagna.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Hypothesis Testing. Statistical Inference – dealing with parameter and model uncertainty  Confidence Intervals (credible intervals)  Hypothesis Tests.
Measuring shear using… Kaiser, Squires & Broadhurst (1995) Luppino & Kaiser (1997)
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Star Catalog Comparison 2016 IOTA Annual Meeting Steve Preston.
Outline Sampling Measurement Descriptive Statistics:
Multiple Regression Analysis: Inference
Working with astrometric data - warnings and caveats -
Confidence Intervals Cont.
Sponge: List five scientists you saw in the Mechanical Universe film.
More on Inference.
Employment of AGIS for Nano-Jasmine
Working with astrometric data - warnings and caveats -
Sampling Distributions and Estimation
Introduction to estimation: 2 cases
Option D1 & D2: Measuring Stellar Distances
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
The Calibration Process
GPI Astrometric Calibration
Gaia impact on asteroidal occultations
Gaia impact on asteroidal occultations
More on Inference.
Objectives: To learn about Radar (the sun)
Arithmetic Mean This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Graphing with Uncertainties
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Gaia Tomaž Zwitter Gaia: > 1.1 billion objects (V ≤ 20.9),
Uncertainty and Error
Binary stars and clusters
CS639: Data Management for Data Science
Errors and Uncertainties
Parallax and the Standard Candle Method
Scientific Measurement
Propagation of Error Berlin Chen
Propagation of Error Berlin Chen
Introduction to Econometrics, 5th edition
Uncertainty and confidence
Presentation transcript:

Gaia DR1: a few warnings and caveats e.g. the importance of using variance/covariance quantities U. Bastian, Gaia User Workshop, ARI 4.4.2016

Introduction to Gaia Skipped – Stefan gave it less than an hour ago !

Forthcoming first Gaia data release „Gaia DR1“ 14.6/11.8 months end of summer 2016 1 billion positions and G magnitudes (old plan, still true) plus 2 million parallaxes and proper motions (very new)

The 1 billion positions Probably ~1.5 billion Mostly G=12-20 On average on the ICRS/ICRF system to sub-mas precision. But: too few Gaia observations for 5-parameter solutions Bayesian prior that parallax and motions are „small“, implying a Galaxy model, see Michalik et al 2015, A&A, 583, A68, ArXiV 1507.02963 effectively a „reasonable“ 2-parameter solution; no pm and parallaxes published Thus: typically uncertain by 1-3 mas due to unknown parallax and proper motion perhaps 10 percent with formal uncertainties >15 mas a few unrecognized high proper motions and parallaxes -> outliers and of course the usual outliers due to disturbed measurements No problem for all applications that I could think of. giant advance over all existing large reference catalogues precision and accuracy quickly deteriorating (few years, both future and past) for the future: to be replaced by Gaia DR2 before this problem can take effect in the future for the past: the time of applicability can be extended by combination with known pm, or by direct combination with older known positions

The 2 million parallaxes and motions Too few Gaia observations for 5-parameter solutions Solved by combination with Tycho-2 positions genuine 5-parameter solutions valid proper motions and parallaxes but: depending on Tycho-2 (incl. HIP) positions, not on Tycho-2 and HIP motions and parallaxes, however. Comparisons: Number of objects: 20 times better than HIP Parallaxes typically below 0.5 mas, i.e. better than HIP Motions typically 2 mas/yr, i.e. worse than HIP (but 0.1 mas for HIP stars) Systematic errors: worse than HIP; a number will be given with the release Not yet comparable with expected Gaia performances ! DR2 will be the first proper Gaia Catalogue Beware of the systematics !

Systematic errors Size in DR1 is similar to the individual uncertainties causes: colour terms, PSF time variations, basic-angle oscillations clanks dependence on Tycho-2 ... Averages over many stars will not be (much) better than the numbers for a single star Clusters, absolute brightness of standard candles, ... Recipe: - quadratically add a systematic uncertainty after averaging better: - give separate random and systematic uncertainties

Example N=400 stars, each with sigma(parallax) = 0.5 mas sqrt(N) law gives sigma(average) = 0.025 mas = 25 muas completely unrealistic for DR1 !

The inverse of a parallax ... ... is not necessarily a distance Insignificant parallaxes Negative parallaxes Marginally significant parallaxes Look at the standard errors in the catlg file(s) And then of course: Outliers Small degree of freedom per star (hampering outlier rejection)

The inverse of a parallax ... Mean, median, mode, rms of a trigonometric distance Consider a given true parallax p Distance D=1/p (see next page) Recipe to avoid scientific trouble: Always use forward modelling to measured parallaxes to fit higher-level unknown quantities In other words: Do not use derived quantities like individual distances or absolute magnitudes to fit higher-level unknown quantities

The inverse of a parallax ...

The inverse of a parallax ... „Estimating distance from parallax“, C. Bailer-Jones, EWASS 2016 & PASP, Oct 2015 Various Bayesian priors giving realistic means and variances - for various circumstances

Gaia image of the week, 2015-12-19

Now for something funny but serious: M11 Variances/mean errors Covariances/Correlations Condition number (inverse) GoF (F2) Source excess noise Solution_1512170101, members position-selected Wow ! An exploding cluster ?? Data from a preliminary solution; but actual Gaia DR1 will look quite similar

Now for something funny but serious: M11 Variances/mean errors Covariances/Correlations Condition number (inverse) GoF (F2) Source excess noise Wow ! Data from a preliminary solution; but actual Gaia DR1 will look quite similar

During this presentation - about 1 million stars were measured by Gaia, - roughly 10 million astrometric measurements were taken, - about 300,000 spectra were made of 100,000 stars