The Population of Near-Earth Asteroids and Current Survey Completion Alan W. Harris MoreData! 1970-2010: The Golden Age of Solar System Exploration Rome,

Slides:



Advertisements
Similar presentations
Chapter 7 Statistical Data Treatment and Evaluation
Advertisements

Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
Some terminology When the relation between variables are expressed in this manner, we call the relevant equation(s) mathematical models The intercept and.
Statistical Analysis of Near Earth Objects By Danielle Bisordi and Kristin Buckley Professor Florescu MA 331 December 3, 2009.
Sampling Distributions
SUMMARIZING DATA: Measures of variation Measure of Dispersion (variation) is the measure of extent of deviation of individual value from the central value.
Spreadsheet Simulation
Confidence Intervals © Scott Evans, Ph.D..
Evaluating Hypotheses
Transforms What does the word transform mean?. Transforms What does the word transform mean? –Changing something into another thing.
Peter Vereš, Juraj Tóth, Leonard Kornoš Search for very close approaching NEAs Comenius University, Bratislava, Slovakia Faculty of mathematics, physics.
Lecture 9: One Way ANOVA Between Subjects
Statistical Background
1.  Why understanding probability is important?  What is normal curve  How to compute and interpret z scores. 2.
Math 1040 Intro To Statistics Professor: Zeph Allen Smith Presented by: Nellie Sobhanian.
BPS - 3rd Ed. Chapter 131 Confidence intervals: the basics.
The Sampling Distribution Introduction to Hypothesis Testing and Interval Estimation.
Chapter 7 Probability and Samples: The Distribution of Sample Means
1 Psych 5500/6500 Statistics and Parameters Fall, 2008.
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
Hypothesis Testing:.
PSYCHOLOGY 820 Chapters Introduction Variables, Measurement, Scales Frequency Distributions and Visual Displays of Data.
14. Introduction to inference
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Example 16.1 Ordering calendars at Walton Bookstore
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
The Estimated Population of Small NEOs Alan Harris MoreData! Inc. Target NEO 2 Washington, DC, July 9, 2013.
RMTD 404 Lecture 8. 2 Power Recall what you learned about statistical errors in Chapter 4: Type I Error: Finding a difference when there is no true difference.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
Sampling Distribution ● Tells what values a sample statistic (such as sample proportion) takes and how often it takes those values in repeated sampling.
PARAMETRIC STATISTICAL INFERENCE
Statistics and Quantitative Analysis Chemistry 321, Summer 2014.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 08/10/ :23 PM 1 Some basic statistical concepts, statistics.
Measures of Variability In addition to knowing where the center of the distribution is, it is often helpful to know the degree to which individual values.
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Essential Statistics Chapter 131 Introduction to Inference.
CHAPTER 3: The Normal Distributions
1 Psych 5500/6500 Standard Deviations, Standard Scores, and Areas Under the Normal Curve Fall, 2008.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
SUNYAEV-ZELDOVICH EFFECT. OUTLINE  What is SZE  What Can we learn from SZE  SZE Cluster Surveys  Experimental Issues  SZ Surveys are coming: What.
The Robust Approach Dealing with real data. Estimating Population Parameters Four properties are considered desirable in a population estimator:  Sufficiency.
Aldo Dell'Oro INAF- Observatory of Turin Detailed analysis of the signal from asteroids by GAIA and their size estimation Besançon November 6-7, 2003.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Distributions of the Sample Mean
Chapter 7 Sampling Distributions Statistics for Business (Env) 1.
Section 10.1 Confidence Intervals
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Relative Values. Statistical Terms n Mean:  the average of the data  sensitive to outlying data n Median:  the middle of the data  not sensitive to.
Chapter 2 Statistical Background. 2.3 Random Variables and Probability Distributions A variable X is said to be a random variable (rv) if for every real.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
MODELING THE NEOs ORBITAL DISTRIBUTION AND NEO DISCOVERY STRATEGIES A. Morbidelli (OCA) R. Jedicke (Spacewatch) W.F. Bottke (SWRI) P. Michel (OCA) P. Tanga.
Organization of statistical investigation. Medical Statistics Commonly the word statistics means the arranging of data into charts, tables, and graphs.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
Chapter 13 Sampling distributions
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
IM911 DTC Quantitative Research Methods Statistical Inference I: Sampling distributions Thursday 4 th February 2016.
CHAPTER – 1 UNCERTAINTIES IN MEASUREMENTS. 1.3 PARENT AND SAMPLE DISTRIBUTIONS  If we make a measurement x i in of a quantity x, we expect our observation.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
The accuracy of averages We learned how to make inference from the sample to the population: Counting the percentages. Here we begin to learn how to make.
Statistical Inferences for Variance Objectives: Learn to compare variance of a sample with variance of a population Learn to compare variance of a sample.
Chapter Seven Point Estimation and Confidence Intervals.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
CHAPTER 26: Inference for Regression
The Population of Near-Earth Asteroids Revisited
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Advanced Algebra Unit 1 Vocabulary
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Presentation transcript:

The Population of Near-Earth Asteroids and Current Survey Completion Alan W. Harris MoreData! : The Golden Age of Solar System Exploration Rome, Italy, September 10-12, 2012

NEA Population: How do we know? When a survey keeps re- detecting the same objects without finding any new ones, one can infer that the survey has found them all. Going to smaller sizes, one can estimate the fraction discovered from the ratio of re- detections to total detections. Still smaller, where there are insufficient re-detections, one can estimate the relative detection efficiency versus size, and extrapolate the population estimate to still smaller objects.

Completion and Re-detection Ratio If all asteroids of a given size were equally easy to detect, then the completion in a given size range would simply be the ratio of re-detected asteroids to the total number detected (new plus old) in a trial time interval. The total population in that size range would be simply the number known divided by the completion (re-detection ratio). For example, if 200 asteroids in a given size range were already known, and in the next couple years 50 of those were re-detected by a survey and 50 new ones were discovered, we would infer the completion was 50%, and the total population would be 400. But asteroids are not all equally easy to discover, due to the range of orbital parameters resulting in, among other factors, variable intervals of observability over time. In order to obtain an accurate estimate of true completion, and thus population, one must bias-correct the observed re-detection ratio to estimate the true completion as a function of size of asteroid. We do this with a computer model simulating actual surveys.

Computer Survey Simulations (1) We generate a large number of synthetic NEA orbits matching as best we can the distribution of orbits of large discovered NEAs, where completion is high so that biases in the orbit distribution should be minimal. Rather than assign sizes to the synthetic asteroids, we define a parameter dm as follows: where V lim is the limiting magnitude of the survey and H is the absolute magnitude of interest. In the first step of a simulation, we compute the sky positions and other parameters that affect visibility (rate of motion, solar elongation, phase angle, galactic latitude, etc.) for each orbit (100,000) every few days for ten years. We tabulate these parameters, along with what would be the sky magnitude V, less absolute magnitude: where r and  are Earth and Sun distances and  (  ) is the phase relation for solar phase angle .

Computer Simulation (2) The massive “observation file” need be generated only once. For a specific simulation, we can specify a particular observing site, the area of sky to be covered, observing cadence, specific limits on declination, solar elongation, etc., and even impose modifications on dm to account for expected magnitude loss due to zenith distance, trailing loss, sky brightness, seeing, etc. We can also specify how many detections over how much time constitutes a successful “discovery”. For each “observation” where it is determined the object is in the field observed, a detection is scored if dm < dm. The same observation file can be “scanned” repeatedly using different values of dm to build up a completion curve as a function of dm.

Model Completion vs. Re-detection Ratio After ten years or so of simulated survey, the shape of the completion curve is remarkably similar over a wide range of survey parameters; it simply moves to the left as time progresses. Furthermore, the Re- detection ratio is similarly stable, tracking about 1.0 magnitude lower value of dm. Unlike a real survey, in the computer simulation, we know the total population, so we can run a simulation, say for ten years, and also track the re-detection ratio for a trial interval, say the last two years of the simulation. Thus we can plot and compare the actual (model) completion along with the re-detection ratio.

Model vs. actual survey re-detection ratio The actual re- detection ratios for the combination of LINEAR, Catalina, and Siding Spring match the model curve within the uncertainties in the survey data. We thus adopt the model completion curve as representing current survey completion.

Extrapolation to smaller size The observed re-detection ratio becomes uncertain below about 0.1 (that is, H greater than about 22) due to the low number of re-detections. However, having “calibrated” the completion curve in the range of good re-detection statistics, we can extend to still smaller sizes by assuming that the computer completion curve accurately models actual completion. This works until the number of “detections” in the computer model falls below a statistically useful number, say about 100 “detections” out of the 100,000 model asteroids, or a completion of about This corresponds to about dm = -4.0, or on the scaled curve to about H = Fortunately, below dm of ~3.0, detections are close to the Earth and can be modeled with rectilinear motion rather than accounting for orbital motion. An analytical completion function can be matched to the computer completion curve and extrapolated to arbitrarily small size. With these extensions, we now have an estimate of completion over the entire size range of observed objects.

Differential Population Plotted here are the numbers in each half- magnitude interval, in red the total number discovered as of July 2010, and in blue the estimated total population in that size range, based on the completion curves of the previous graphs.

Cumulative Population The cumulative population is the running sum of the differential population, from the previous plot. The number N is the total number of NEAs larger than the specified size (H or Diameter).

Current and Future Survey Completion This plot shows the completion vs. size at current survey level, and as expected for a survey that achieves 90% integral completion to a size of D > 140 m.

Summary Latest (2012) estimated population of NEAs is little changed from 2003, 2007, and 2010 estimates: –N(H 1km) = 976 ± 30 –N disc (H<17.75) = 849 as of August 2, 2012; Completion = 87% –Size-frequency distribution still has dip in m range, estimated population of smallest objects is slightly less. Next-Generation surveys to reach C(D>140m) of 90% will in the process: –Complete the survey to sensibly 100% of objects D > 1 km –Catalog and track >90% of “Apophis” sized objects –Discover ~1/3 of “Tunguska” sized objects and ~10% of anything large enough to make it to the ground (D > 25m) Current surveys have almost a 50% chance of detecting a “death plunge” small object with enough time for civil defense measures; future surveys have the potential to do even better with appropriate observing protocol.