From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith AURA/NOAO/CTIO/LSST.

Slides:



Advertisements
Similar presentations
Microsoft Research Microsoft Research Jim Gray Distinguished Engineer Microsoft Research San Francisco SKYSERVER.
Advertisements

USING TRANSIENTS TO ILLUMINATE THE DARK UNIVERSE R. Chris Smith NOAO/CTIO ESSENCE, SuperMACHO, and the Dark Energy Survey.
The Australian Virtual Observatory e-Science Meeting School of Physics, March 2003 David Barnes.
Brian Schmidt The Research School of Astronomy and Astrophysics Mount Stromlo & Siding Spring Observatories.
Extremely Large Telescopes and Surveys Mark Dickinson, NOAO.
The Dark Energy Survey The Big Questions The Discovery of Dark Energy The Dark Energy Survey – The telescope – The camera – The science Expected Results.
Astronomy of the Next Decade: From Photons to Petabytes R. Chris Smith AURA Observatory in Chile CTIO/Gemini/SOAR/LSST.
All these Sky Pixels Are Yours The evolution of telescopes and CCD Arrays: The Coming Data Nightmare.
Cluster Cosmology at LPPC Illuminating Dark Energy with BCS and PISCO F. William High 2 nd Year Grad Student Harvard Univ Dept of Physics Monday, 21 August.
Astronomical Research at TAMU Faculty Future facilities and projects.
July 7, 2008SLAC Annual Program ReviewPage 1 Future Dark Energy Surveys R. Wechsler Assistant Professor KIPAC.
1 LSST: Dark Energy Tony Tyson Director, LSST Project University of California, Davis Tony Tyson Director, LSST Project University of California, Davis.
SNLS: SuperNova Legacy Survey Jess Oundjian. What is SNLS?  The largest survey yet to measure the distance to, and the redshift of, far off supernovae.
Like the jelly beans in this jar, the Universe is mostly dark: about 96 percent consists of dark energy (about 70%) and dark matter (about 26%). Only about.
Building a Framework for Data Preservation of Large-Scale Astronomical Data ADASS London, UK September 23-26, 2007 Jeffrey Kantor (LSST Corporation), Ray.
KDUST Supernova Cosmology
Leicester Database & Archive Service J. D. Law-Green, J. P. Osborne, R. S. Warwick X-Ray & Observational Astronomy Group, University of Leicester What.
Astro-DISC: Astronomy and cosmology applications of distributed super computing.
Astronomical GRID Applications at ESAC Science Archives and Computer Engineering Unit Science Operations Department ESA/ESAC.
Observational Astronomy. Astronomy Primary Goal: Understanding the nature of the universe and its constituents Means: Equipment building, research, teaching.
Aus-VO: Progress in the Australian Virtual Observatory Tara Murphy Australia Telescope National Facility.
The KPNO 4m “Mayall” Telescope Arjun Dey (NOAO). National Optical Astronomy Observatory Mission: provide the best ground-based astronomical capabilities.
UNIVERSITY of MARYLAND GLOBAL LAND COVER FACILITY High Performance Computing in Support of Geospatial Information Discovery and Mining Joseph JaJa Institute.
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
The NASA/NExScI/IPAC Star and Exoplanet Database 14 May 2009 David R. Ciardi on behalf of the NStED Team.
Open Science Grid For CI-Days Internet2: Fall Member Meeting, 2007 John McGee – OSG Engagement Manager Renaissance Computing Institute.
The Science Case for the Dark Energy Survey James Annis For the DES Collaboration.
From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith AURA/NOAO/CTIO.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
1 New Frontiers with LSST: leveraging world facilities Tony Tyson Director, LSST Project University of California, Davis Science with the 8-10 m telescopes.
1 Radio Astronomy in the LSST Era – NRAO, Charlottesville, VA – May 6-8 th LSST Survey Data Products Mario Juric LSST Data Management Project Scientist.
MASSACHUSETTS INSTITUTE OF TECHNOLOGY NASA GODDARD SPACE FLIGHT CENTER ORBITAL SCIENCES CORPORATION NASA AMES RESEARCH CENTER SPACE TELESCOPE SCIENCE INSTITUTE.
THE DARK ENERGY SURVEY Science Goals: Perform a 5000 sq. deg. survey of the southern galactic cap Map the cosmological density field to z=1 Constrain the.
Astro / Geo / Eco - Sciences Illustrative examples of success stories: Sloan digital sky survey: data portal for astronomy data, 1M+ users and nearly 1B.
Instrumentation of the SAM-Grid Gabriele Garzoglio CSC 426 Research Proposal.
Using the NSA Presentation to NOAO Users Committee October 5, 2005.
LSST: Preparing for the Data Avalanche through Partitioning, Parallelization, and Provenance Kirk Borne (Perot Systems Corporation / NASA GSFC and George.
Research Networks and Astronomy Richard Schilizzi Joint Institute for VLBI in Europe
EScience May 2007 From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith NOAO/CTIO, LSST.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
The Dark Energy Survey The Big Questions The Discovery of Dark Energy
Astronomy, Petabytes, and MySQL MySQL Conference Santa Clara, CA April 16, 2008 Kian-Tat Lim Stanford Linear Accelerator Center.
What the Data Products Program Offers Users Todd Boroson Dick Shaw Presentation to NOAO Users Committee October 23, 2003.
G. Miknaitis SC2006, Tampa, FL Observational Cosmology at Fermilab: Sloan Digital Sky Survey Dark Energy Survey SNAP Gajus Miknaitis EAG, Fermilab.
The Large Synoptic Survey Telescope: The power of wide-field imaging Michael Strauss, Princeton University.
Type Ia Supernovae and the Acceleration of the Universe: Results from the ESSENCE Supernova Survey Kevin Krisciunas, 5 April 2008.
1 Imaging Surveys: Goals/Challenges May 12, 2005 Luiz da Costa European Southern Observatory.
May 6, 2002Earth System Grid - Williams The Earth System Grid Presented by Dean N. Williams PI’s: Ian Foster (ANL); Don Middleton (NCAR); and Dean Williams.
Data Archives: Migration and Maintenance Douglas J. Mink Telescope Data Center Smithsonian Astrophysical Observatory NSF
From photons to catalogs. Cosmological survey in visible/near IR light using 4 complementary techniques to characterize dark energy: I. Cluster Counts.
LSST VAO Meeting March 24, 2011 Tucson, AZ. Headquarters Site Headquarters Facility Observatory Management Science Operations Education and Public Outreach.
Large Area Surveys - I Large area surveys can answer fundamental questions about the distribution of gas in galaxy clusters, how gas cycles in and out.
High Energy FermiLab Two physics detectors (5 stories tall each) to understand smallest scale of matter Each experiment has ~500 people doing.
The Large Synoptic Survey Telescope and Precision Studies of Cosmology David L. Burke SLAC C2CR07 Granlibakken, California February 26, 2007 Brookhaven.
The Large Synoptic Survey Telescope Project Bob Mann LSST:UK Project Leader Wide-Field Astronomy Unit, Edinburgh.
Ray Plante for the DES Collaboration BIRP Meeting August 12, 2004 Tucson Fermilab, U Illinois, U Chicago, LBNL, CTIO/NOAO DES Data Management Ray Plante.
AstroGrid NAM 2001 Andy Lawrence Cambridge NAM 2001 Andy Lawrence Cambridge Belfast Cambridge Edinburgh Jodrell Leicester MSSL.
EScience: Techniques and Technologies for 21st Century Discovery Ed Lazowska Bill & Melinda Gates Chair in Computer Science & Engineering Computer Science.
Mountaintop Software for the Dark Energy Camera Jon Thaler 1, T. Abbott 2, I. Karliner 1, T. Qian 1, K. Honscheid 3, W. Merritt 4, L. Buckley-Geer 4 1.
1 LSST Town Hall 227 th meeting of the AAS 1/7/2016 Pat Eliason, LSSTC Executive Office Pat Osmer, LSSTC Senior Advisor.
Brenna Flaugher for the DES Collaboration; DPF Meeting August 27, 2004 Riverside,CA Fermilab, U Illinois, U Chicago, LBNL, CTIO/NOAO 1 Dark Energy and.
DESpec in the landscape of large spectrographic surveys Craig Hogan University of Chicago and Fermilab.
HELIO: Discovery and Analysis of Data in Heliophysics Robert Bentley, John Brooke, André Csillaghy, Donal Fellows, Anja Le Blanc, Mauro Messerotti, David.
LSST CORPORATION Patricia Eliason LSSTC Executive Officer Belgrade, Serbia 2016.
The Sloan Digital Sky Survey Supernova Search ● THANKS TO THE APO TEAM FROM THE SDSS-II SUPERNOVA SCIENCE TEAM!!
T. Axelrod, NASA Asteroid Grand Challenge, Houston, Oct 1, 2013 Improving NEO Discovery Efficiency With Citizen Science Tim Axelrod LSST EPO Scientist.
Optical Survey Astronomy DATA at NCSA
Chapter 30 Section 4 Big Bang Theory.
Google Sky.
Presentation transcript:

From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith AURA/NOAO/CTIO/LSST

“Classical” Optical Astronomy

1-4 investigators propose for telescope time Obtain 1 to 5 nights, or 1 to 5 hours!  Oversubscription on largest telescopes (e.g. Gemini) severely limits time per investigator Travel to distant telescope site  or not: Remote Observing, Service Observing, Queue Observe  or not: clouds (OUCH!) Take.5 to 50 GB of data home (on tapes) Reduce & Analyze “by hand”  Extract every detail from those bits  Often takes months per night of data

Optical Windows

Can “classical” techniques answer the BIG questions? Where do we come from?  Star Formation, Nucleosynthesis Are we alone?  Proto-planetary disks, search for planets Where are we going?  Big Bang & the Expansion of the Universe What is the Universe made of?  What types of matter? What types of energy?

Where are we going? A “Repulsive” Result In 1990s, began looking for deceleration Found expansion of Universe is accelerating!!! Implies something NEW! Regions of empty space REPEL each other?  “Cosmological constant”? Einstein’s greatest blunder… OR NOT?!!  Something going on in the vacuum? NEW FUNDAMENTAL PHYSICS!

Today’s BIG Questions: Dark Energy & Dark Matter Dark Energy is the dominant constituent of the Universe. Dark Matter is next. 95% of the Universe is in Dark Energy and Dark Matter, for which we have little or no detailed understanding and 2003 Science breakthroughs of the year

Attacking the Question of Dark Energy “Classical” approach won’t work  Not enough telescope time  Difficult to control calibrations & systematics LARGE SURVEYS  Goal: Provide large, uniform, well calibrated, controlled, and documented datasets to allow for advanced statistical analyses  Control calibrations & systematics to <1%  Larger collaborations provide both manpower and diverse expertise Including traditional astronomers, high-energy physicists, mathematicians, and computer scientists

Sociology of Dark Energy Dark Energy may be pushing the universe APART But it is pulling the Astronomy, High Energy Physics (HEP), and Computer Science (CS) communities TOGETHER  HEP interests in fundamental physics  HEP experience in large datasets  CS interest in CPUs, Storage, Networks, and (of course) algorithms & optimization!

Dark Energy ROADMAP to understanding Today  ESSENCE, large international group of astronomers Coming Soon to a telescope nearby  Dark Energy Survey Camera built by Fermilab, majority DOE funding Data Management System led by NCSA Groups from Spain, United Kingdom, and Brazil recently joined The next BIG step  LSST Camera built by SLAC, Data Mgmt with NCSA, NSF + DOE funding, also inc. LLNL, Brookhaven, others Stepping UP  Space-based work: JDEM (SNAP and/or others) NASA + DOE funding

Today: ESSENCE (+SuperMACHO) Use a LARGE (~200 SNe), UNIFORM set of supernova light curves to allow us to study the evolution of the expansion of the universe  Constrain “w”, the equation of state parameter of Dark Energy, to ~10% 30 half-nights per year for 5 years ( ) Use other half of nights to constrain possible DARK MATTER candidates  The ‘SuperMACHO’ project  Search the Large Magellanic Cloud for microlensing

Searching for Supernovae (and other transients) High-z SN Team

ESSENCE+SuperMACHO The data flows… The telescope  CTIO’s Blanco 4m The camera  MOSAIC 8Kx8K imager (67 megapixels) Exposures of 60s to 400s Collect 20GB of RAW data per night Data must be reduced and analyzed in near REAL TIME (within ~10min) Data ‘Reduction’ = >5x EXPANSION!  Roughly 3TB per year

… and flows MUCH larger data flow than most other astronomical projects With ADDITIONAL complication of real-time reduction & alert requirement  Must plan spectroscopic follow up on LARGEST telescopes (Gemini, Keck, VLT, Magellan, …) We THOUGHT we were ready  A few CPUs (cluster of 20 x 1GHz)  A few disks (4 x 4TB “data bricks”) But…

Challenges Moving the data  From Chile to the U.S. Storing the data  Filling up racks with “data bricks”  Keeping track of the data  Initial database didn’t cut it Reprocessing the data  Pipeline can keep up with real time flow  But need to reprocess past years of data when improvements are made to software

Coming Soon (2009?): Dark Energy Survey Investigate Dark Energy using 4 complementary and independent methods  Various types of distance measurements, based on standard luminosities, standard yardsticks, and standard volumes Combine the results to provide the best (to date) constraints on the equation of state of Dark Energy

The Instrument: Dark Energy Camera Focal Plane: 64 2k x 4k CCDs Plus guiding and WFS 0.5 GIGApixel camera

The Data: Dark Energy Survey Each image = 1GB 350 GB of raw data / night Data must be moved to supercomputer center (NCSA) before next night begins (<24 hours)  Need >36Mbps internationally Data must be processed within ~24 hours  Need to inform next night’s observing Total raw data after 5 yrs ~0.2 PB TOTAL Dataset 1 to 5 PB  Reprocessing planned using TeraGrid resources

The Large Synoptic Survey Telescope – Massively Parallel Astronomy Survey the entire sky every 4-5 nights, to simultaneously detect and study:  Dark Matter via Weak gravitational lensing  Dark Energy via thousands of SNe per year  Potentially hazardous near earth asteroids  Tracers of the formation of the solar system  Fireworks in the heavens – GRBs, quasars…  Periodic and transient phenomena ...…the unknown

LSST: The Instrument 8.2m telescope  Optimized for WIDE field of view (FOV) 3.5 degree FOV 3.5 GIGApixel camera Deep images in 15-30s Able to scan whole sky every 4-5 nights

LSST: Deep, Wide, Fast Field of view (FOV) Keck Telescope 0.2 degrees 10 m 3.5 degrees LSST

LSST Site: Cerro Pachon in Northern Chile Soar Gemini LSST ~1.5m cal telescope Support LSST site plan El Penon Gemini (South) SOAR

LSST: Distributed Data Management Long-Haul Communications Data transport & distribution Base Facility Real time processing Mountain Site data acquisition, temp. storage Archive/Data Access Centers Data processing, long term storage, & public access

LSST: The Data Flow Each image roughly 6.5GB Cadence: ~1 image every 15s 15 to 18 TB per night  ALL must be transferred to U.S. “data center” Mtn-base within image timescale (15s), ~10-20Gbps Internationally within 2-10Gbps REAL TIME reduction, analysis, & alerts  Send out alerts of transient sources within minutes  Provide automatic data quality evaluation, alert to problems  Processed data grows to >100TB per night! Just catalogs >3 PB per year!

DES, LSST, … and now for the REST of the Science Ongoing (ESSENCE, SuperMACHO, etc.) and future (DES, LSST, etc.) projects will provide PETABYTES of archived data Only a small fraction of the science potential will be realized by the planned investigations How do we maximize the investment in these datasets and provide for their future scientific use?

The Virtual Observatory What is VO?  Provides the framework for global access to the various data archives by facilitating the standardization of archiving and data-mining protocols.  Enables data analysis by providing common standards and state-of-the-art analysis tools which work over high-speed wide area networks What is VO not?  An organization funded to provide a single universal archive of all astronomical data  A provider of resources (storage, computation, bandwidth)

VO: A Global Effort BR-VO

VO Challenges Provide Access to the Content  Multiple distributed archives, some on the scale of many petabytes  Archives provide content, the VO knits those resources together Provide the Standards  Allow variety of archives talk to each other  Develop generalized data model(s) for different instruments/different wavelengths

VO Challenges Provide the User Interfaces  Streamline data discovery, data understanding, data movement, and data analysis Support the Analysis  Support large queries across distributed DBs  Support statistical analysis across results (Grid) All the “boring” bits (infrastructure)  Security, handshaking, resource management

Chris Miller/NOAO

VO Case Study: NOAO Data Products Program Management of data from all NOAO and some affiliated facilities  KPNO, including Mayall 4m (MOSAIC, NEWFIRM)  CTIO, including Blanco 4m (MOSAIC, ISPI)  SOAR & WIYN systems Virtual Observatory “back end”; CONTENT  Provide access to large volume (TBs to PBs) of archived ground-based optical & infrared data and data products Virtual Observatory “front end”; UI and TOOLS  Enable science based upon distributed data and data products, developing tools and services

NVO NOAO Focus on Scientific USER First support data DISCOVERY NOAO Supported NVO Portals:   And for S.America…  Mbps for Chilean use!

Many challenges ahead… Security  enforcing proprietary periods while allowing PIs to combine data VOSpace  virtual (and real) distributed workspaces to collect and analyze data from many sites  shared spaces for collaborations Combining GRID storage and CPU resources with VO queries and analysis

Strategic Partnerships In Local Systems  Vendors: Local Storage, Processing, Servers In Remote Systems  Supercomputer center(s) to provide bulk storage, large scale processing (e.g., NCSA, SDSC’s SRB)  Grid processing, storage Connectivity  High-speed national and international bandwidth Scientific  VO Partners to develop standards, provide tools  Providing services to, and collecting feedback from, physics and astronomy user communities  Providing strong VO node in South America