Future of Astronomy: enormous datasets, massive computing, innovative instrumentation Rachel Webster & David Barnes (Project Leader & Project Scientist,

Slides:



Advertisements
Similar presentations
Microsoft Research Microsoft Research Jim Gray Distinguished Engineer Microsoft Research San Francisco SKYSERVER.
Advertisements

Trying to Use Databases for Science Jim Gray Microsoft Research
World Wide Telescope mining the Sky using Web Services Information At Your Fingertips for astronomers Jim Gray Microsoft Research Alex Szalay Johns Hopkins.
1 Online Science The World-Wide Telescope as a Prototype For the New Computational Science Jim Gray Microsoft Research
Online Science The World-Wide Telescope as a Prototype For the New Computational Science Jim Gray Microsoft Research
International Grid Communities Dr. Carl Kesselman Information Sciences Institute University of Southern California.

1 Science and the NVO – Overview and Discussion Dave De Young NVO Project Scientist NOAO NVOSS Aspen September 2006.
John Cunniffe Dunsink Observatory Dublin Institute for Advanced Studies Evert Meurs (Dunsink Observatory) Aaron Golden (NUI Galway) Aus VO 18/11/03 Efficient.
Australian e-Astronomy US-Australia Workshop on High-Performance Grids and Applications 8-9 June 2004 David Barnes Research Fellow, School of Physics The.
Australian Virtual Observatory A distributed volume rendering grid service Gridbus 2003 June 7 Melbourne University David Barnes School of Physics, The.
What does LOFAR have to do with the Virtual Observatory (VO)? LOFAR Science Day 16 December 2003 Melbourne David Barnes The University of Melbourne.
The Australian Virtual Observatory (a.k.a. eAstronomy Australia) Ray Norris CSIRO ATNF.
The Australian Virtual Observatory e-Science Meeting School of Physics, March 2003 David Barnes.
Australian Virtual Observatory Pacific Rim Applications and Grid Middleware Assembly The 4th Workshop 5th-6th June 2003 Monash University David Barnes.
Australian Virtual Observatory International Astronomical Union GA 2003 Joint Discussion 08 17th-18th July 2003 Sydney David Barnes The University of Melbourne.
The Australian Virtual Observatory Clusters and Grids David Barnes Astrophysics Group.
SKA New Technology Demonstrator Ray Norris CSIRO Australia Telescope National Facility.
Brian Schmidt The Research School of Astronomy and Astrophysics Mount Stromlo & Siding Spring Observatories.
20 Nov, 2002Virtual Molonglo Observatory1 “The VO in Australia” Melbourne Nov. 28/  What is the AVO?  How did it develop - Grid computing – particle.
CASDA Virtual Observatory CSIRO ASTRONOMY AND SPACE SCIENCE Arkadi Kosmynin 11 March 2014.
ESO-ESA Existing Activities Archives, Virtual Observatories and the Grid.
Development of China-VO ZHAO Yongheng NAOC, Beijing Nov
European Southern Observatory
Arecibo 40th Anniversary Workshop--R. L. Brown The Arecibo Astrometric/Timing Array Robert L. Brown.
Jeroen Stil Department of Physics & Astronomy University of Calgary Stacking of Radio Surveys.
NAIC-NRAO School on Single-Dish Radio Astronomy. Arecibo, July 2005
SKAMP - the Molonglo SKA Demonstrator M.J. Kesteven CSIRO ATNF, T. J. Adams, D. Campbell-Wilson, A.J. Green E.M. Sadler University of Sydney, J.D. Bunton,
Data-Intensive Science: Addressing common needs with shared tools Christopher Stubbs Professor Department of Physics Department of Astronomy
ASKAP and DVP David DeBoer ASKAP Project Director 15 April 2010 Arlington, VA.
MWA Project:. Site: Murchison Radio Observatory Australia’s proposed SKA Site Strategy: 512 Antenna “Tiles” Explore “Large N / Small D” regime Correlate.
Planning for the Virtual Observatory Tara Murphy … with input from other Aus-VO members …
Leicester Database & Archive Service J. D. Law-Green, J. P. Osborne, R. S. Warwick X-Ray & Observational Astronomy Group, University of Leicester What.
Astro-DISC: Astronomy and cosmology applications of distributed super computing.
Astronomical GRID Applications at ESAC Science Archives and Computer Engineering Unit Science Operations Department ESA/ESAC.
Aus-VO: Progress in the Australian Virtual Observatory Tara Murphy Australia Telescope National Facility.
THE CAASTRO TEAM IS PURSUING THREE INTERLINKED SCIENCE PROGRAMS: THE EVOLVING UNIVERSE When did the first galaxies form, and how have they then evolved?
N Tropy: A Framework for Analyzing Massive Astrophysical Datasets Harnessing the Power of Parallel Grid Resources for Astrophysical Data Analysis Jeffrey.
The NASA/NExScI/IPAC Star and Exoplanet Database 14 May 2009 David R. Ciardi on behalf of the NStED Team.
The Cosmic Simulator Daniel Kasen (UCB & LBNL) Peter Nugent, Rollin Thomas, Julian Borrill & Christina Siegerist.
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
Alex Szalay, Jim Gray Analyzing Large Data Sets in Astronomy.
Science with the Virtual Observatory Brian R. Kent NRAO.
The Murchison Wide Field Array Murchison, ~300 km from Geraldton.
Astronomy: from Networks to the Grid Leopoldo Benacchio INAF (National Institute for Astrophysics) Padova Astronomical Observatory Italy.
Paul Alexander & Jaap BregmanProcessing challenge SKADS Wide-field workshop SKA Data Flow and Processing – a key SKA design driver Paul Alexander and Jaap.
THE MURCHISON WIDEFIELD ARRAY: FROM COMMISSIONING TO OBSERVING D. Oberoi 1,2, I. H. Cairns 3, L. D. Matthews 2 and L. Benkevitch 2 on behalf of the MWA.
Research Networks and Astronomy Richard Schilizzi Joint Institute for VLBI in Europe
EScience May 2007 From Photons to Petabytes: Astronomy in the Era of Large Scale Surveys and Virtual Observatories R. Chris Smith NOAO/CTIO, LSST.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
THEORETICAL ASTROPHYSICS AND THE US-NVO INITIATIVE D. S. De Young National Optical Astronomy Observatory.
S.A. Torchinsky SKADS Workshop 10 October 2007 Simulations: The Loop from Science to Engineering and back S.A. Torchinsky SKADS Project Scientist.
Murchison Widefield Array (MWA) : Design and Status Divya Oberoi, Lenoid Benkevitch MIT Haystack Observatory doberoi, On behalf.
Data Archives: Migration and Maintenance Douglas J. Mink Telescope Data Center Smithsonian Astrophysical Observatory NSF
The Mission  Explore the Dark Ages through the neutral hydrogen distribution  Constrain the populations of the first stars and first black holes.  Measure.
Australian SKA Pathfinder (ASKAP) David R DeBoer ATNF Assistant Director ASKAP Theme Leader 06 November 2007.
Sharing scientific data: astronomy as a case study for a change in paradigm Présenté par Françoise Genova.
ALMA and the Call for Early Science The Atacama Large (Sub)Millimeter Array (ALMA) is now under construction on the Chajnantor plain of the Chilean Andes.
Contact: Junwei Cao SC2005, Seattle, WA, November 12-18, 2005 The authors gratefully acknowledge the support of the United States National.
Observed and Simulated Foregrounds for Reionization Studies with the Murchison Widefield Array Nithyanandan Thyagarajan, Daniel Jacobs, Judd Bowman + MWA.
Introduction to the VO ESAVO ESA/ESAC – Madrid, Spain.
E. Solano. GAIA Meeting, Menorca, Oct 2009 GAIA and the Virtual Observatory Enrique Solano, LAEX/CAB (INTA-CSIC) Spanish VO Principal Investigator.
RI EGI-InSPIRE RI Astronomy and Astrophysics Dr. Giuliano Taffoni Dr. Claudio Vuerli.
The CSIRO ASKAP Science Data Archive Progress and Plans
EGEE NA4 Lofar Lofar Information System Design OmegaCEN
Moving towards the Virtual Observatory Paolo Padovani, ST-ECF/ESO
For more information, visit
For more information, visit
Google Sky.
Presentation transcript:

Future of Astronomy: enormous datasets, massive computing, innovative instrumentation Rachel Webster & David Barnes (Project Leader & Project Scientist, Australian Virtual Observatory) School of Physics, The University of Melbourne

Topics 1.Astronomy is a theoretical and observational science with massive heterogeneous datasets. 2.The Virtual Observatory (VO) 3.Aus-VO: the Australian project 4.A new local opportunity: MWA low frequency array Read here for a summary of each slide…..

2. What is a Virtual Observatory? A Virtual Observatory (VO) is a distributed, uniform interface to the data archives of the world’s major astronomical facilities.A Virtual Observatory (VO) is a distributed, uniform interface to the data archives of the world’s major astronomical facilities. A VO is realised with advanced data mining and visualisation tools which exploit the unified interface to enable cross-correlation and combined processing of distributed and diverse datasets.A VO is realised with advanced data mining and visualisation tools which exploit the unified interface to enable cross-correlation and combined processing of distributed and diverse datasets. VOs will rely on, and provide motivation for, the development of national and international computational and data grids.VOs will rely on, and provide motivation for, the development of national and international computational and data grids. Virtual observatories will effect a “sea change” in the way astronomy is done.

International Data deluge! Dozens of new surveys 2003 to 2008Dozens of new surveys 2003 to 2008 Many (10 – 100) terabytes per surveyMany (10 – 100) terabytes per survey 10 – 100 researchers per survey10 – 100 researchers per survey International collaborations (almost always)International collaborations (almost always) Data is non-proprietary (usually)Data is non-proprietary (usually) Surveys are no longer within the scope of the solo researcher, and also cannot be accommodated by isolated computing and storage facilities. Enter Grid Computing and the Virtual Observatory New surveys of the whole sky need a new paradigm: enter the Virtual Observatory.

3. Aus-VO and APAC Grid Project 10 institutions; 4 large grants over 2-4 years (LIEF & APAC)10 institutions; 4 large grants over 2-4 years (LIEF & APAC) VO Data Warehousing (10 major datasets)VO Data Warehousing (10 major datasets) Gravity Wave Research GridGravity Wave Research Grid VO Theory PortalVO Theory Portal Registry, storage service, hpc, query languages, visualisation, data miningRegistry, storage service, hpc, query languages, visualisation, data mining Melbourne-led (at present)Melbourne-led (at present) The Australian Astronomy Grid will be developed to handle data storage and access needs.

IVOA and the International Context More than 15 active national VO programs;More than 15 active national VO programs; Multi-million $$ investments in UK, USA and EuropeMulti-million $$ investments in UK, USA and Europe Loose but collegial collaborationLoose but collegial collaboration Responsible for international standardsResponsible for international standards Active meeting program (we have no funds to participate)Active meeting program (we have no funds to participate) The Australian Astronomy Grid will be developed to handle data storage and access needs.

Australian data storage and access Australian astronomy data holdings presently exceed ~40 TB in size, and are growing rapidly.Australian astronomy data holdings presently exceed ~40 TB in size, and are growing rapidly. Typical high-end workstations can store only ~100 GB or so.Typical high-end workstations can store only ~100 GB or so. Providing access to the data – raw and processed – requires a distributed, high- bandwidth network of data servers.Providing access to the data – raw and processed – requires a distributed, high- bandwidth network of data servers. The Australian Virtual Observatory project is developing the Australian Astronomy Grid to handle future demand.The Australian Virtual Observatory project is developing the Australian Astronomy Grid to handle future demand. The Australian Astronomy Grid will be developed to handle data storage and access needs.

The Australian Astronomy Grid 2004

The HI Parkes All Sky Survey Parkes 64m radio telescope in NSW.Parkes 64m radio telescope in NSW. Hyperfine transition of atomic Hydrogen, =21cm.Hyperfine transition of atomic Hydrogen, =21cm. 280 days over 4 years; 40 observers; 1000GB raw data.280 days over 4 years; 40 observers; 1000GB raw data. 400 image “cubes” searched by computer for significant signals.400 image “cubes” searched by computer for significant signals. The Parkes telescope has surveyed the entire southern sky for emission from Hydrogen.

The Two Micron All Sky Survey 4M images4M images 470M point sources470M point sources 1.6M extended sources1.6M extended sources ~500 parameters per source!~500 parameters per source! 25 TB of data!25 TB of data! All-sky map of 1.6 million 2MASS extended sources. Another example is 2MASS which has catalogued nearly half a billion objects in the sky. Jarrett et al., 2000 Less than 10% of the catalogue fits in memory on a typical workstation

Other major surveys... Sloan Digital Sky Survey (SDSS)Sloan Digital Sky Survey (SDSS) –position and brightness of 100M objects –distance to more than 100K quasars –15 Terabytes of data! Radial Velocity Experiment (RAVE)Radial Velocity Experiment (RAVE) –50M stars: velocities, metallicities, and abundance ratios –10 TB of data! Faint Images of the Radio Sky (FIRST)Faint Images of the Radio Sky (FIRST) –811,000 sources with radio continuum flux densities at 20cm wavelength Dozens of major, terabyte-scale survey projects are underway or planned.

Theoretical Astronomy Theory provides models of the phenomena discovered by observations.Theory provides models of the phenomena discovered by observations. Theory makes predictions of what will be seen by future facilities.Theory makes predictions of what will be seen by future facilities. Many theories are non- analytic, and sophisticated numerical simulations are run on supercomputers to produce realisations of synthetic universes.Many theories are non- analytic, and sophisticated numerical simulations are run on supercomputers to produce realisations of synthetic universes. Simulations can produce realisations of synthetic universes from fundamental physics.

Linking theory to observations Simulations are not expected to produce our particular Universe.Simulations are not expected to produce our particular Universe. Instead, they generate systems which can be compared statistically to our Universe.Instead, they generate systems which can be compared statistically to our Universe. Realisations of a good model should be statistically indistinguishable from the observed Universe.Realisations of a good model should be statistically indistinguishable from the observed Universe. Useful statistical comparisons demand high quality data and large numbers of objects independent of how you bin the data.Useful statistical comparisons demand high quality data and large numbers of objects independent of how you bin the data. Deeper, faster and more sophisticated surveys are called for...Deeper, faster and more sophisticated surveys are called for... Bigger and better simulations demand super surveys for statistical comprehension.

4. MWA: Mileura Widefield Array Low frequency radio domain ( MHz) largely unexploredLow frequency radio domain ( MHz) largely unexplored Not easy: ionosphere, FM band, etcNot easy: ionosphere, FM band, etc BUT: aim to detect the first sourcesBUT: aim to detect the first sources and map Epoch of Reionisation One of 3 international experimentsOne of 3 international experiments (strongest project ) (strongest project ) New US/Australian low frequency array

Remote WA, for first light in 2007Remote WA, for first light in TB fibre link to Geraldton6TB fibre link to Geraldton Storage: 100’s TBsStorage: 100’s TBs CPU: 50TflopsCPU: 50Tflops Melbourne, in collaboration with MIT, ATNF, Harvard and othersMelbourne, in collaboration with MIT, ATNF, Harvard and others Industry partnersIndustry partners New low frequency array will use innovative data-handling algorithms

MWA: Basic Approach ‘Desert Australia’ is probably the best site in the world for low frequency astronomy

MWA: Mileura Widefield Array Construction in Melbourne Haystack Observatory Mitcham Multi-skilled theorist

MWA: attennaes New design for antennas

MWA: Early Deployment 3 tiles to be commissioned by Christmas tiles to be commissioned by Christmas SKA site to be selectedSKA site to be selected NSF grant submitted and awarded!NSF grant submitted and awarded! Receiver and correlator design to be determinedReceiver and correlator design to be determined Negotiations with industry partnersNegotiations with industry partners Collaboration and timescales ‘solidifying’ this week

MWA: Signal Processing 500 tiles (x16 dipoles)500 tiles (x16 dipoles) 125,000 baselines, 4 polarization products125,000 baselines, 4 polarization products FPGA based hardwareFPGA based hardware Receiver: analog and mixed-signal front end; digital back endReceiver: analog and mixed-signal front end; digital back end Data stream: ~2 billion visibilities/0.5 sec Data stream: ~2 billion visibilities/0.5 sec Technical requirements and directions

Melbourne Astrophysics Requirements High Bandwidth Communications (Access Grid): scientific collaboration & conferencesHigh Bandwidth Communications (Access Grid): scientific collaboration & conferences Functional Grid: storage & processingFunctional Grid: storage & processing Institutional Commitment: planning, resourcing, r&d,Institutional Commitment: planning, resourcing, r&d, Institutional Leadership: NEWInstitutional Leadership: NEW