Dissemination of simulations in the Virtual Observatory Gerard Lemson German Astrophysical Virtual Observatory, Max-Planck Institute for extraterrestrial.

Slides:



Advertisements
Similar presentations
1 Online Science The World-Wide Telescope as a Prototype For the New Computational Science Jim Gray Microsoft Research
Advertisements

Building a Mock Universe Cosmological nbody dark matter simulations + Galaxy surveys (SDSS, UKIDSS, 2dF) Access to mock catalogues through VO Provide analysis.
The Australian Virtual Observatory e-Science Meeting School of Physics, March 2003 David Barnes.
The Virtual Observatory in Germany and abroad A status report with examples/demos from GAVO and other national VOs.
Where will supersymmetric dark matter first be seen? Liang Gao National observatories of China, CAS.
Hierarchical Clustering Leopoldo Infante Pontificia Universidad Católica de Chile Reunión Latinoamericana de Astronomía Córdoba, septiembre 2001.
Simulating the joint evolution of quasars, galaxies and their large-scale distribution Springel et al., 2005 Presented by Eve LoCastro October 1, 2009.
Why Environment Matters more massive halos. However, it is usually assumed in, for example, semianalytic modelling that the merger history of a dark matter.
Non-linear matter power spectrum to 1% accuracy between dynamical dark energy models Matt Francis University of Sydney Geraint Lewis (University of Sydney)
Studying the mass assembly and luminosity gap in fossil groups of galaxies from the Millennium Simulation Ali Dariush, University of Birmingham Studying.
Environmental dependence of halo formation times Geraint Harker.
A Web service for Distributed Covariance Computation on Astronomy Catalogs Presented by Haimonti Dutta CMSC 691D.
/19 LeidenMillennium DB Tutorial Introduction to the Millennium Database with an SQL tutorial.
Data-Intensive Computing in the Science Community Alex Szalay, JHU.
High Redshift Galaxies (Ever Increasing Numbers).
Leicester Database & Archive Service J. D. Law-Green, J. P. Osborne, R. S. Warwick X-Ray & Observational Astronomy Group, University of Leicester What.
Modeling the 3-point correlation function Felipe Marin Department of Astronomy & Astrophysics University of Chicago arXiv: Felipe Marin Department.
Cosmological constraints from models of galaxy clustering Abstract Given a dark matter distribution, the halo occupation distribution (HOD) provides a.
Astro-DISC: Astronomy and cosmology applications of distributed super computing.
GIANT TO DWARF RATIO OF RED-SEQUENCE GALAXY CLUSTERS Abhishesh N Adhikari Mentor-Jim Annis Fermilab IPM / SDSS August 8, 2007.
A long tradition. e-science, Data Centres, and the Virtual Observatory why is e-science important ? what is the structure of the VO ? what then must we.
Modelling radio galaxies in simulations: CMB contaminants and SKA / Meerkat sources by Fidy A. RAMAMONJISOA MSc Project University of the Western Cape.
, TriestEuro-VO DCA WP5 kick off Euro-VO DCA WP4 and GAVO and (a little) AstroGrid-D Gerard Lemson Euro-VO DCA WP5 kick-off meeting Trieste,
Millennium Data Dissemination MPA institute seminar1 possible extensions how ambitious can/should we be?
The Cosmic Simulator Daniel Kasen (UCB & LBNL) Peter Nugent, Rollin Thomas, Julian Borrill & Christina Siegerist.
Toledo, MPA access methods and plans With contributions from JHU : Alex Szalay, Jan Vanderberg MPA: Jeremy Blaizot,
National Center for Supercomputing Applications Observational Astronomy NCSA projects radio astronomy: CARMA & SKA optical astronomy: DES & LSST access:
TIG session 3+Millennium database Millennium Database Overview and some first usage experiences Gerard Lemson and the Virgo Consortium astro-ph/
GAVO – after one year H.-M. Adorf & the GAVO team.
Theory in the German Astrophysical VO Summary: We show results of efforts done within the German Astrophysical Virtual Observatory (GAVO). GAVO has paid.
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
, Tuorla Observatory 1 Galaxy groups in ΛCDM simulations and SDSS DR5 P. Nurmi, P. Heinämäki, S. Niemi, J. Holopainen Tuorla Observatory E. Saar,
Cosmological simulations in a relational database: modelling and storing merger trees Gerard Lemson, GAVO, Max-Planck-Institut für extraterrestrische Physik,
Wide Field Imagers in Space and the Cluster Forbidden Zone Megan Donahue Space Telescope Science Institute Acknowledgements to: Greg Aldering (LBL) and.
4th EGEE user forum / OGF 25 Catania, TheoSSA on AstroGrid-D Iliya Nickelt (AIP / GAVO), Thomas Rauch (IAAT / GAVO), Harry Enke (AIP.
Theory in the Virtual Observatory Gerard Lemson, GAVO.
EÖTVÖS UNIVERSITY BUDAPEST Department of Physics of Complex Systems VO Spectroscopy Workshop, ESAC Spectrum Services 2007 László Dobos (ELTE)
Lecture 5: Matter Dominated Universe: CMB Anisotropies and Large Scale Structure Today, matter is assembled into structures: filaments, clusters, galaxies,
Federation and Fusion of astronomical information Daniel Egret & Françoise Genova, CDS, Strasbourg Standards and tools for the Virtual Observatories.
Wiss. Beirat AIP, ClusterFinder & VO-Methods H. Enke German Astrophysical Virtual Observatory ClusterFinder VO Methods for Astronomical Applications.
Sean Passmoor Supervised by Dr C. Cress Simulating the Radio Sky.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Millennium Data Dissemination databases and other services Millennium Workshop1.
PHY306 1 Modern cosmology 3: The Growth of Structure Growth of structure in an expanding universe The Jeans length Dark matter Large scale structure simulations.
Making a virtual Universe Adrian Jenkins - ICC, Durham University.
CMU-CS lunch talk, Gerard Lemson1 Computational and statistical problems for the Virtual Observatory With contributions from/thanks to: GAVO.
GES 2007, The German Astrophysical Virtual Observatory (GAVO) Knowledge Networking for Astronomy in Germany and abroad Gerard Lemson 1,2, Wolfgang.
The International Virtual Observatory Alliance (IVOA) interoperability in action.
Modeling the dependence of galaxy clustering on stellar mass and SEDs Lan Wang Collaborators: Guinevere Kauffmann (MPA) Cheng Li (MPA/SHAO, USTC) Gabriella.
Simulations by Ben Moore (Univ. of Zurich)
German Astrophysical Virtual Observatory Overview and Results So Far W. Voges, G. Lemson, H.-M. Adorf.
Gerard Lemson Theory in the VO and the SimDB specification Euro-VO DCA workshop Garching, June 26, 2008 Feedback questionnaire.
Strasbourg, EURO-VO DCA First Board Meeting (GA)VO projects at MPG(E) and participation in VO-DCA (tbcd) Wolfgang Voges, Gerard Lemson.
Theory, Grid and VO Matthias Steinmetz (AIP)
Theory in the VO, Garching, Theory in the Virtual Observatory (TVO) Goals of Euro-VO DCA WP4 Gerard Lemson, GAVO ARI-ZAH, Heidelberg MPE, Garching.
1 Analysing Cosmological Simulations in the Virtual Observatory: Designing and Mining the Millennium Simulation Database Gerard Lemson German Astrophysical.
The Large Synoptic Survey Telescope Project Bob Mann Wide-Field Astronomy Unit University of Edinburgh.
Mining Virtual Universes Simulations in a relational database.
Reproducing the Observed Universe with Simulations Qi Guo Max Planck Institute for Astrophysics MPE April 8th, 2008.
Feasibility of detecting dark energy using bispectrum Yipeng Jing Shanghai Astronomical Observatory Hong Guo and YPJ, in preparation.
KASI Galaxy Evolution Journal Club A Massive Protocluster of Galaxies at a Redshift of z ~ P. L. Capak et al. 2011, Nature, in press (arXive: )
Astronomy toolkits and data structures Andrew Jenkins Durham University.
JENAM 2008 Theory Standards for the Virtual Observatory SimDB + SimDAP.
A self consistent model of galaxy formation across cosmic time Bruno Henriques Simon White, Peter Thomas Raul Angulo, Qi Guo, Gerard Lemson, Volker Springel.
The mock galaxy catalogue for HI survey based on SAMs of galaxy formation 富坚 Guiyang, FRA2015 Shanghai Astronomical Observatory.
Virtual Observatory for cosmological simulations
Moving towards the Virtual Observatory Paolo Padovani, ST-ECF/ESO
Cross-matching the sky with database server cluster
Google Sky.
Brightest ~500,000 Galaxies in the Northern Hemisphere (1977; RA & DEC only) 2-D “lacework” pattern.
Presentation transcript:

Dissemination of simulations in the Virtual Observatory Gerard Lemson German Astrophysical Virtual Observatory, Max-Planck Institute for extraterrestrial physics

Overview The Virtual Observatory Theory/simulations in the VObs Case study: Millennium database –Storing trees in a relational database Virtual telescope prototypes Outlook

Virtual Observatory I Broad goal –Make results of astronomical research, data and applications, more readily available to larger community, and create value-adding services. (Alex’s talk yesterday) Facilitate results- –communication –checking –(re)use –comparison –combination

Combination: a multi-wavelength view of a galaxy merger John Hibbard Radio NASA/CXC/SAO/G. Fabbiano et al. X-Ray Optical

SDSS ROSAT 2MASS FIRST GAIA the problem

ROSAT FIRST GAIA SDSS2MASS work on a solution

Virtual Observatory II Approach: –online availability of datasets and applications –standardized publication and discovery mechanisms –standardized description through common (meta-)data models –standardized selection mechanisms –standardized formats for transmitted data –value added services –introduce new technologies –find clever algorithms Organized in International VO Alliance (IVOA)

Observations in the VO Most VO efforts concentrate on observational data sets –simple observables: photons detected at a certain time from a certain area on the sky –long history of archiving –pre-existing standards (FITS) –valuable over long time (digitising 80 yr old plates) Standards observationally biased –common sky: cone search, SIAP, region –common objects: XMatch –data models: characterisation of sky/time/energy(/no polarisation yet)

Theory in the VO: issues Simulations not so simple –complex observables –no standardisation (not even HDF5) –archiving ad hoc, for local use Moore’s law makes useful lifetime relatively short: few years later can do better Current IVOA standards somewhat irrelevant –no common sky –no common objects –requires data models for content, physics, code

“Moore’s law” for N-body simulations Courtesy Simon White

History of simulations Toomre & Toomre, 1972 Di Matteo, Springel and Hernquist, 2005 Courtesy Volker Springel

So why bother publishing simulations? Simulations are interesting: –For many cases only way to see processes in action –Complex observations require sophisticated models for interpretation Bridging gap in specialisations: not everyone has required expertise to create simulations, though they can analyse them. Many use cases do not require the latest/greatest –exposure time calculator –survey design

A possible formation scenario Courtesy Volker Springel

Detailed observations electron density gas pressure gas temperature Courtesy Alexis Finoguenov, Ulrich Briel, Peter Schuecker, (MPE)

Detailed predictions Courtesy Volker Springel

Case study: Simulations in a relational database Goal: investigate the use of RDB and web services for disseminating results of cosmological N-body simulations. Why database ? –encapsulation of data in terms of logical structure, no need to know about internals of data storage –standard query language for finding information –advanced query optimizers –forces one to think carefully about data structure –speeds up path from science question to answer –facilitates communication –new ways of thinking about results –links to other efforts (Sloan SkyServer)

The Virgo consortium’s Millennium simulation Millennium simulation –10 billion particles, dark matter only –500 Mpc (~2Gly) periodic box –“concordance model” (as of 2004) initial conditions –64 snapshots – CPU hours –O(30Tb) raw + post-processed data play Postprocessing: –dark matter density fields smoothed at various scales (45 * grid cells) –dark matter cluster merger trees (~750 million) –galaxy merger trees (~1 billion/catalogue) DeLucia & Balizot, 2006 Bower et al, 2006

Evolution

Dark matter and galaxies

Halos and galaxies

the Millennium database + web server Post-processing results only SQLServer database –MPA: 2000, soon –Durham: 2005 Web application (Java in Apache tomcat web server) –portal: –public DB access: –private access: –MyDB Access methods –browser with plotting capabilities through VOPlot applet –wget + IDL, R –TOPCAT plugin

Database design: “20 queries” 1.Return the galaxies residing in halos of mass between 10^13 and 10^14 solar masses. 2.Return the galaxy content at z=3 of the progenitors of a halo identified at z=0 3.Return the complete halo merger tree for a halo identified at z=0 4.Find properties of all galaxies in haloes of mass 10**14 at redshift 1 which have had a major merger (mass-ratio < 4:1) since redshift Find all the z=3 progenitors of z=0 red ellipticals (i.e. B-V>0.8 B/T > 0.5) 6.Find the descendents at z=1 of all LBG's (i.e. galaxies with SFR>10 Msun/yr) at z=3 7.Find all z=3 galaxies which have NO z=0 descendent. 8.Return all the galaxies within a sphere of radius 3Mpc around a particular halo 9.Find all the z=2 galaxies which were within 1Mpc of a LBG (i.e. SFR>10Msun/yr) at some previous redshift. 10.Find the multiplicity function of halos depending on their environment (overdensity of density field smoothed on certain scale) 11.Find the dependency of halo formation times on environment

Time evolution: merger trees

Efficient storage of trees in a relational database Goal: allow queries for the formation history of any object No recursion possible, or desired Method: –depth first ordering of trees –label by rank in order –pointer to “last progenitor” below each node –all progenitors have label BETWEEN label of root AND that of last progenitor –cluster table on label

Merger trees : select prog. from galaxies des, galaxies prog where des.galaxyId = 0 and prog.galaxyId between des.galaxyId and des.lastProgenitorId Leaves : select galaxyId as leaf from galaxies des where galaxyId = lastProgenitorId Branching points : select descendantId from galaxies des where descendantId != -1 group by descendantId having count(*) > 1

Main branches Roots and leaves: select des.galaxyId as rootId, min(prog.lastprogenitorid) as leafId into rootLeaf from galaxies des, galaxies prog where des.galaxyId = 0 and prog.galaxyId between des.galaxyId and des.lastProgenitorId Main branch select rl.rootId, b.* from rootLeaf rl, galaxies b where prog.galaxyId between rl.rootId and rl.leafId

More database design features Spatial indices –Peano-Hilbert index links to field (256^3) –Z-curve index (bit interleaved, 256^3) SQLServer2005 CLR integration with C# for range queries –Zone index (ix/iy/iz, 50^3) select * from galaxies where snapnum = 63 and ix = 1 and iy = 5 and iz = 20 Random sampling select * from galaxies where snapnum = 63 and random between 1000 and 2000

Under construction Batch processing through CAS jobs Mock catalogues –pre-calculated in database –online MoMaFMoMaF Utilise PCA for storing photometric predictions Tree comparisons: statistics of branch lengths, node counts; tree edit distance.

Virtual telescopes Virtual observations of virtual universe Produce data products that are as similar to observational results as possible: –images –spectra –catalogues Include atmosphere and telescope effects –predict –analyse: easier to add problems than to remove them

Prototype examples No realistic telescope yet Planck simulator – Mock catalogs through Millennium – Hydro simulations of galaxy clusters –

Mock Map Making Facility Blaizot, J. et al Mon.Not.Roy.Astron.Soc. 360 (2005)

Conclusions and outlook Simulation data valuable addition to VObs Especially with interfaces similar to observational ones IVOA theory interest group standards under development: SNAP, Semantics, Simulation data model Virtual telescopes provide perfect use case for testing VObs ideas: –requires very different specialisations –not co-located: needs distributed treatment –requires standards for data structure and service APIs, as well as models linking observations and theory –high performance computational infrastructure for scientifically meaningful results Distributed virtual telescope configuration

Acknowledgments Virgo consortium, in particular: –Volker Springel, Simon White, Gabriella DeLucia, Jeremy Blaizot (MPA, Munich, Germany), –Carlos Frenk, Richard Bower, John Helly (ICC, Durham, UK) Alex Szalay, Jan van den Berg (JHU) GAVO is funded by the German Federal Ministry for Education and Research

Relevant references and links Springel, V., et al (2005), Simulations of the formation, evolution and clustering of galaxies and quasars, Nature, 435, 629 Lemson, G. and the Virgo Consortium (2006), Halo and Galaxy Formation Histories from the Millennium Simulation: Public release of a VO-oriented and SQL-queryable database for studying the evolution of galaxies in the LCDM cosmogony, Lemson, G. & Springel, V. (2005), Cosmological Simulations in a Relational Database: Modelling and Storing Merger Trees, ASPC, 351, Astronomical Data Analysis Software and Systems XV De Lucia, G. & Blaizot, J. (2006) The hierarchical formation of the brightest cluster galaxies, Bower, R. et al (2006), The brokern hierarchy of galaxy formation, Mon.Not.Roy.Astron.Soc and vo.org/Millennium vo.org/Millennium