2007-05-15TIG session 3+Millennium database Millennium Database Overview and some first usage experiences Gerard Lemson and the Virgo Consortium astro-ph/0608019.

Slides:



Advertisements
Similar presentations
Building a Mock Universe Cosmological nbody dark matter simulations + Galaxy surveys (SDSS, UKIDSS, 2dF) Access to mock catalogues through VO Provide analysis.
Advertisements

GALAXIES IN DIFFERENT ENVIRONMENTS: VOIDS TO CLUSTERS:  Simulations will require to model full physics:  Cooling, heating, star formation feedbacks…
Where will supersymmetric dark matter first be seen? Liang Gao National observatories of China, CAS.
The 1 st galaxies and the cosmic web: the clustering of galaxy hosts from dark matter simulations Darren Reed Los Alamos National Laboratory arxiv:
Simulating the joint evolution of quasars, galaxies and their large-scale distribution Springel et al., 2005 Presented by Eve LoCastro October 1, 2009.
Astro-2: History of the Universe Lecture 4; April
Why Environment Matters more massive halos. However, it is usually assumed in, for example, semianalytic modelling that the merger history of a dark matter.
Studying the mass assembly and luminosity gap in fossil groups of galaxies from the Millennium Simulation Ali Dariush, University of Birmingham Studying.
In The Beginning  N-body simulations (~100s particles) – to study Cluster formation  Cold collapse produces too steep a density profile (Peebles 1970)
Environmental dependence of halo formation times Geraint Harker.
/19 LeidenMillennium DB Tutorial Introduction to the Millennium Database with an SQL tutorial.
Simon Portegies Zwart (Univ. Amsterdam with 2 GRAPE-6 boards)
Hito-Shura Milestone 4 presentation Harlan Broughton Stephen Link.
Multiple Tiers in Action
Modeling the 3-point correlation function Felipe Marin Department of Astronomy & Astrophysics University of Chicago arXiv: Felipe Marin Department.
Cosmological constraints from models of galaxy clustering Abstract Given a dark matter distribution, the halo occupation distribution (HOD) provides a.
Russ Houberg Senior Technical Architect, MCM KnowledgeLake, Inc.
Easy HTML DB. Michael Cunningham Developer/Database Administrator.
Masami Ouchi (Space Telescope Science Institute) for the SXDS Collaboration Cosmic Web Made of 515 Galaxies at z=5.7 Kona 2005 Ouchi et al ApJ, 620,
Session Objectives Object Types – Query, HTML Table Purpose of the Query and Explanation How to add a Query to a PTF Test Case 2 Session 5 - Query.
Modelling radio galaxies in simulations: CMB contaminants and SKA / Meerkat sources by Fidy A. RAMAMONJISOA MSc Project University of the Western Cape.
Millennium Data Dissemination MPA institute seminar1 possible extensions how ambitious can/should we be?
Toledo, MPA access methods and plans With contributions from JHU : Alex Szalay, Jan Vanderberg MPA: Jeremy Blaizot,
CMSPro Omniversal Apps, Inc.. Application overview CMSPro is an extremely powerful, yet simple, metadata exploration and analysis tool for Business Objects.
Environmental Properties of a Sample of Starburst Galaxies Selected from the 2dFGRS Matt Owers (UNSW) Warrick Couch (UNSW) Chris Blake (UBC) Michael Pracy.
Theory in the German Astrophysical VO Summary: We show results of efforts done within the German Astrophysical Virtual Observatory (GAVO). GAVO has paid.
Stephen Booth EPCC Stephen Booth GridSafe Overview.
, Tuorla Observatory 1 Galaxy groups in ΛCDM simulations and SDSS DR5 P. Nurmi, P. Heinämäki, S. Niemi, J. Holopainen Tuorla Observatory E. Saar,
Cosmological simulations in a relational database: modelling and storing merger trees Gerard Lemson, GAVO, Max-Planck-Institut für extraterrestrische Physik,
Benedetta Ciardi MPA Reionization Nucleosynthesis ‘Dark Ages’ Big Bang Fluctuations begin to condense into first stars and protogalaxies Decoupling matter-radiation.
Dissemination of simulations in the Virtual Observatory Gerard Lemson German Astrophysical Virtual Observatory, Max-Planck Institute for extraterrestrial.
Module 5 Planning for SQL Server® 2008 R2 Indexing.
EÖTVÖS UNIVERSITY BUDAPEST Department of Physics of Complex Systems VO Spectroscopy Workshop, ESAC Spectrum Services 2007 László Dobos (ELTE)
Bookkeeping Tutorial. Bookkeeping & Monitoring Tutorial2 Bookkeeping content  Contains records of all “jobs” and all “files” that are created by production.
Evolution of the Halo Mass Function Zarija Lukić (UIUC) Katrin Heitmann (LANL) Salman Habib (LANL) Sergei Bashinsky (LANL) Paul Ricker (UIUC) astro-ph/
Sean Passmoor Supervised by Dr C. Cress Simulating the Radio Sky.
The coordinated growth of stars, haloes and large-scale structure since z=1 Michael Balogh Department of Physics and Astronomy University of Waterloo.
Millennium Data Dissemination databases and other services Millennium Workshop1.
Making a virtual Universe Adrian Jenkins - ICC, Durham University.
GES 2007, The German Astrophysical Virtual Observatory (GAVO) Knowledge Networking for Astronomy in Germany and abroad Gerard Lemson 1,2, Wolfgang.
Modeling the dependence of galaxy clustering on stellar mass and SEDs Lan Wang Collaborators: Guinevere Kauffmann (MPA) Cheng Li (MPA/SHAO, USTC) Gabriella.
Analysis methods for Milky Way dark matter halo detection Aaron Sander 1, Larry Wai 2, Brian Winer 1, Richard Hughes 1, and Igor Moskalenko 2 1 Department.
Introduction to Morpho RCN Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
Simulations by Ben Moore (Univ. of Zurich)
Theory, Grid and VO Matthias Steinmetz (AIP)
1 Analysing Cosmological Simulations in the Virtual Observatory: Designing and Mining the Millennium Simulation Database Gerard Lemson German Astrophysical.
Adrian Jackson, Stephen Booth EPCC Resource Usage Monitoring and Accounting.
Bookkeeping Tutorial. 2 Bookkeeping content  Contains records of all “jobs” and all “files” that are produced by production jobs  Job:  In fact technically.
Mining Virtual Universes Simulations in a relational database.
Reproducing the Observed Universe with Simulations Qi Guo Max Planck Institute for Astrophysics MPE April 8th, 2008.
联 合 天 体 物 理 中 心 Joint Center for Astrophysics The half-light radius distribution of LBGs and their stellar mass function Chenggang Shu Joint Center for.
Steven Perry Dave Vieglais. W a s a b i Web Applications for the Semantic Architecture of Biodiversity Informatics Overview WASABI is a framework for.
Present-Day Descendants of z=3.1 Ly  Emitting (LAE) Galaxies in the Millennium-II Halo Merger Trees Jean P. Walker Soler – Rutgers University Eric Gawiser.
We created a set of volume limited samples taken from the 2dFGRS (Colless 2001) that contains about 250,000 galaxies with accurate redshifts, is relatively.
KASI Galaxy Evolution Journal Club A Massive Protocluster of Galaxies at a Redshift of z ~ P. L. Capak et al. 2011, Nature, in press (arXive: )
Study of Proto-clusters by Cosmological Simulation Tamon SUWA, Asao HABE (Hokkaido Univ.) Kohji YOSHIKAWA (Tokyo Univ.)
Apache Solr Dima Ionut Daniel. Contents What is Apache Solr? Architecture Features Core Solr Concepts Configuration Conclusions Bibliography.
MESA A Simple Microarray Data Management Server. General MESA is a prototype web-based database solution for the massive amounts of initial data generated.
The Mass-Dependent Role of Galaxy Mergers Kevin Bundy (UC Berkeley) Hubble Symposium March, 2009 Masataka Fukugita, Richard Ellis, Tom Targett Sirio Belli,
A self consistent model of galaxy formation across cosmic time Bruno Henriques Simon White, Peter Thomas Raul Angulo, Qi Guo, Gerard Lemson, Volker Springel.
Using iRODS with the EnginFrame Grid Portal into the GRIDA3 project Francesco Locunto Marco Piras Matteo Vocale.
Light-cone data format and ray-tracing tools
The mock galaxy catalogue for HI survey based on SAMs of galaxy formation 富坚 Guiyang, FRA2015 Shanghai Astronomical Observatory.
Tamas Szalay, Volker Springel, Gerard Lemson
Cross-matching the sky with database server cluster
Clustering properties and environment of AGN
Modeling the dependence of galaxy clustering on stellar mass and SEDs
Core of Coma Cluster (optical)
Brightest ~500,000 Galaxies in the Northern Hemisphere (1977; RA & DEC only) 2-D “lacework” pattern.
Presentation transcript:

TIG session 3+Millennium database Millennium Database Overview and some first usage experiences Gerard Lemson and the Virgo Consortium astro-ph/

TIG session 3+Millennium database The Virgo consortium’s Millennium simulation Millennium simulation –10 billion particles, dark matter only –500 Mpc (~2Gly) periodic box –“concordance model” (as of 2004) initial conditions –64 snapshots – CPU hours –O(30Tb) raw + post-processed data Postprocessing: –dark matter density fields smoothed at various scales (45 * grid cells) –dark matter cluster merger trees (~750 million) –galaxy merger trees (~1 billion/catalogue) DeLucia & Blaizot, 2006 Bower et al, 2006

TIG session 3+Millennium database Dark matter and galaxies

TIG session 3+Millennium database Halos and galaxies

TIG session 3+Millennium database Database design

TIG session 3+Millennium database Database design: “20 queries” 1.Return the galaxies residing in halos of mass between 10^13 and 10^14 solar masses. 2.Return the galaxy content at z=3 of the progenitors of a halo identified at z=0 3.Return the complete halo merger tree for a halo identified at z=0 4.Find properties of all galaxies in haloes of mass 10**14 at redshift 1 which have had a major merger (mass-ratio < 4:1) since redshift Find all the z=3 progenitors of z=0 red ellipticals (i.e. B-V>0.8 B/T > 0.5) 6.Find the descendents at z=1 of all LBG's (i.e. galaxies with SFR>10 Msun/yr) at z=3 7.Find all z=3 galaxies which have NO z=0 descendent. 8.Return all the galaxies within a sphere of radius 3Mpc around a particular halo 9.Find all the z=2 galaxies which were within 1Mpc of a LBG (i.e. SFR>10Msun/yr) at some previous redshift. 10.Find the multiplicity function of halos depending on their environment (overdensity of density field smoothed on certain scale) 11.Find the dependency of halo formation times on environment

TIG session 3+Millennium database Time evolution: merger trees

TIG session 3+Millennium database Merger trees : select prog. from galaxies des, galaxies prog where des.galaxyId = 0 and prog.galaxyId between des.galaxyId and des.lastProgenitorId Leaves : select galaxyId as leaf from galaxies des where galaxyId = lastProgenitorId Branching points : select descendantId from galaxies des where descendantId != -1 group by descendantId having count(*) > 1

TIG session 3+Millennium database More database design features Spatial indices –Peano-Hilbert index links to field (256^3) –Z-curve index (bit interleaved, 256^3) SQLServer2005 CLR integration with C# for range queries –Zone index (ix/iy/iz, 50^3) select * from galaxies where snapnum = 63 and ix = 1 and iy = 5 and iz = 20 Random sampling select * from galaxies where snapnum = 63 and random between 1000 and 2000

TIG session 3+Millennium database the Millennium database web server Web application (Java in Apache tomcat web server) –portal: –public DB access: 30sec/1000rows | 30sec/unlimited rows –private access: 30sec/1000rows | 420sec/unlimited rows –MyDB, 1Gb, sometimes more Access methods –browser with plotting capabilities through VOPlot applet –wget + IDL, R –TOPCAT plugin

TIG session 3+Millennium database

TIG session 3+Millennium database

TIG session 3+Millennium database

TIG session 3+Millennium database Usage statistics Up since Aug 2006 Community notified via preprint server Obtained form DB-base log with SQL > 130 registered users almost 1.7 million queries (not all correct) since March 3, >5 billion rows handled

TIG session 3+Millennium database

TIG session 3+Millennium database Usage patterns Start with milli-Millennium (1/512 of full) Some download complete set Mainly to test approach, SQL Ask for account on full Millennium Run into timeout –either ask me –cut query in pieces –execute via script, using wget (good for hit rate count of site!) MyDB usage –small projects collaborate via results, –upload own data (when local at MPA, or via me)

TIG session 3+Millennium database Conclusions If you have valuable data (and “if you build it”), “they will come” PR helps –astro-ph/ –presentations by owners (Simon White, Volker Springel, Carlos Frenk) Users are not stupid –can and will learn SQL –don’t mind learning SQL (especially when relatively young) –come up with interesting solutions on their own Documentation important –not optimal yet: indexes, internal relationships Help desk (i.e. me) helps and is much appreciated Possible/planned improvements –full upload facility into MyDB –mirror machine with CAS jobs longer timeouts batch querying collaboration easier