Scientific Paradigm First : Observations Second : Theory Third: Computer simulations Fourth: Data mining.

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Data Mining, ADQL, & The National Virtual Observatory's OpenSkyQuery Utility by Richard Doc Kinne, KQR 2008 AAVSO Fall Conference Nantucket, MA.
9 September 2005NVO Summer School Aspen Astronomical Dataset Query Language (ADQL) Ray Plante T HE US N ATIONAL V IRTUAL O BSERVATORY.
CASDA Virtual Observatory CSIRO ASTRONOMY AND SPACE SCIENCE Arkadi Kosmynin 11 March 2014.
Databases for the 'Pi of the Sky' experiment Marek Biskup Warsaw University.
A Web service for Distributed Covariance Computation on Astronomy Catalogs Presented by Haimonti Dutta CMSC 691D.
Computational Physics Kepler Dr. Guy Tel-Zur. This presentations follows “The Getting Started with Kepler” guide. A tutorial style manual for scientists.
DATA WAREHOUSING.
14 1 Chapter 14 Database Connectivity and Web Development Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Compilation of stellar fundamental parameters from literature : high quality observations + primary methods Calibration stars for astrophysical parametrization.
An Agent-Oriented Approach to the Integration of Information Sources Michael Christoffel Institute for Program Structures and Data Organization, University.
MonetDB/SQL Meets SkyServer: the Challenges of a Scientific Database Milena Ivanova, Niels Nes, Romulo Goncalves, Martin Kersten CWI, Amsterdam Presented.
Lesson 2 Technology: Federated Searching Explained.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Implementing the DDI A new backoffice system at WSA Ron Dekker Netherlands Organisation for Scientific Research.
Applying Multi-Criteria Optimisation to Develop Cognitive Models Peter Lane University of Hertfordshire Fernand Gobet Brunel University.
SDSS Web Services Tamás Budavári Johns Hopkins University Coding against the Universe.
SQL Forms Engine Koifman Eran Egri Ozi Supervisor: Ilana David.
The Metadata System of C hina M ete. D ata S ervice S ystem WANG Guofu National Meteorological Information Centre, CMA Metadata Workshop.
Data Mining Techniques
Data Mining Chun-Hung Chou
Graph-RAT Overview By Daniel McEnnis. 2/32 What is Graph-RAT  Relational Analysis Toolkit  Database abstraction layer  Evaluation platform  Robustly.
Cool white dwarfs in the Sloan & SuperCOSMOS Sky Surveys Nigel Hambly, Wide Field Astronomy Unit, IfA, University of Edinburgh.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
The VAO is operated by the VAO, LLC. VAO: Archival follow-up and time series Matthew J. Graham, Caltech/VAO.
Enabling Embedded Systems to access Internet Resources.
1 Research Groups : KEEL: A Software Tool to Assess Evolutionary Algorithms for Data Mining Problems SCI 2 SMetrology and Models Intelligent.
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
Chapter 1 Introduction to Data Mining
Software Agents for Web Mining FYP Project by: Shuchi Mittal Quek Siew Guat Patricia Professor: Franklin Fu.
1 Pattern Recognition Concepts How should objects be represented? Algorithms for recognition/matching * nearest neighbors * decision tree * decision functions.
UWG 2013 Meeting PO.DAAC Web Services Demo. What are PO.DAAC Web Services?
Chapter 10: The Data Tier We discuss back-end data storage for Web applications, relational data, and using the MySQL database server for back-end storage.
Wiss. Beirat AIP, ClusterFinder & VO-Methods H. Enke German Astrophysical Virtual Observatory ClusterFinder VO Methods for Astronomical Applications.
“curator” DB design Curator meeting, GFDL, Sep 20.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Gyöngyi Kerekes Eötvös Lóránd University, Budapest MAGPOP 2008, Paris István Csabai László Dobos Márton Trencséni.
Multidimensional classification of burst triggers from the fifth science run of LIGO Soma Mukherjee CGWA, UTB GWDAW11, Potsdam, 12/18/06 LIGO-G
14 May 2003Scientific Use Cases - Masahiro Tanaka1 Scientific Use Cases to be implemented to IVOQL Masahiro Tanaka and JVO team (NAOJ) IVOA Interoperability.
EÖTVÖS UNIVERSITY BUDAPEST Department of Physics of Complex Systems Photometric parallax estimation using the MILES catalog and BaSeL models István Csabai.
Source catalog generation Aim: Build the LAT source catalog (1, 3, 5 years) Jean Ballet, CEA SaclayGSFC, 29 June 2005 Four main functions: Find unknown.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
Persistent Digital Archives and Library System (PeDALS)
The International Virtual Observatory Alliance (IVOA) interoperability in action.
Multidimensional classification of burst triggers from LIGO S5 run Soma Mukherjee for the LSC Center for Gravitational Wave Astronomy University of Texas.
ICT-enabled Agricultural Science for Development Scenarios, Opportunities, Issues by ICTs transforming agricultural science, research & technology generation.
German Astrophysical Virtual Observatory Overview and Results So Far W. Voges, G. Lemson, H.-M. Adorf.
Mireille Louys et al., OV-France Theory WG meeting, LYON, June DALIA : an observation vs simulation comparison frame work What could be a Model.
Discovery, Study, Classification and Modeling of Variable Stars Natalia A. Virnina Department of High and Applied Mathematics, Odessa National Maritime.
How Web Database Architectures Work CPS181s April 8, 2003.
CNRS Documentation project : CCSD (Center for Direct Scientific Communication ) Htask meeting (Madrid) 06/12/ Lyon Daniel Charnay / Hélène Jamet.
Lecture 3 With every passing hour our solar system comes forty-three thousand miles closer to globular cluster 13 in the constellation Hercules, and still.
Young Star Disk Evolution in the Planet Formation Epoch Gregory Mace Mentor: Lisa Prato Northern Arizona University Physics and Astronomy.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
June 27-29, DC2 Software Workshop - 1 Tom Stephens GSSC Database Programmer GSSC Data Servers for DC2.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI OpenSource GeoSpatial Catalogue Platform-as-a-Service Salvatore Pinto Cloud.
Spatial Searches in the ODM. slide 2 Common Spatial Questions Points in region queries 1.Find all objects in this region 2.Find all “good” objects (not.
Database and Data API. Database Different data sources: International Offices Web Design the data schema based on the data found Transform all the data.
Data Mining Introduction to data mining concepts.
School of Computer Science & Engineering
Transfer Learning in Astronomy: A New Machine Learning Paradigm
Introduction to Data Mining
Data Mining: Concepts and Techniques Course Outline
What is Pattern Recognition?
Social Research Methodology and Supplementary Documentation John Kallas University of the Aegean, Department of Sociology.
Database Connectivity and Web Development
Database Connectivity and Web Development
Research Institute for Future Media Computing
Health & Consumers DG SANCO Unit A.4 Information systems
CS855 Overview Dr. Charles Tappert.
Presentation transcript:

Scientific Paradigm First : Observations Second : Theory Third: Computer simulations Fourth: Data mining

Data Mining Database: Data format Access Data model Preprocessing Processing

Data Mining: Connection

Data Mining: Concept Data Preservation: OAIS Data Formats: FITS +metadata Access & Interfaces: http, web interfaces, XML Applocations

Data Mining: Implementation

Data Mining in Astronomy Use cases Cross-identification Parametric search Pattern recognition Coordinate transformation Source extraction

Cross-identification Different coordinates Different epochs Reference system

Cross-identification: HTM Time consuming Distance

Parametric search Photometric and coordinate search Object parameters (shapes, classificators) SDSS: benchmark of 20 “typical” queries Search for Cataclysmic Variables and pre-CVs with White Dwarfs and very late secondaries: u-g < 0.4 g-r < 0.7 r-i > 0.4 i-z > SELECT run, camCol, rerun, field, objID, u,g,r,i,z, ra, dec INTO ##results FROM PhotoPrimary WHERE (u - g) < 0.4 and (g - r) < 0.7 and (r - i) > 0.4 and (i - z) > 0.4

Pattern recognition Parametric Non-parametric

Coordinate tranformation FFT Wavelets

Source extraction Sextractor