GSC2 Maintenance GSC2 Annual meeting 2001. Database administrative tasks Database production tasks Identification and correction of errors Processing.

Slides:



Advertisements
Similar presentations
Testing Relational Database
Advertisements

SolidWorks Enterprise PDM Data Loading Strategies
Configuration management
Networking Essentials Lab 3 & 4 Review. If you have configured an event log retention setting to Do Not Overwrite Events (Clear Log Manually), what happens.
Easy to use Ability to attach policies/procedures to call types Ability to schedule calls in advance Officer safety alerts Robust search capabilities.
TRANSACTION PROCESSING SYSTEM ROHIT KHOKHER. TRANSACTION RECOVERY TRANSACTION RECOVERY TRANSACTION STATES SERIALIZABILITY CONFLICT SERIALIZABILITY VIEW.
DB-03: A Tour of the OpenEdge™ RDBMS Storage Architecture Richard Banville Technical Fellow.
Getting the most out of insect-related data. A major issue for pollinator studies is to find out what affects the number of various insects. Example from.
HIGH PROPER MOTION WHITE DWARF CANDIDATES GSCII Annual Meeting October CBBS, Stevensville (MD) by Daniela Carollo Osservatorio Astronomico.
Kapsalakis Giorgos - AM: 1959 HY459 - Internet Measurements Fall 2010.
Measuring the height of Lunar Mountains using data from the Liverpool Telescope.
Software Delivery. Software Delivery Management  Managing Requirements and Changes  Managing Resources  Managing Configuration  Managing Defects 
Chapter 19: Network Management Business Data Communications, 4e.
Reference: Message Passing Fundamentals.
A Web service for Distributed Covariance Computation on Astronomy Catalogs Presented by Haimonti Dutta CMSC 691D.
Implementation/Acceptance Testing CSE Week 8 / 1 CSE9020 Case Study Week 8 Implementation and Acceptance Testing.
EE694v-Verification-Lect5-1- Lecture 5 - Verification Tools Automation improves the efficiency and reliability of the verification process Some tools,
Implementation/Acceptance Testing / 1 Implementation and Acceptance Testing Physical Implementation Criteria: 1. Data availability 2. Data reliability.
Astro-DISC: Astronomy and cosmology applications of distributed super computing.
KDD for Science Data Analysis Issues and Examples.
Testing - an Overview September 10, What is it, Why do it? Testing is a set of activities aimed at validating that an attribute or capability.
Design, Implementation and Maintenance
The Use of Infrared Color-Color Plots to Identify Rare Objects in the Galactic Mid-Plane Jessica Fuselier Dr. Robert Benjamin, advisor.
Brief Overview of Data Processing of Afghanistan Household Listing, Pilot Census Results, Population and Housing Census and NRVA Survey Brief Overview.
MSF Testing Introduction Functional Testing Performance Testing.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Chapter 1 Database Systems. Good decisions require good information derived from raw facts Data is managed most efficiently when stored in a database.
Migration XenDesktop 7. © 2013 Citrix | Confidential – Do Not Distribute Migration prerequisites Set up a XenDesktop 7 Site, including the site database.
Ch 8.1 Numerical Methods: The Euler or Tangent Line Method
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
TESTING STRATEGY Requires a focus because there are many possible test areas and different types of testing available for each one of those areas. Because.
1 TIPS 2011 May Persistence in the WFC3 IR detector Knox S. Long.
Facilimanage Dynamics aka “Facilies” CS 499 Final Presentation Curtis McKay Manneet Singh Brad Vonder Haar.
An Investigation of Oracle and SQL Server with respect to Integrity, and SQL Language standards Presented by: Paul Tarwireyi Supervisor: John Ebden Date:
Alexandre A. P. Suaide VI DOSAR workshop, São Paulo, 2005 STAR grid activities and São Paulo experience.
SPACE TELESCOPE SCIENCE INSTITUTE Operated for NASA by AURA COS Pipeline Language(s) We plan to develop CALCOS using Python and C Another programming language?
Chapter 1 In-lab Quiz Next week
Testing Session Testing Team-Release Management Team.
Virtis-Opis Beta Testing Todd S. Thompson, PE South Dakota DOT Office of Bridge Design August 3, 2011.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Patrizia Ferrero 3rd Integral Bart Work Shop Chocerady - November 1-3, Fast dissemination of GRB afterglow information Patrizia Ferrero (IASF-BO,
GSC-II Plate Processing Pipeline Status C. Loomis, C. Sturch, F. Guglielmetti.
08/30/05GDM Project Presentation Lower Storage Summary of activity on 8/30/2005.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
Introduction for Basic Epidemiological Analysis for Surveillance Data National Center for Immunization & Respiratory Diseases Influenza Division.
September Interface Kickoff Sunflower Project Statewide Management and Reporting Tool Update September 02, 2009.
7 th Annual GSC-II Project Meeting C B A S GSC-II and DSS-II Projects 2001 Annual Report Brian McLean 22 nd October 2001.
Virtual Survey System sept 04 ASTRO-WISE- federation OmegaCEN AstroWise a Virtual Survey System OmegaCAM – Lofar – AstroGrid –((G)A) VO AstroWise a Virtual.
Week 7 : Chapter 7 Agenda SQL 710 Maintenance Plan:
Data Analysis Software Development Hisanori Furusawa ADC, NAOJ For HSC analysis software team 1.
Increasing Efficiency in Data Collection Processes Arie Aharon, Israel Central Bureau of Statistics.
The COMPASS (Catalogs of Objects and Measure Parameters for All Sky Surveys) Database Overview Gretchen Greene, Brian McLean, David Wolfe, and Charles.
Software Maintenance Speaker: Jerry Gao Ph.D. San Jose State University URL: Sept., 2001.
Software Development Problem Analysis and Specification Design Implementation (Coding) Testing, Execution and Debugging Maintenance.
Why A Software Review? Now have experience of real data and first major analysis results –What have we learned? –How should that change what we do next.
MGL/OATo 7th GSC2 Meeting, Barolo (I), 22 Oct 2001 The GSC2.2 Catalog: Global Properties Mario G. Lattanzi Osservatorio Astronomico di Torino.
Infrastructure for Data Warehouses. Basics Of Data Access Data Store Machine Memory Buffer Memory Cache Data Store Buffer Bus Structure.
Fundamentals of Workflow Analysis and Process Redesign Unit Process Change Implementation and Evaluation.
Chapter 10 Information Systems Development. Learning Objectives Upon successful completion of this chapter, you will be able to: Explain the overall process.
IMS 4212: Database Implementation 1 Dr. Lawrence West, Management Dept., University of Central Florida Physical Database Implementation—Topics.
Copyright , Dennis J. Frailey CSE Software Measurement and Quality Engineering CSE8314 M00 - Version 7.09 SMU CSE 8314 Software Measurement.
Photometric Calibration Jorge F. García Yus GEMINI Observatory Barolo 2001.
The World Is Our Office 11i Upgrade Versus Install 10 Questions to Consider.
Sequential Processing to Update a File Please use speaker notes for additional information!
26th October 2005 HST Calibration Workshop 1 The New GSC-II and it’s Use for HST Brian McLean Archive Sciences Branch.
Software Test Plan Why do you need a test plan? –Provides a road map –Provides a feasibility check of: Resources/Cost Schedule Goal What is a test plan?
GSPC -II Program GOAL: extend GSPC-I photometry to B = V ˜ 20 add R band to calibrate red second-epoch surveys HOW: take B,V,R CCD exposures centered at.
Human Computer Interaction Lecture 21 User Support
Introduction To DBMS.
COMPASS Database SPACE TELESCOPE SCIENCE INSTITUTE Gretchen Greene
Presentation transcript:

GSC2 Maintenance GSC2 Annual meeting 2001

Database administrative tasks Database production tasks Identification and correction of errors Processing Statistics and where do we go?

Database Administrative tasks System and Database upgrades System upgrade to Windows 2000 Hard disk storage increased to 4TB RAID Objectivity 6.1 latest version has browsing and object manipulation improvements as well as greatly enhanced the transaction cleanup time. New version expected soon : PYTHON BINDING for Objectivity will be supported.

DB Admin. (cont) Database file migration into the new disk storage –all files required re-registration in addition to migrating into the disk storage. –file access error in migration. Vendor provided solution implemented and all files registered. –Reassessed the distribution of files within the RAID system based on previous problems with disk space. –We had implemented an ad hoc fix to highly non-homogeneous distribution of objects with respect to the HTM. –Some disks nearly empty and others nearly full!

N0

S3 N1 N2 N3 S0 S1 S2 N11 N33N32N31N30 N23N22N21N20 N13N10 N03N01N02N00 N12 S03S02S01S00 S12S11S10S13 S23S22S21S20 S33S32S31S30 J: N00 N01 K: N02 L: N03 M: N10 N11 N12 N13 N: N20 N21 N22 N23 O: N30 N31 P: N32 N33 Q: S00 S01 S02 S03 R: S10 S: S11 T: S12 S22 U: S20 S21 V: S22 W: S23 X: S30 Y: S31 S32 S33 WHAT THIS MEANS IS WE NOW HAVE 16 DRIVES EACH AT ABOUT 4% TO 8% OF CAPACITY. PLENTY OF ROOM TO GROW AND EFFICIENT OPERATIONS

Database production tasks All tasks currently in production required to be recompiled and rebuilt. Integration of PYTHON scripts into day to day production. Insertion of reference catalogs into database. Streamlining administrative tasks and integration into production tasks. Porting photosol and new classification tasks into windows 2000 environment.

Identification and correction of errors Complexity Ten’s of thousands of lines of code (C, FORTRAN, perl, idl, dcl, C++…) 3 operating systems Nearly 1 billion objects Greater then 3600 unique photographic plates response, uniformity of the glass under stress, physics and chemistry of the emulsion and the manufacturing process… A great number of factors associated with observational astronomy. seeing, atmospheric transmission, temperature, extinction, telescope tracking, image quality…

Identification and correction of errors Database errors –Corruption –Referential integrity

Database errors corruption and referential integrity Effected small amount of data on the order of fractions of a percent. Complete rebuild of 4 or 5 databases for gsc2.2 delivery due to corruption. Corruption is most serious due to the unknown nature of how it occurs Especially difficult due to the hands off large scale production efforts we employ with tasks running for weeks on end over large datasets and complicated production tasks. Vender request to help in order so they can better understand Reference integrity fundamental to our project. likely cause is concurrent access between database applications and possibly between applications and administrative tasks. We do have some utilities to check for zero reference objects and to statically look at various ratios of 1,2,…,n references. Complications due to various factors (primarily the complicated nature of the plate overlap regions). Use of this tool is mainly as a result of some other additional indication of problems. The extent of this problem is again fairly small with only a handful of databases requiring un-matching and re-matching.

Database errors Matching integrity Clearly visible on the sky maps. Can be result of various causes. in cases where it only occurs in overlap region could be the result database timing and file access problems as well as astrometric problems (astrometry tasks are very robust and the reduction has been very uniform for the 2 nd epoch surveys). The identification and correction of matching problems is made more difficult due to the difference between the plate based matching and the region based database.

15 h 12 h N322 N321 In addition to J and F POSSII fields 507, 508, 442, 443 we have the IV N fields 507 And 508 which do not have magnitude selected limits imposed in the export task. The quick V fields N321 and N322 are loaded as well. All the matching is reasonably well done in the plate centers. N

Take a closer look in the North east corner of field 507 Bright stars: 1.Entry with F, J, N, V 2.2 nd entry with F, N 3.3 rd entry with V As well as various entries that result from lack of a magnitude limit for V and N.

So ? Whatever happened to create this must have been a fairly complicated sequence of events. Clearly for some reason all the plates matched well at the center of the field but not around the edges. Evidence suggests that this is NOT an astrometric problem. Completely un match all the plates in the region. Verify the astrometric and photometric solutions for the IV N and quick V plates (I am assuming that there is a reason to include these fields despite the fact they really do not belong in this release). Re-match all the regions on all the plates again. Re-export all those regions and check.

What is the point? No single method or task was able to detect and explain why this region looked so different. Visualization tools: skycat showsky, fitsview and IDL were all used but care must be exercised as different problems (photometric error) could produce similar results. Easy to generate global statistics like matching ratios and object index counts… only gave very qualitative indications that are hard to interpret. The cause of the problem remains unknown. On the order of 50 fields or 100 plates may be affected in a similar fashion requiring re-matching. The difficulty is in the identification due to the complicated plate overlaps. The data is completely fixable! And this allows us to focus on the important issues. (science, calibration updates, loading and matching new surveys…)

Photometric Errors Fairly easy to identify and fix if they are large. From the plate maps it is fairly obvious that the plate to plate consistency is fairly good. Several cases found to occur in fields without good sequences deeper then GSPC1 (14 th to 15 th mag).

Field 219 north Examination of J-F showed decent agreement to around 14 th mag. Diverged at fainter magnitude to around mean J-F near 18 was about 2.5 to 3.0. Also turned out that the cause of the error was an isolated procedural error and an acceptable calibration had been performed but had failed to be applied properly.

Catalog processing Southern IVN (IS) 44 % complete North IVN (XI) 35% complete POSS1 E 100% complete Small number have been loaded and matched. Infrared surveys could be completed in relatively short time if concerted effort was made. May have to make some hard decisions in light of staffing and resource issues.

Conclusions 1.Database maintenance continues to be a high priority issue prior to proceeding with large scale operations such as loading and matching new surveys. 2.Data quality and integrity is being addressed but may not be incorporated into the export catalog on a fix by fix basis. High priority requests could be accommodated to some degree. 3.Estimates on the amount of data that have been compromised (primarily in the matching integrity) are at less then 10%. 4.We can identify and fix the vast majority of problems. 5.Data processing and future enhancements to the GSC2 are both planned and proceeding. 6.We are grateful and deeply indebted to those working in collaboration with us to help in the analysis and better understand this massive dataset.