HUBBLE LEGACY ARCHIVE STSCI Astronomical Data Tagging Web 2.0 meets Astronomy in the HLA Niall I. Gaffney, W. Warren Miller (STScI)

Slides:



Advertisements
Similar presentations
Discovery and Exploration in the VO Chris Miller NOAO/CTIO La Serena, Chile T HE US N ATIONAL V IRTUAL O BSERVATORY.
Advertisements

Overview of Current and Forthcoming GALEX Search Capabilities and Data Products Current Search Options New GALEX Fluxes gPhoton.
Data Mining and Text Analytics Advertising Laura Quinn.
MAST-VizieR/NED cross correlation tutorial 1. Introduction Figure 1: Screenshot of the MAST VizieR Catalog Search Form. or enter here as object class:
Bringing Science to the World Museum Victoria Natural Science Collections Online 2008 EMu Users’ Meeting, Wellington, New Zealand Alex Chubaty, Collection.
Web + VO + Database Technologies = HLA Footprints STScI: Gretchen Greene, Steve Lubow, Brian McLean, Rick White and the HLA Team JHU: Alex Szalay and Tamas.
Search Engines. 2 What Are They?  Four Components  A database of references to webpages  An indexing robot that crawls the WWW  An interface  Enables.
Scoil an Léinn Eolais agus na Leabharlannaíochta UCD UCD School of Information and Library Studies OJAX: A Web 2.0 search user interface Judith Wusteman.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. 1 The Architecture of a Large-Scale Web Search and Query Engine.
An idea of graphical browsing tool for Chandra archives Ken Ebisawa (ISAS/JAXA) 1.
Rich Tags: Cross-Repository Browsing Cross-site browsing and exploration of digital repositories Daniel Alexander Smith
Tracking Chandra Science Productivity Publication Metrics special thanks to Mihoko Yukita (CDO) Sherry Winkelman (Archive Group) Paul J. Green (CDO)
The Web 2.0 and the NOAO NVO Portal Christopher J. Miller Data Products Program CTIO/NOAO.
The Privacy Tug of War: Advertisers vs. Consumers Presented by Group F.
Website Design & Development Proposal Dental Clinic Website By Expert Web Design Solutions All rights reserved –
CONCRETE SOFTWARE SOLUTIONS PVT. LTD. A leading Digital Marketing Firm In India.
Hubble Legacy Archive Lee Quick - TIPS meeting July 19, 2012 Goals Data History Current Work Demo.
HTTP: cookies and advertising Concepts to cover:  web page content (including ads) from multiple site: composition at client  cookies  third-party cookies:
Digitized Sky Survey Update Brian McLean : Archive Sciences Branch / Operations and Engineering Division.
1 Anonshare 2.0 P2P Anonymous Browsing History Share Frank Chiang Terry Go Rui Ma Anita Mathew.
Google Xtras. Google Maps Google Latitude tests Site mapping What is it? A New Standard: Search Engine Giants Adopt the XML Protocol In 2005, the search.
Data Management Subsystem: Data Processing, Calibration and Archive Systems for JWST with implications for HST Gretchen Greene & Perry Greenfield.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
Dec 2, 2014 Hubble Legacy Archive and Hubble Source Catalog Rick White & Brad Whitmore Current teams: HLA: Michael Dulude, Mark Kyprianou, Steve Lubow,
Dec 2, 2014 MAST Data Discovery Portal Tom Donaldson Tony Rogers.
Functions and Demo of Astrogrid 1.1 China-VO Haijun Tian.
HUBBLE LEGACY ARCHIVE STSCI HLA: Where We Are Today Warren Miller Technology Showcase March 2007.
PERSONALIZED SEARCH Ram Nithin Baalay. Personalized Search? Search Engine: A Vital Need Next level of Intelligent Information Retrieval. Retrieval of.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
Introduction to Nutch CSCI 572: Information Retrieval and Search Engines Summer 2010.
Dixon Jones Receptional Internet Marketing. WWW: Machine or Alive?
Online Advertising Greg Lackey. Advertising Life Cycle The Past Mass media Current Media fragmentation The Future Target market Audio/visual enhancements.
APT Standard Target Name Agenda: History of APT Standard Target Name PR DMS Level 3 Data File Names SANTA: MAST Name Resolver Pros/Cons of adding Standard.
Graphic material for HSC FAQ – March 23, Five things you should know about the Hubble Source Catalog (HSC) 1. Coverage can be very non-uniform,
European New HST & MMI Demo Nacho León María Arévalo Jonas Haase Jesús Salgado Deborah Baines Bruno Merín ESAC 20 January 2015.
Footprint Service Specification IVOA Interop Meeting Trieste 2008 Gretchen Greene and Tamas Budavari.
HST Spectroscopic Legacy Working Group Areas for Recommended Work in FY2014 and Beyond Meeting 12 Jun 2013.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
University of Illinois at Urbana-Champaign BeeSpace Navigator v4.0 and Gene Summarizer beespace.uiuc.edu `
The Business Model of Google MBAA 609 R. Nakatsu.
The Birth & Growth of Web 2.0 COM 415-Fall II Ashley Velasco (Prince)
Sharon M. Jordan Assistant Director for Program Integration U.S. DOE Office of Scientific & Technical Information Vantage Point: Government R&D Results.
Enhancing Science with the Hubble Legacy Archive Bologna, January 29, 2008 Brad Whitmore OUTLINE The potential for enhanced archival science What is the.
Common Archive Observation Model (CAOM) What is it and why does JWST care?
SPACE TELESCOPE SCIENCE INSTITUTE Operated for NASA by AURA WFC3 and StarView
Nov Common Archive Observation Model What is CAOM and why should MAST use it? Brian McLean.
The World Wide Web: Information Resource. Hock, Randolph. The Extreme Searcher’s Internet Handbook. 2 nd ed. CyberAge Books: Medford. (2007). Internet.
MAST Users Group – July 9, 2009 other planned and ongoing projects within MAST.
EMu Interface and the Web Clear identification of web fields for users and administrators Visual identifier of the web presentations in EMu, ie Collection.
AstroGrid Usability & Docs, JBO, 6 th Dec 2005 Jonathan Tedds Leicester University AstroGrid Usability & Documentation Usability –Documentation –Infrastructure.
Virtual Observatories, Press Release Images, and Web Services Dr. Frank Summers Space Telescope Science Institute November 3, 2005.
The European Hubble Space Telescope Legacy Archive Wolfram Freudling.
F. Genova, VO as a Data Grid, 2003/06/301 Interoperability of astronomy data bases Françoise Genova, CDS.
Nov Hubble Legacy Archive & Hubble Source Catalog Rick White & Brad Whitmore November 18, 2013.
VERSION 12.5 HIHGLIGHTS Lead Developer - Rob Nikkel.
Where (Online) to Display Your Ad Jim Jansen College of Information Sciences and Technology The Pennsylvania State University
Faculty meeting - 13 Dec 2006 The Hubble Legacy Archive Harald Kuntschner & ST-ECF staff 13 December 2006.
Document Clustering for Natural Language Dialogue-based IR (Google for the Blind) Antoine Raux IR Seminar and Lab Fall 2003 Initial Presentation.
Leveraging Web Content Management in SharePoint 2013 Christina Wheeler.
 GEETHA P.  Originally coined by Tim O’Reilly Publishing Media  Second generation of services available on www.  Lets people collaborate and share.
HST’s Productivity and Impact Jill Lagerstrom Daniel Apai.
Introduction to SHERPA RoMEO and its Significance for Publishers
HOW TO USE GOOGLE WEBMASTER TOOLS TO IMPROVE SEO ? GOOGLE WEBMASTEER.
The Hubble Legacy Archive (HLA) Slitless Spectroscopy Project
HST Spectroscopic Legacy Working Group Areas for Recommended Work in FY2014 and Beyond Meeting 26 Jun 2013.
A research literature search engine with abbreviation recognition
A proposed pilot project with AAS Journals
Martin Rajman, EPFL Switzerland & Martin Vesely, CERN Switzerland
Overview Blogs and wikis are two Web 2.0 tools that allow users to publish content online Blogs function as online journals Wikis are collections of searchable,
X-ray high resolution spectra in the VO: the case of XMM-Newton RGS
Presentation transcript:

HUBBLE LEGACY ARCHIVE STSCI Astronomical Data Tagging Web 2.0 meets Astronomy in the HLA Niall I. Gaffney, W. Warren Miller (STScI)

HUBBLE LEGACY ARCHIVE STSCI What is the HLA Hubble Legacy Archive –Joint project STScI, ST-ECF, CADC –Providing best archive data products from HST data Improving WCS solutions Combine data Extracting image photometry and GRISM spectra Create Simple and Powerful User Interface –Typical HST archive user visits once a year –Get the right data into the users own environment Users want to use their daily applications (e.g. web) Users have their own data analysis system

HUBBLE LEGACY ARCHIVE STSCI HLA UI Philosophy UI “Requirements” from users –Interfaces must be simple, understandable, powerful, rich, self-explanatory “Google like” –Interface must feature the Data and not the Query –Interface must NOT get in the way of getting data and using them in the tools users are accustomed to –Interface should expose information that previous interfaces have not been able to

HUBBLE LEGACY ARCHIVE STSCI Early Data Release - Target Oriented

HUBBLE LEGACY ARCHIVE STSCI Who else does this…

HUBBLE LEGACY ARCHIVE STSCI What is Web 2.0 Web 2.0 is a change in how we use the network Web 2.0 is NOT dynamic web pages (AJAX) –Web 2.0 is enabled by AJAX Web 2.0 are applications and APIs delivered via the web –Netscape vs. Google –DoubleClick vs. AdSense –My Home Page vs. My Blog or MySpace A synergy between services and information to provide a more focused information service User aware and user provided (context) Tim O’Reilly article with long discussion

HUBBLE LEGACY ARCHIVE STSCI YouTube - Data and Tags

HUBBLE LEGACY ARCHIVE STSCI Where to get Tags for our Data Proposal data not enough (one target in a sea) Astronomers are few and busy –Its not “Browse or Perish”, “Publish or Perish”

HUBBLE LEGACY ARCHIVE STSCI What we did Use a “basic footprint” (aka cone search) with Simbad to identify objects within a given field –Not a true footprint as objects returned are all points Used Simbad to then get bibcodes for objects Used ADS to get keywords for each bibcode Harvested other data from HST proposal information (abstract, proposed targets…) Use Apache Lucene as our search engine Modified the Apache Lucene search demo

HUBBLE LEGACY ARCHIVE STSCI How well did this work 43% of the 2769 ACS WFC “visits” in the past 2 years  38% of “visits” are parallels (semi-random pointing) Average ~ 22 keywords per observation with keywords

HUBBLE LEGACY ARCHIVE STSCI DEMO

HUBBLE LEGACY ARCHIVE STSCI Where to go next Scientific input needed –Is More Like This useful or annoying scientifically more often than not? Can it be tweaked? Footprints and more Footprints –Intersection of observation footprints with object footprints improve tags (especially smaller fields) –Real time evaluation for cutouts and surveys (seconds not minutes) Standardize tags more –Case, spelling, removal of irrelevant words (e.g. “Galaxy Clusters General” -> “Galaxy Clusters”, “Colour” -> “Color”, “Charged Coupled Device” =>/dev/null)

HUBBLE LEGACY ARCHIVE STSCI AstroTube