FGDC and ASF Using Structured Metadata Archie Warnock A/WWW Enterprises

Slides:



Advertisements
Similar presentations
2008 EPA and Partners Metadata Training Program: 2008 CAP Project Geospatial Metadata: Intermediate Course Module 3: Metadata Catalogs and Geospatial One.
Advertisements

Getting Involved in OLAC Steven Bird University of Pennsylvania LREC Symposium: The Open Language Archives Community 29 May 2002.
Schedule of Releases (since Tromso meeting) and New Access Interfaces.
CYCLADES Kickoff 19/02/01 Gudrun Fischer, Norbert Fuhr University of Dortmund (Germany) CYCLADES Acess Service.
Comparison of BIDS ISI (Enhanced) with Web of Science Lisa Haddow.
Retrieval of Information from Distributed Databases By Ananth Anandhakrishnan.
Basic Searching Engineering Village. Agenda What is Engineering Village? Setting up a personal account Searching Engineering Village How to.
© Copyright 2012 STI INNSBRUCK Apache Lucene Ioan Toma based on slides from Aaron Bannert
Z39.50 and the Web ZIG July 2000 Poul Henrik Jørgensen, Danish Bibliographic Centre,
Geospatial One-Stop A Federal Gateway to Federal, State & Local Geographic Data
An Operational Metadata Framework For Searching, Indexing, and Retrieving Distributed GIServices on the Internet By Ming-Hsiang.
Enterprise Content Management Departmental Solutions Enterprisewide Document/Content Management at half the cost of competitive systems ImageSite is:
Chapter 2. Slide 1 CULTURAL SUBJECT GATEWAYS CULTURAL SUBJECT GATEWAYS Subject Gateways  Started as links of lists  Continued as Web directories  Culminated.
1 panFMP - Ein XML-basiertes Framework für Metadaten- Portale Vortrag und „hands-on“ Seminar am GFZ Potsdam Uwe Schindler MARUM – Universität Bremen PANGAEA.
Harvesting Metadata for Use by the geodata.gov Portal Doug Nebert FGDC Secretariat Geospatial One-Stop Team.
MCNC/CNIDR & A/WWW Enterprises Introduction to CNIDR’s Isite Jim Fullton - MCNC/CNIDR Archie Warnock - A/WWW Enterprises.
Information Retrieval in Practice
Extending the Capabilities of Geospatial One-Stop Through Partner-Developed Web-Services April 16, 2010 Federal Geographic Data Committee’s (FGDC) Cooperative.
December 9, 2002 Cheshire II at INEX -- Ray R. Larson Cheshire II at INEX: Using A Hybrid Logistic Regression and Boolean Model for XML Retrieval Ray R.
Planned Title: Review of Evaluation of Geospatial Search Allan Doyle.
Evolution of NBII Search-Based Technologies Oct 24, 2002 Donna Roy USGS Center for Biological Informatics.
What is the Internet? The Internet is a computer network connecting millions of computers all over the world It has no central control - works through.
2001 User Meeting OCLC SiteSearch Update Doug Loynes SiteSearch Product Manager.
Z39.50, XML & RDF Applications ZIG Tutorial January 2000 Poul Henrik Jørgensen, Danish Bibliographic Centre,
CEN/ISSS DC workshop, January The UK approach to subject gateways Rachel Heery UKOLN University of Bath UKOLN is.
A/WWW Enterprises1 Introduction to CNIDR’s Isearch Archie Warnock
Mining For Lost Treasure National Geospatial Data Clearinghouse Archibald Warnock U.S. Federal Geographic Data Committee A/WWW Enterprises.
Geospatial One Stop Modules Two and Three. Module 2 Inventory/Document existing Federal agency framework datasets and publish metadata to clearinghouse.
The GeoConnections Discovery Portal Michael Robson MacDonald Dettwiler and Associates Brian McLeod, Michael Adair Natural Resources Canada.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Why We Create Metadata and How it is Useful Bruce Godfrey University of Idaho Library INSIDE Idaho
WebWatch Ian Peacock UKOLN University of Bath Bath BA2 7AY UK
Project Overview Bibliographic merging, Endeca, and Web application.
Publishing Clearinghouse resources to geodata.gov Doug Nebert FGDC Secretariat Geospatial One-Stop Team September 17, 2004.
Testing and Improving Interoperability The Z39.50 Interoperability Testbed William E. Moen School of Library and Information Sciences Texas Center for.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
A/WWW Enterprises15 July 1996 Implementing Queries with HTTP A. Warnock A/WWW Enterprises
EO/GEO Team Response to Open GIS Consortium Catalog Interface RFP George Percivall February 1999.
Fisheries Oceanography Collaboration Software Donald Denbo NOAA/PMEL-UW/JISAO Presented by Nancy Soreide NOAA/PMEL AMS 2002/IIPS 10.3.
1 CS 502: Computing Methods for Digital Libraries Lecture 19 Interoperability Z39.50.
A/WWW Enterprises 28 Sept 1995 AstroBrowse: Survey of Current Technology A. Warnock A/WWW Enterprises
Z39 Server and Z39.50 Gateway. Z39 Configuration Z39.50 Server Bath Profile conformance has been added to the Z39 Server. Z39 server supports Structure.
ONE-2 Profile ZIG Tutorial 19 th January 2000 Poul Henrik Jørgensen, Danish Bibliographic Centre
Extending Access To Information Resource Discovery Service William E. Moen, Ph.D. Kathleen R. Murray, Ph.D. School of Library and Information Sciences.
The Future of Isite - Growing GILS Archie Warnock A/WWW Enterprises
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
Managed by UT-Battelle for the Department of Energy Mercury – Distributed Metadata Tool for Finding and Retrieving CDIAC Data CDIAC UWG Meeting September.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
Archibald Warnock FGDC Activities CIP/INFEO Interoperability and ISO CD2 Metadata Activities.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
A Resource Discovery Service for the Library of Texas Requirements, Architecture, and Interoperability Testing William E. Moen, Ph.D. Principal Investigator.
WEB SERVER SOFTWARE FEATURE SETS
National Coastal Data Development Center Status of OPeNDAP at NCDDC 11 September 2003 Susan Starke, Chief of IT Operations
U.S. Environmental Protection Agency Central Data Exchange Pilot Project Promoting Geospatial Data Exchange Between EPA and State Partners. April 25, 2007.
Coming Soon to a Computer Near You (maybe) MicroZGate A Light, Portable, and Configurable z39.50 Gateway John Ulmer NOAA Coastal Services Center.
No Longer Under Our Control? The Nature and Role of Standards in the 21 st Century Library William E. Moen School of Library and Information Sciences Texas.
Don’t Duck Metadata March 2005 Introducing Setting Up a Clearinghouse Node Topic: Introduction to Setting Up a Clearinghouse Node Objective: By.
A/WWW Enterprises 15 July 1996 Implementing Queries with Z39.50 A. Warnock A/WWW Enterprises
National Geospatial Enterprise Architecture N S D I National Spatial Data Infrastructure An Architectural Process Overview Presented by Eliot Christian.
CWIC Open Search Best Practices Doug Newman (NASA ECHO) CEOS WGISS-37 April 15th 2014 Presenter: Archie Warnock (A/WWW Enterprises)
Interoperability and Standards for Bibliographic Applications Poul Henrik Jørgensen Danish Library Centre Telematics for.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
Repository for Archiving, Managing and Accessing Diverse DAta Thiru.
Alexandria Digital Library The ADL Testbed Greg Janée
1 Presented to Query Language '98 December 4, 1998 by Eliot Christian U.S. Geological Survey XML Encoding Rules (XER)
Alexandria Digital Library ADL Metadata Architecture Greg Janée.
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Building Search Systems for Digital Library Collections
ORNL is Operated by UT-Battelle for DOE
Archibald Warnock A/WWW Enterprises
Presentation transcript:

FGDC and ASF Using Structured Metadata Archie Warnock A/WWW Enterprises

FGDC Project n FGDC Project is based at USGS n Cooperative effort of USGS, A/WWW, Blue Angel Technologies n Utilizing structured metadata to locate geospatial data through Z39.50 n Based on FGDC Metadata Standard n Centralized search gateway, distributed sites

GILS and the Advanced Search Facility n ASF is a US Dept. of Commerce project, built by Pilot Research Associates, A/WWW Enterprises and collaborators n Information Communities - networks of cooperative, low-impact, distributed nodes n The basic interchange will be structured GILS metadata n Search on full text and GILS metadata

FGDC Reference Implementation n FGDC Node Software Iindex, Isearch, Iutil - the search engine zclient, izclient, zping, zbatch - the Z39.50 clients zserver, zserverNT - the Z39.50 servers n zcon & zgate - the WWW-to-Z39.50 gateway (not supported by FGDC)

ASF Reference Implementation n Isearch - basic search engine n Yaz - Z39.50 toolkit n htDig - URL harvester n ZAP - Web-to-Z39.50 gateway n Custom APIs and Components Search API xyz, the Z39.50 server ids, the internode data communications

GILS, Dublin Core and Others n Dublin Core is a minimal (15 fields) generic metadata scheme for virtually any kind of document n GILS represents a more detailed approach, including most of DC, providing greater interoperability n GILS is less bibliographically oriented than (Z39.50) BIB-1 n GILS is lightweight compared to GEO (FGDC) and EOS/CIP (which have specific functional requirements)

What Structured Metadata Means -1 n GILS - Fewer fields More documents More metadata records Skinnier metadata records Easier abstraction n FGDC - More fields Fewer documents Fewer metadata records Fatter metadata records Less abstraction  GILS is a good, general compromise

What Structured Metadata Means - 2 n A Z39.50 profile as defines a language At some level, Z39.50 is a detail Protocols are about communication, profiles are about abstraction and GILS is about content Z39.50 guarantees that the user’s query can be unambiguously decoded - no guarantees about content We could implement the profile over any protocol - http, CORBA, etc. Does we have to use Z39.50? No, but the abstraction is required Z39.50 already includes the abstraction model

Related Documents n Tools ftp://ftp.cnidr.org/pub/software/Isite ftp://ftp.clark.net/pub/warnock/Software n A/WWW Enterprises

Isearch Features  Full text search  Search on text fields  Search on numeric fields with appropriate relations (>, <, =)  Search on date fields with appropriate relations (before, during, after)  Search on geospatial bounding box  Boolean searches  Phrase searching  Right truncation  Proximity searching (within N characters)  Case insensitive searching, punctuation ignored  Configurable stopword list  Customizable results presentation  Relevance ranked scores  Term weighting

Isearch Document Types n ASCII text n SGML tagged fields HTML GILS (XML) templates FGDC templates n Colon delimited fields GCMD DIF templates n USMARC records n IAFA templates n SOIF templates n First line in file n Filenames n folders n Usenet news archives whois++ templates n Multi-file documents n US patents n BIBTeX n Medline