GeoXwalk:- Developing a Gazetteer Service and Server for UK Academia James S Reid Project Manager, Geo-data Services, EDINA AGILE Conference 2003 Lyon.

Presentation transcript:

geoXwalk:- Developing a Gazetteer Service and Server for UK Academia James S Reid Project Manager, Geo-data Services, EDINA AGILE Conference 2003 Lyon

Context - EDINA
a JISC National Data Centre
–hosted by Edinburgh University Data Library
mission... to enhance productivity of research, learning and teaching in UK higher & further education
major provider within the JISC Information Environment
–range of bibliographic resources
–multimedia and image services
–key geo-spatial data and geo-referenced information
UKBORDERS ( ) boundary outlines & geo-reference database
Digimap (2000 -) online source of Ordnance Survey mapping
–development projects - geoXwalk, Go-Geo!, e-MapScholar, Pathfinder...
strategic move toward interoperability & shared services role
–adoption of appropriate standards (OGC, ISO)

Context - The JISC Information Environment is…
variously stated as …
–a national digital library... for UK higher and further education
–a managed collection of quality assured resources
–a distributed resource supporting learning and research in the UK
definitely heterogeneous
–‘words, numbers, pictures, sound’: including geo-spatial data
for use by researchers, students, teachers & support staff
based on an underlying functional model
–simplified to: search -> obtain -> use -> publish
–{discover/locate} {request/access} {view/copy/amend/combine} {publish}
now to have location-based searching
–requiring geo-referencing of information objects

Definitions
Gazetteer - A list of geographic features together with their associated spatial location
Digital Gazetteer - An electronic list of geographic features together with their associated spatial location
–An authority database of places (and features?)
–An ‘Active Gazetteer’
Digital Gazetteer Service - A network-addressable middleware server supporting geographic referencing and searching. A shared ‘terminology’ service.

The problem
How to search ‘geographically’? Given that:
–e.g. a postcode, a placename and an administrative area are all valid geographies
–and yet no single information system can know about all the possible variations of what constitutes a ‘geography’!
Problem compounded by inconsistency of use even in the ‘standards’
–e.g. placenames evolve, have alternative names
Long history in UK of boundary changes and changes in the geographies used to record things
–e.g. electoral ward boundary changes …

There is underlying complexity, such as Multiple Geographies …

The vision
How?
A digital gazetteer that stores the different geographies and can implicitly resolve the relationships between them
Provision as a service to serve other services
Make variations in the definition of ‘geography’ transparent
Provide a means to ‘crosswalk’ geographies, i.e. translate one geography into another - hence the name
‘Geographic agnosticism’

Results of scoping study (Phase I)
Great deal of interest both inside and outside academia in the concept of a digital gazetteer
Such a gazetteer would act as an important reference source
The gazetteer could also support machine to machine (m2m) interactions based on open protocols, making it capable of becoming a ‘shared service’
A suitably extensible model for the gazetteer was identified in the Alexandria Digital Library (ADL) model
A prototype demonstrator gazetteer should be developed based on the ADL model

Phase II - Project Aims
To develop a ‘proof-of-concept’ geo-spatial gazetteer service suitable for extension to full service and illustrating:
–The use of a gazetteer to enhance the geographic searching of one or more existing JISC services
–The use of a gazetteer to assist in the semi-automatic geographic indexing of descriptions of JISC resources
–Reference use through the provision of a command driven web-based interface, to show the types of queries that could be answered by a well-populated service
To consider how the gazetteer data could be made available as a shared service as part of the JISC Information Environment
Promote the possibilities of a fully functioning service

geoXwalk - High Level Architecture
Web client (human interaction) -> request via protocol (ADL, OGC, Z39.50) -> the geoXwalk Server
Information server (machine2machine interaction) -> request via protocol (ADL, OGC, Z39.50) -> the geoXwalk Server
The geoXwalk Server (spatially enabled RDBMS)

ADL Gazetteer Content Standard
Elements:
–Geographic Feature ID
–Geographic Name
–Variant Geographic Name (R)
–Type of Geographic Feature (R)
–Other Classification Terms (R)
–Geographic Feature Code (R)
–Spatial Location (R)
–Street Address
–Related Feature (R)
–Description
–Geographic Feature Data (R)
–Link to Related Source of Information (R)
–Supplemental Note
–Metadata Information
comprehensive description but with small set of core elements
temporal aspects of names, footprints, relationships, …
document source, spatial accuracy/scale of footprint
does permit explicit relationship types!
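
As a rough illustration of how a few of the elements above might map onto a record, here is a minimal Python sketch. It is not the geoXwalk schema: the class name, field names and choice of fields are assumptions, as is the reading of (R) as "repeatable" (modelled as lists). The footprint values are made up; the two variant names are well-known historical names for York.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

# Hypothetical footprint representation: (min_x, min_y, max_x, max_y)
BoundingBox = Tuple[float, float, float, float]


@dataclass
class GazetteerEntry:
    """Illustrative subset of the ADL Gazetteer Content Standard elements."""
    feature_id: str                      # Geographic Feature ID
    name: str                            # Geographic Name
    feature_types: List[str]             # Type of Geographic Feature (R)
    footprints: List[BoundingBox]        # Spatial Location (R)
    variant_names: List[str] = field(default_factory=list)  # Variant Geographic Name (R)
    description: Optional[str] = None    # Description

    def all_names(self) -> List[str]:
        """Preferred name followed by any variant names."""
        return [self.name] + self.variant_names


# Example record; the identifier and coordinates are invented for illustration.
york = GazetteerEntry(
    feature_id="example:1",
    name="York",
    feature_types=["populated places"],
    footprints=[(0.0, 0.0, 10.0, 10.0)],
    variant_names=["Eboracum", "Jorvik"],
)
```

Holding variant names on the entry is what lets a reference query such as "By what alternative names has York been known?" be answered from a single record.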

Gazetteer Database
Built on ADL Content Standard
Currently seeded with:
–OS 1:50,000 digital Gazetteer
–digital boundary data from UKBORDERS
–data sourced from other OS products - Strategi, Meridian, 1:250,000 gazetteer
–starting to add 3rd party data including Getty
Accuracy enhanced and metadata support
Current coverage:
–Geographical - GB
–Thematic - see below

geoXwalk gazetteer - current thematic content (based on adapted ADL Feature Type Thesaurus )

Protocols
ADL Query Protocol
–lightweight, generic, relatively simple to implement
OGC Filter Encoding Specification
–fuller, highly flexible, more complex
Z39.50
–pervasive in JISC IE, not specifically for geo-spatial data, lack of support

5 types of ADL query:
identifier-query identifier
–Returns the gazetteer entry identified by identifier
–Supported by geoXwalk
name-query operator text
–Returns gazetteer entries which match text under text-operator operator
–geoXwalk supports the mandatory equals operator and the optional match-pattern operator
footprint-query operator (polygon | box | identifier)
–Returns all gazetteer entries having a footprint that matches a query region according to spatial operator operator
–geoXwalk supports spatial operators within, contains and overlaps
–Spatial extents can be specified by bounding box or identifier
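
The three query types above can be sketched in a few lines of Python. This is only an in-memory illustration, not the ADL protocol or the geoXwalk implementation: the function names, the entry store, the coordinates, the case-insensitive matching and the shell-style wildcard syntax used for match-pattern are all assumptions. Footprints are simplified to bounding boxes, with "within" read as "entry footprint lies inside the query region" and "contains" as the reverse.

```python
import fnmatch
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]  # (min_x, min_y, max_x, max_y)

# Hypothetical entries; names are real places but coordinates are invented.
ENTRIES: Dict[str, dict] = {
    "e1": {"name": "Falkirk", "box": (2.0, 2.0, 4.0, 4.0)},
    "e2": {"name": "Selkirk", "box": (6.0, 1.0, 7.0, 2.0)},
    "e3": {"name": "Example Region", "box": (0.0, 0.0, 5.0, 5.0)},
}


def identifier_query(identifier: str) -> dict:
    """Return the single entry with the given identifier."""
    return ENTRIES[identifier]


def name_query(operator: str, text: str) -> List[str]:
    """Return ids of entries whose name matches text under the operator."""
    if operator == "equals":
        matches = lambda name: name.lower() == text.lower()
    elif operator == "match-pattern":
        # Assumed pattern syntax: shell-style wildcards (* and ?).
        matches = lambda name: fnmatch.fnmatchcase(name.lower(), text.lower())
    else:
        raise ValueError(f"unsupported text operator: {operator}")
    return sorted(i for i, e in ENTRIES.items() if matches(e["name"]))


def _inside(inner: Box, outer: Box) -> bool:
    """True if inner lies entirely inside outer (edges may touch)."""
    return (outer[0] <= inner[0] and outer[1] <= inner[1]
            and inner[2] <= outer[2] and inner[3] <= outer[3])


def _intersects(a: Box, b: Box) -> bool:
    """True if the two boxes share at least one point."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def footprint_query(operator: str, region: Box) -> List[str]:
    """Return ids of entries whose footprint matches the query region."""
    tests = {
        "within": lambda fp: _inside(fp, region),
        "contains": lambda fp: _inside(region, fp),
        "overlaps": lambda fp: _intersects(fp, region),
    }
    if operator not in tests:
        raise ValueError(f"unsupported spatial operator: {operator}")
    return sorted(i for i, e in ENTRIES.items() if tests[operator](e["box"]))
```

With this store, `name_query("match-pattern", "*kirk")` answers the reference question "List me all places ending with ‘kirk’".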

ADL queries (contd)
class-query thesaurus term
–Returns all gazetteer entries which belong to the class (feature type)
–geoXwalk supports class queries (but currently does not return sub-classes by default as the ADL does)
relationship-query relationship identifier
–Returns all gazetteer entries having relationship relationship to a target gazetteer entry identified by identifier
–geoXwalk does not support queries of this type because we do not hold explicit relationships between entities - they are derived implicitly from the geometries
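
A minimal Python sketch of the two points above: a class query with and without sub-class expansion, and a containment relationship derived from geometry rather than stored explicitly. The thesaurus fragment, entries, coordinates and function names are all hypothetical, and footprints are reduced to bounding boxes; it is not the geoXwalk implementation.

```python
from typing import Dict, List, Set, Tuple

Box = Tuple[float, float, float, float]  # (min_x, min_y, max_x, max_y)

# Hypothetical fragment of a feature type thesaurus: term -> narrower terms.
NARROWER: Dict[str, List[str]] = {
    "hydrographic features": ["lakes", "streams"],
    "streams": ["rivers"],
}

# Hypothetical entries with invented coordinates.
ENTRIES: List[dict] = [
    {"id": "e1", "type": "rivers", "box": (3.0, 3.0, 4.0, 4.0)},
    {"id": "e2", "type": "lakes", "box": (7.0, 7.0, 8.0, 8.0)},
    {"id": "e3", "type": "administrative areas", "box": (0.0, 0.0, 5.0, 5.0)},
]


def expand(term: str) -> Set[str]:
    """Return the term together with all of its narrower terms."""
    terms = {term}
    for child in NARROWER.get(term, []):
        terms |= expand(child)
    return terms


def class_query(term: str, include_subclasses: bool = False) -> List[str]:
    """Return ids of entries in the class; sub-classes only on request."""
    wanted = expand(term) if include_subclasses else {term}
    return sorted(e["id"] for e in ENTRIES if e["type"] in wanted)


def derived_within(target_id: str) -> List[str]:
    """Derive 'lies within' from geometry instead of a stored relationship."""
    target = next(e for e in ENTRIES if e["id"] == target_id)
    t = target["box"]
    return sorted(
        e["id"] for e in ENTRIES
        if e["id"] != target_id
        and t[0] <= e["box"][0] and t[1] <= e["box"][1]
        and e["box"][2] <= t[2] and e["box"][3] <= t[3]
    )
```

Here `class_query("hydrographic features")` returns nothing until `include_subclasses=True` is passed, which mirrors the default behaviour described on the slide.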

geoXwalk use cases
[Diagram labels: The geoXwalk Server; Reference use; Information server; Searching (1); Searching (2); Geo-parsing & indexing]
e.g.
–Where is Aberdour?
–On what river is Dundee situated?
–By what alternative names has York been known?
–List me all places ending with ‘kirk’

Contact details
For EDINA services contact: EDINA, Data Library, University of Edinburgh
or telephone +44 (0)
For information on geoXwalk project:

Task: Find resources about ‘Liverpool docks’
Search using a ‘traditional’ gazetteer might yield:
Using spatial proximity in an active gazetteer, the search can be widened:

Place          County/UA
Liverpool
Bebington      Wirral
Birkenhead     Wirral
Bootle         Sefton
New Brighton   Wirral
Seacombe       Wirral
Seaforth       Sefton
Waterloo       Sefton

… that means more & better hits …. !!!
co-ordinates allow (near) co-located places to be co-identified.
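
The idea of widening a search by spatial proximity can be sketched as follows. This is only an illustration under stated assumptions: the coordinates are invented planar values (not real grid references), "Farplace" is a made-up control entry, and the function names and radius are hypothetical.

```python
import math
from typing import Dict, List, Tuple

# Invented planar coordinates in arbitrary units; NOT real locations.
PLACES: Dict[str, Tuple[float, float]] = {
    "Liverpool": (0.0, 0.0),
    "Birkenhead": (-2.0, -1.0),
    "Bootle": (0.0, 4.0),
    "Seaforth": (0.0, 6.0),
    "Farplace": (40.0, 40.0),  # hypothetical distant place
}


def nearby(place: str, radius: float) -> List[str]:
    """Return other places within radius of place, nearest first."""
    x0, y0 = PLACES[place]
    found = []
    for name, (x, y) in PLACES.items():
        if name == place:
            continue
        distance = math.hypot(x - x0, y - y0)
        if distance <= radius:
            found.append((distance, name))
    return [name for _, name in sorted(found)]


def widen_search(place: str, radius: float) -> List[str]:
    """Search terms for place plus its (near) co-located neighbours."""
    return [place] + nearby(place, radius)
```

A query for resources about Liverpool could then be run against every name returned by `widen_search("Liverpool", 5.0)` rather than against "Liverpool" alone.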

geoXwalk use case: simple cross searching
Portal service: ‘Find resources for this postcode’
Post code: L34 0HS?
(NB postcode often used to geo-reference survey data files)
geoXwalk Server: Knowsley , , BX003
Content Providers A, B and C
Geographies used by the content providers: coordinate footprints, parish names, place names
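
A minimal Python sketch of this cross-searching flow: the portal hands a postcode to a crosswalk lookup, then queries each content provider in the geography that provider understands. Everything here is a stand-in: the lookup table, the parish value, the footprint, the provider contents and the pairing of providers with geographies are hypothetical, and a real service would resolve the postcode from gazetteer geometries rather than a hard-coded table.

```python
from typing import Callable, Dict, List, Tuple

Box = Tuple[float, float, float, float]  # (min_x, min_y, max_x, max_y)

# Hypothetical crosswalk result for one postcode; values are placeholders.
CROSSWALK: Dict[str, dict] = {
    "L34 0HS": {
        "place name": "Knowsley",
        "parish name": "EXAMPLE-PARISH",
        "footprint": (0.0, 0.0, 2.0, 2.0),
    },
}

# Hypothetical provider holdings, each indexed by a different geography.
BY_PLACE = {"Knowsley": ["resource-A1"]}
BY_PARISH = {"EXAMPLE-PARISH": ["resource-B1", "resource-B2"]}
BY_POINT = [("resource-C1", (1.0, 1.0)), ("resource-C2", (9.0, 9.0))]


def search_points(box: Box) -> List[str]:
    """Return resources whose point location falls inside the box."""
    return [r for r, (x, y) in BY_POINT
            if box[0] <= x <= box[2] and box[1] <= y <= box[3]]


# provider name -> (geography it understands, search function)
PROVIDERS: Dict[str, Tuple[str, Callable]] = {
    "Content Provider A": ("place name", lambda v: BY_PLACE.get(v, [])),
    "Content Provider B": ("parish name", lambda v: BY_PARISH.get(v, [])),
    "Content Provider C": ("footprint", search_points),
}


def cross_search(postcode: str) -> Dict[str, List[str]]:
    """Translate the postcode once, then query every provider in its own geography."""
    translated = CROSSWALK[postcode]
    return {name: search(translated[geography])
            for name, (geography, search) in PROVIDERS.items()}
```

The portal never needs to know which geography each provider uses; that knowledge sits with the crosswalk step.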

geoXwalk use case: (semi) automatic indexing
[Need screen shot of parser here]

Objectives (1)
Elicit the detailed requirements for a gazetteer service
Involve organisations outside UK academia in the development of a gazetteer service demonstrator
Build a demonstrator focussing on near-contemporary data which should illustrate the following:
–The use of a gazetteer to enhance the geographic searching of one or more existing JISC services
–The use of a gazetteer to assist in the semi-automatic geographic indexing of descriptions of JISC resources
–Reference use through the provision of a command driven web-based interface, to show the types of queries that could be answered by a well-populated service

Objectives (2)
Investigate:
–The issues involved in making the gazetteer a Z39.50 target
–SOAP (web services) as an access mechanism
–The utility and usability of the ADL Gazetteer Content Standard
–Questions about performance and scalability of the service
–The level of interest and commitment of interested parties outside tertiary education
–The costs involved in populating the gazetteer, linking the data and quality assuring the data
Negotiate with data owners to use the key core datasets required to populate the gazetteer
Suggest ways in which data can be kept up to date, and what kind of quality assurance on data input will be required
Carry out focus groups to assess the needs of the stakeholders for a full gazetteer service and promote the possibilities of a fully functioning service

Deliverables
A functioning scalable demonstrator gazetteer service that has the potential to be fully integrated into the JISC Information Environment
A report on who the relevant stakeholders are and how the needs of the user group will be met
(An exit strategy - Phase III)

Query: Archaeological sites within the city of York?