Presentation is loading. Please wait.

Presentation is loading. Please wait.

Alexandria Digital Library

Similar presentations


Presentation on theme: "Alexandria Digital Library"— Presentation transcript:

1 Alexandria Digital Library
Thank you for becoming a Partner Davidson Library Alexandria Digital Library

2 Alexandria Digital Library
What Is Spatial Information? Botanical Survey Museum Artifacts Art about … Zoological Habitat Study Books about … Earth Science Data Davidson Library Alexandria Digital Library

3 Alexandria Digital Library
ADL Mission To provide a federated spatially searchable set of digital libraries of geographically referenced materials. The library's components may be distributed (spread across the Internet) or coexist within a single network or desktop. Geographically-referenced means that all the information objects in the library will be associated with one or more regions ("footprints") on the surface of the Earth. Davidson Library Alexandria Digital Library

4 Evolution of the Alexandria Digital Library
The model for libraries in the 1980's was a traditional monolithic centralized set of collections with access available only on site. ADL has been designed so users may find and access digital information from wherever they are. ADL's key features: Spatial information searching Users tools for browsing and data downloading Evolving architecture Intranet to a Peer-to-Peer distributed set of archives (coming) Federated distributed heterogeneous information network. Davidson Library Alexandria Digital Library

5 Alexandria Digital Library
The Pre ADL Library Model Centralized Information Model Some Z39.50 Bibliographic Catalog Connections No spatial searching No on-line access or browsing of data objects Pre ADL Centralized collections and access Textual searching interfaces 1980's Hmmm? Searching for Information Geographically GRIN project Davidson Library Alexandria Digital Library

6 Distributed Institutional Model Shared Centralized Database
From Intranet to Internet Distributed Institutional Model Shared Centralized Database ADL in the 90’s Bibliographic searching via Web connection Spatial searching Graphical Interfaces – map and coordinates On-line browsing of data objects Remote data access Data viewing Data downloading Institutional Networked Sharing 1990's Hmmm…. I wonder what information there is about this place? Davidson Library Alexandria Digital Library

7 Alexandria Digital Library
A Federation of Collections and Archives 2002 Forward Personal Collection Institutional Archive Company Government Archive ADL in the 2000’s Bibliographic searching over dispersed collections Spatial searching over heterogeneous data Smart Concept searching (ADEPT) Development and exposing of personalized collections Collection discovery services Peer-to-Peer application sharing Collection management Via query analysis Find me any information about hydrologic-cycles in the rain forests of central Asia? Davidson Library Alexandria Digital Library

8 Alexandria Digital Library
ADL Partners Davidson Library Alexandria Digital Library

9 Alexandria Digital Library
Operational Partners Implementers & Content Builders SERL – (Auckland University of Technology) DLESE – (Digital Library for Earth Systems Education) Software implementation and content builder CNR – (Center for National Research, Pisa Italy) BREN (Environmental Information Lab, UCSB) Content Builders ADEPT – Educational classroom content CASS – (Center for the Analysis of Sacred Sites) – Video, sound, imagery text ESSW – MODIS real-time spacecraft imagery Scripps – SIOExplorer Oceanographic Data Davidson Library Alexandria Digital Library

10 UCSB Node - Alexandria Digital Library
ADL operational: NSF funded Internet digital library ; UCSB funded Uses spatial methodology to organize & search for information Focused on information with geographical location Internet searching and data delivery Collection development tools for personal archives Current library holdings 2.8 million bibliographic records (currently) 5.8 million place names (currently) 7.3 terabytes of data on-line Davidson Library Alexandria Digital Library

11 Map and Imagery Laboratory (MIL)
Collections Ranked #1 spatial-data collection by US Association of Research Libraries – More than 5 million items held Services Expert reference and technical assistance Facilities State-of-the-art computational services Special digital services Alexandria Digital Library Alexandria World Gazetteer Davidson Library Alexandria Digital Library

12 Alexandria Digital Library
Operational ADL Searchable “Collections” ADL Catalog Collection of homogeneous datasets, Each “dataset” determines it’s own metadata and access - But at present, metadata is standardized Ingest is building a record for the spatial catalog, a full metadata record, and the access methods for the web server Time consuming Rights management crude Davidson Library Alexandria Digital Library

13 Ingest System Characteristics
Information agnostic Handle large homogeneous “information sets” Extract information from a variety of both raw data, and metadata Standardize forms Process that information into ADL spatial catalog record Full metadata record, if no metadata exists API’s for building and managing “collections” Information agnostic. It can be raw data, or metadata in a variety of formats Homogenous, same data format, or same metadata format Standardize forms, basically make sure the data type is the same, Correct errors or problems, standardize names, rewrite spelling errors, standardize terms (which is just rewrite) Davidson Library Alexandria Digital Library

14 Alexandria Digital Library
Activities in Ingest Building collection ingest API’s and tools to process large homogenous datasets Planning a web interface to allow content builders to expose catalog records Correcting errors and problems in the present database Planning ADL catalog interfaces on collections managed by non-ADL staff First and the last are the focus of the this talk. We will talk about how I think ADL and a data service like TimeMap will work together because it spells the philosophy behind the collection API. Davidson Library Alexandria Digital Library

15 Alexandria Digital Library
ADL Content Examples Series # DB Records / Data* Size (GB) Geodex map index 322,000 2 Landsat 1,500,000 100 DOQQ 15,000 600 DRG 1:24,000 3,000 66 DRG 1:100,000 250 6 NASA air photos 502,000 25 MIL air photos 45,000 2800 SPOT 200 DEM 1 ADL Gazetteer 4,800,000 AVHRR & MODIS N Remote Collection New Zealand ADL Starting Spring 2003 * Not all records point to data Davidson Library Alexandria Digital Library

16 Alexandria Digital Library
Current Goals Discovery services ( summer 2003 ) Adding federated collections Personal, distributed collections Ingest tools for automated coordinate generation and image-map registration Executable content Gazetteer integration with ADL catalog ADL in a box Davidson Library Alexandria Digital Library

17 Alexandria Digital Library
ADL Organization The ADL project has: An operational library run by the Map and Imagery Laboratory of the Davidson Library (ADL) A research component (ADEPT) funded by NSF and others, and A gazetteer (place name index and geocoder) also managed by the MIL in the UCSB Davidson Library Partner distributed content Davidson Library Alexandria Digital Library

18 Operational and Research Teams
Catherine Masi, David Valentine, Greg Hajic, Carolyn Jones, Mary Larsgaard and Larry Carver Research Terry Smith, Jim Frew, Mike Goodchild, Greg Janee, Dan Ancona and Linda Hill Oversight Smith, Goodchild, Frew, Carver and Janee Davidson Library Alexandria Digital Library

19 Middleware Development Plan
Current May 2003 Summer 2003 Fall 2003 MW1 bucket searching MW2 metadata views MW3 ingest support MW4 collection discovery Bucket and field-level searching over federated collections Collection formation via database mapping Ranking Access control Arbitrary views New standard ADL views: bucket, browse, access Metadata validation Query translation paradigm interface and library Management services: create/ replace/delete collections / items Integrated DB Universal schema Integrated collection statistics Collection registry, polling Discovery/ranking over collection statistics Euler histograms Collection hierarchies Integration with thesaurus servers Davidson Library Alexandria Digital Library

20 ADL Operational Technology
Geospatial searching Search buckets Collection discovery Metadata harvesting Search across heterogeneous collections Distributable software Tools for building/ingesting new collections Search interfaces Davidson Library Alexandria Digital Library

21 ADL Operational Technology
Geospatial searching Search buckets Collection discovery Metadata harvesting Search across heterogeneous collections Distributable software Tools for building/ingesting new collections Search interfaces Davidson Library Alexandria Digital Library

22 Alexandria Digital Library
Geospatial Searching Middleware Map server/browser Gazetteer Search indexes (buckets) Davidson Library Alexandria Digital Library

23 Alexandria Digital Library
ADL middleware Middleware Collection WebClient ADL employs a standard three-tier architecture in which clients connect to collections through a middleware server, which acts as a common access point and as a kind of broker. Davidson Library Alexandria Digital Library

24 Alexandria Digital Library
ADL middleware The middleware provides search and retrieval services services related to collection development indexes optimized for geospatial searching searching of all classes of information Davidson Library Alexandria Digital Library

25 ADL Middleware Details
C L I E N T web browser SDLIP proxy, other clients HTTP OR web intermediary/ XMLHTML converter HTTP HTTP transport RMI transport XML M I D L E W A R configuration file core functionality access control (service- and collection-level) query fan-out & results merging query result ranking result set caching access control mechanisms ranking methods client-side services (Java classes) server-side interface (Java interfaces) S E R V XML JDBC paradigm library generic DB driver query translator proxy driver RDBMS thesauri Z39.50 driver configuration files, Python scripts group driver Davidson Library Alexandria Digital Library

26 Alexandria Digital Library
ADL middleware ADL employs a standard three-tier architecture in which clients connect to collections through a middleware server, which acts as a common access point and as a kind of broker. More information: Davidson Library Alexandria Digital Library

27 ADL Map Server/Browser
ADL developed map servers/browsers: Davidson Library Alexandria Digital Library

28 ADL Map Server/Browser
What we are looking for in a map server/browser Omnipresent, stateful Navigation/selection tools Support for GIS functionality/layering Integration with Gazetteer (capture coords) Davidson Library Alexandria Digital Library

29 ADL Map Server/Browser
What we’ve looked at: TimeMap ArcIMS University of Minnesota Mapserver The Geography Network Davidson Library Alexandria Digital Library

30 ADL Map Server/Browser
ADL employs a standard three-tier architecture in which clients connect to collections through a middleware server, which acts as a common access point and as a kind of broker. If you have information: Davidson Library Alexandria Digital Library

31 Alexandria Digital Library
ADL Gazetteer Spatial dictionary of named and typed places 6.5 million entries Worldwide coverage Textual Geospatial Integration (TGI) project Goal is to translate textual references into searchable locations Davidson Library Alexandria Digital Library

32 Alexandria Digital Library
ADL Gazetteer Concept of a geographic place is fuzzy (e.g., Rocky Mountains) and we use place names differently according to the circumstances (e.g., using “Auckland” generally to mean the whole general area or specifically to mean just the incorporated city area. When locations are named, they can be in a gazetteer. A place can have more than one name: name variants, name in different languages, etc. In a geospatially referenced gazetteer, each entry has a “footprint” consisting of latitude and longitude coordinates. This footprint can be a point (most current gazetteer footprints are points); bounding boxes which are useful because they enclose the area but they often enclose too much extra area also; and generalized polygons which are more faithful to the actual shape but still simplified enough for information system query parameters. Each entry in a digital gazetteer must also be categorized according to a formal typing system (a controlled vocabulary of type terminology). With these three components, a digital gazetteer service can translate from names, footprints, and types to any of the other representations. E.g., “What schools are in this area?” “Where is Taupo?” “Show me on a map the location of the Moscow that is in the U.S.” Davidson Library Alexandria Digital Library

33 ADL Gazetteer and TGI Project
ADL employs a standard three-tier architecture in which clients connect to collections through a middleware server, which acts as a common access point and as a kind of broker. More information: Linda Hill, Davidson Library Alexandria Digital Library

34 ADL Operational Technology
Geospatial searching Search buckets Collection discovery Metadata harvesting Search across heterogeneous collections Distributable software Tools for building/ingesting new collections Search interfaces Davidson Library Alexandria Digital Library

35 Alexandria Digital Library
ADL Search Buckets Abstract, searchable indexes Similar to GILS, Dublin Core, GCMD but buckets define allowable content and search semantics, and are optimized for geospatial searching Designed to be easy for populating collections Provide uniform client services across all collections Davidson Library Alexandria Digital Library

36 Alexandria Digital Library
ADL Search Buckets Identifier Geographic location Coverage date Object type Format Originator Subject-related text Title Assigned term Keywords Davidson Library Alexandria Digital Library

37 Alexandria Digital Library
ADL Information retrieval via bucket search Davidson Library Alexandria Digital Library

38 Alexandria Digital Library
ADL Buckets ADL employs a standard three-tier architecture in which clients connect to collections through a middleware server, which acts as a common access point and as a kind of broker. More information: Davidson Library Alexandria Digital Library

39 ADL Operational Technology
Geospatial searching Search buckets Collection discovery Metadata harvesting Search across heterogeneous collections Distributable software Tools for building/ingesting new collections Search interfaces Davidson Library Alexandria Digital Library

40 ADL Collection Level Metadata
Facilitates: human cross-collection searching automated cross-collection searching collection discovery across multiple digital libraries Object Type cartographic works maps images photographs aerial photographs Count 324,876 2,014,799 484,083 Thus, It is Important to Users! Davidson Library Alexandria Digital Library

41 ADL Collection Level Metadata
Contextual Identity Scope and Purpose Originator, Creator Inherent Statistics Overall counts Counts by Type Counts by Format Spatial Coverage Temporal Coverage Search fields supported Harvest methods supported (e.g. NSDL/OAI) Object Type cartographic works maps images photographs aerial photographs Count 324,876 2,014,799 484,083 Davidson Library Alexandria Digital Library

42 ADL CLM – Compare spatial coverage
Davidson Library Alexandria Digital Library

43 ADL CLM– Compare temporal coverage
Davidson Library Alexandria Digital Library

44 Collection discovery service will allow cross-archive searching
Davidson Library Alexandria Digital Library

45 ADL Collection Level Metadata
ADL employs a standard three-tier architecture in which clients connect to collections through a middleware server, which acts as a common access point and as a kind of broker. More information: Davidson Library Alexandria Digital Library

46 ADL Operational Technology
Geospatial searching Search buckets Collection discovery Metadata harvesting Search across heterogeneous collections Distributable software Tools for building/ingesting new collections Search interfaces Davidson Library Alexandria Digital Library

47 Adding collections to ADL
Davidson Library Alexandria Digital Library

48 Adding collections to ADL
Davidson Library Alexandria Digital Library

49 Adding collections to ADL
Davidson Library Alexandria Digital Library

50 Adding collections to ADL
ADL employs a standard three-tier architecture in which clients connect to collections through a middleware server, which acts as a common access point and as a kind of broker. More information: Davidson Library Alexandria Digital Library

51 ADL Operational Technology
Geospatial searching Search buckets Collection discovery Metadata harvesting Search across heterogeneous collections Distributable software Tools for building/ingesting new collections Search interfaces Davidson Library Alexandria Digital Library

52 ADL User Interface - current
Davidson Library Alexandria Digital Library

53 ADL User Interface - Browse
Davidson Library Alexandria Digital Library

54 ADL User Interface - Browse
Davidson Library Alexandria Digital Library

55 ADL User Interface - Browse
Davidson Library Alexandria Digital Library

56 ADL User Interface – search
Query builder to formulate search Search at collection level / item level / combined Advanced search: utilize power of middleware query language Simple search: subset of advanced User guidance: placename/thesaurus vs free text, zero hits, etc. Davidson Library Alexandria Digital Library

57 ADL User Interface – current search
Davidson Library Alexandria Digital Library

58 ADL User Interface - Search
Davidson Library Alexandria Digital Library

59 ADL User Interface - Search
Davidson Library Alexandria Digital Library

60 ADL User Interface - Results
Davidson Library Alexandria Digital Library

61 Alexandria Digital Library
ADL User Interface - View Davidson Library Alexandria Digital Library

62 ADL User Interface - Build
Davidson Library Alexandria Digital Library

63 ADL User Interface - Build
Create personalized spaces in which collections can be Built Shared Organized Incorporated into curricula Davidson Library Alexandria Digital Library

64 ADL Operational Technology
Geospatial searching Search buckets Collection discovery Metadata harvesting Search across heterogeneous collections Distributable software Tools for building/ingesting new collections Search interfaces Davidson Library Alexandria Digital Library

65 Alexandria Digital Library
ADL Nodes Davidson Library Alexandria Digital Library

66 Alexandria Digital Library
ADL Nodes WebClient AUT Middleware Collection Davidson Library Alexandria Digital Library

67 Alexandria Digital Library
ADL Nodes UCSB Middleware Collection Scripps Middleware Bren EIL Middleware AUT Middleware Collection WebClient Davidson Library Alexandria Digital Library

68 Alexandria Digital Library
ADL Nodes Davidson Library Alexandria Digital Library

69 Alexandria Digital Library
ADL Nodes Davidson Library Alexandria Digital Library

70 Alexandria Digital Library
ADL nodes ADL employs a standard three-tier architecture in which clients connect to collections through a middleware server, which acts as a common access point and as a kind of broker. More information: Davidson Library Alexandria Digital Library

71 Alexandria Digital Library
Thank you for becoming a Partner Davidson Library Alexandria Digital Library


Download ppt "Alexandria Digital Library"

Similar presentations


Ads by Google