Choosing Technology That Can Evolve With User Needs VALA 2006 Melbourne, Australia February 2006 Sandy Payette Co-Director, Fedora Project Researcher,

Slides:



Advertisements
Similar presentations
DuraSpace: Digital Information All Ways, Always Pretoria, South Africa May 14 th, 2009.
Advertisements

An Introduction to Repositories Thornton Staples Director of Community Strategy and Alliances Director of the Fedora Project.
Repositories: Disruptive Technology or Disrupted Technology? Sandy Payette, Executive Director DORSDL Workshop at ECDL 2008 September 2008.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Funded by: © AHDS Sherpa DP – a Technical Architecture for a Disaggregated Preservation Service Mark Hedges Arts and Humanities Data Service King’s College.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
Planning for Flexible Integration via Service-Oriented Architecture (SOA) APSR Forum – The Well-Integrated Repository Sydney, Australia February 2006 Sandy.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Fedora Commons: Introduction and Update Swedish National Library June 24, 2008.
Building a Digital Library with Fedora International Conference on Developing Digital Institutional Repositories Hong Kong December 9, 2004.
UKOLN is supported by: A non-technical introduction to: OAI-ORE ( Defining Image Access project meeting.
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil Sandy Payette Cornell Information Science.
The Open Archives Initiative Simeon Warner (Cornell University) Symposium on “Scholarly Publishing and Archiving on the Web”, University.
The Open Archives Initiative Simeon Warner (Cornell University) Open Archives seminar “Facilitating Free and Efficient Scientific.
ÆKOS: A new paradigm for discovery and access to complex ecological data David Turner, Paul Chinnick, Andrew Graham, Matt Schneider, Craig Walker Logos.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Tutorial – Semantic Digital Libraries, May 9, 2007 WWW 2007 Copyright , DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University.
Fedora Commons Overview and Future Plans Sandy Payette, Executive Director Cornell University Library Metadata Working Group June 13, 2008.
Computer Science and Engineering 1 Service-Oriented Architecture Security 2.
1st Workshop on Intelligent and Knowledge oriented Technologies Universal Semantic Knowledge Middleware Marek Paralič,
Preserving Digital Collections for Future Scholarship Oya Y. Rieger Cornell University
Fedora Content Models for the National Science Digital Library Data Repository Fedora User’s Group Meeting Copenhagen, September 28, 2005 Carl Lagoze Cornell.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
Extensible Markup Language (XML) Extensible Markup Language (XML) is a simple, very flexible text format derived from SGML (ISO 8879).ISO 8879 XML is a.
Information Network Overlay Architecture Adding Value to Digital Content Carl Lagoze CS 431 – May 4, 2005 Cornell University.
Topic Rathachai Chawuthai Information Management CSIM / AIT Review Draft/Issued document 0.1.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
1 Advanced Software Architecture Muhammad Bilal Bashir PhD Scholar (Computer Science) Mohammad Ali Jinnah University.
Semantic Web Technologies Research Topics and Projects discussion Brief Readings Discussion Research Presentations.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Enabling the Future Service-Oriented Internet (EFSOI 2008) Supporting end-to-end resource virtualization for Web 2.0 applications using Service Oriented.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
Interoperability from the e-Science Perspective Yannis Ioannidis Univ. Of Athens and ATHENA Research Center
Introduction to the Semantic Web and Linked Data
Digital Library The networked collections of digital text, documents, images, sounds, scientific data, and software that are the core of today’s Internet.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
David Smiley SOA Technology Evangelist Software AG Lead, follow or get out of the way Here Comes SOA.
Millman—Nov 04—1 An Update on Digital Libraries David Millman Director of Research & Development Academic Information Systems Columbia University
DSpace - Digital Library Software
A centre of expertise in digital information management Shaping the e-future? Grids, Web Services and Digital Libraries Professor Tony.
Five fantastic Fedora Commons projects in five minutes, in no particular order Carol Minton Morris Communications Director National Science Digital Library,
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Carl Lagoze Digital Library Service Registry Workshop Services in a Scholarly Communication Framework.
The Mellon-Funded Fedora Project A Presentation to the European Digital Library Conference September 17, 2002 Sandy Payette and Thornton Staples.
Fedora Service Framework Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Fedora Commons Overview and Background Sandy Payette, Executive Director UK Fedora Training London January 22-23, 2009.
Software Architecture Patterns (3) Service Oriented & Web Oriented Architecture source: microsoft.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
The Fedora Project March 10, 2003
? What is Institutional Repository for Rutgers University
The Fedora Project March 19, 2003 ISTEC Symposium, Brazil
Joseph JaJa, Mike Smorul, and Sangchul Song
Overview: Fedora Architecture and Software Features
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
An Architecture for Complex Objects and their Relationships
VI-SEEM Data Repository
Analyzing and Securing Social Networks
PREMIS Tools and Services
NSDL Data Repository (NDR)
Fedora Filling the “Sweet Spot” in the Information Landscape
Metadata in Digital Preservation: Setting the Scene
Malte Dreyer – Matthias Razum
BUILDING A DIGITAL REPOSITORY FOR LEARNING RESOURCES
Institutional Repositories
Presentation transcript:

Choosing Technology That Can Evolve With User Needs VALA 2006 Melbourne, Australia February 2006 Sandy Payette Co-Director, Fedora Project Researcher, Cornell Information Science A service-oriented approach to e-research, e-scholarship, and advanced scholarly publication

Outline Connecting with Users What are the motivating contexts for libraries? –e-research and e-scholarship –Advanced digital libraries –Scholarly publication How do we position for the future? –Goals for the “new order” –Enabling Technologies Research and development at Cornell –Fedora Service Framework –National Science Digital Library –Pathways Moving Forward and Conclusions

Connecting with Users How are user needs evolving? Do we understand expectations of younger generation? Are we “hip?” Can we see current trends”? –Behavior –Technology Can we choose technology appropriately?

Upcoming Generation of Scholars Age 10 - play –Yahoo (music) –Google (bios, animals) –Neo pets (community) –Powerpoint (expression) Age 20 – social and study –Blogging –IM –Google –BitTorrent –Craigslist

How much should we worry now about choosing technology for evolving user needs? A Lot! Recent questions by member of audience: –“Do scholars really want new stuff or are we trying to hard to architect things that we think they want?” Let’s examine what’s already going on and position for the future…

Scholarly and Scientific Communities Documents  Integrated Information Networks

Early signs of change… Grid computing in sciences –Share computing resources –Share services and distributed virtual file systems –Examples Storage Resource Broker (SRB) Open Grid Services Infrastructure National Virtual Observatory ( Humanities computing –Hyperlinked historical documentary editions –New Forms of Digital Scholarship Rossetti archive ( Valley of the Shadow ( Perseus (

New Contexts - user and technical User Contexts Technical Contexts

First… User contexts

Key areas for connecting with users E-research E-scholarship Advanced digital library New models of scholarly communication

Technical contexts: understanding key trends…

Relevant Technology Trends Service-oriented architecture Web 2.0 Semantic Web SOA Web 2.0 RDF OWL-S OWL

Service-oriented architectures (SOA) Characteristics of services –Modular, atomic –Well-defined interfaces –Loosely coupled –Like building blocks –Standards for invoking operations (e.g., SOAP/REST, XML) Benefits –Flexibility –Enable creation of higher-level services –Enable customized end-user applications –Re-use services in different contexts –Evolution: create new services as needed –Orchestrate services to fulfill a process monolithic application

From S. Wilson, K. Blinco and D. Rehak Service Oriented Frameworks JISC/DEST Service Framework

Simple Example: Web Service using SOAP My Application SOAP/HTTP Google Web Service Request (XML) Response (XML) doSpellingSuggestion(payet) payette

Looking ahead…

Web 2.0

Implications of Web 2.0 Key themes –Services (not packaged apps) –Architecture of participation –Remix/transform data sources –Harness collective intelligence Emergent Behavior –Upcoming generations of scholars will have a completely different paradigm and expectations regarding technology –Collaborative classification (e.g., flickr) –Power of collective intelligence (amazon) –Alternative trust models (reputation – ebay; open-source)

Semantic Web Resource Description Framework (RDF) –data model for resources and relationships between them Ontologies –OWL to describe information resources –OWL-S to describe web services Rich, extensible description –no “fixed schema” –Relationships and graph-based models Knowledge inference –Equivalence –Transitivity A  B; B  C; A  C

Users and technology in action…

1.Creation and publication of new forms of “information units” 2.Services to better enable the processes of research and scholarship 3.Knowledge environments that captures semantic and factual relationships among information units 4.Promote information re-use and contextualization 5.Facilitate collaborative activity and capture information that is created as a byproduct of it Goals for enabling users in the “New Order”

Support the new “information unit” Documents Text Data Simulations Images Video Computations Automated Analyses Data

Key Projects at Cornell University Fedora Service Framework NSF Pathways (Cornell/LANL) National Science Digital Library (NSDL)

The Fedora Project Fedora –Flexible –Extensible –Digital –Object –Repository –Architecture History –Cornell Research (1997-) DARPA and NSF-funded research and reference implementation Distributed, Interoperable Repositories (experiments with CNRI) –Open Source Project (2002-present) Andrew W. Mellon Foundation funded Joint development by Cornell University and University of Virginia SOA RDF

Fedora Digital Objects Flexible object model can support –Documents, articles, journals –Electronic Scholarly Texts –Digital Images –Complex multimedia publications –Datasets –Metadata –Learning objects –More… Create “networks” of objects –Define object relationships and other properties via RDF –Collection/member; part/whole; etc.

Network of Digital Objects in a Fedora Repository

Fedora Service Framework ( )

eSciDoc (Max Planck Society and Fiz Karlsruhe)

Pathways Project SOA OWL-S A new system for scholarly communication OWL

Pathways – motivating context Decompose and distribute traditional steps in scholarly publishing value chain 1 –Registration – claim precedence for a scholarly finding. –Certification - establish validity of scholarly claim –Awareness - discover and access claims and findings –Archiving - preserves the scholarly record over time –Rewarding - based on metrics derived from that system Add new services to the mix –Workflow –Collaborative functions (e.g., annotation, re-use) –Data mining and analysis –Preservation monitoring and migration 1. Roosendaal and Geurts 1997

Pathways Vision: Interoperable Information Model Most things can be represented as a graph of nodes and arcs. Cornell University and Los Alamos Nat’l Lab

Service pathways (decomposed and distributed) Cornell University and Los Alamos Nat’l Lab

Pathways Challenges Phase 1 Current situation –Heterogeneous repository systems –Heterogeneous object models (or no object model) –Multiple protocols and service APIs –Services lacking formal interface definitions Can these ever play nicely together? Need common abstractions… –Ontology-based Information model –Ontology-based Service model

Core-1 Ontology: “Article” Example Cornell University and Los Alamos Nat’l Lab

Building Block: Repository Integration Cornell University and Los Alamos Nat’l Lab

NSDL – Core Integration SOA Advanced Digital Libraries: Beyond Search and Access Web 2.0 RDF

Information Flow in Traditional Library Knowledge In-BandOut-of-Band

Information Flow in the Digital Library In-BandOut-of-Band Knowledge Annotations Quality Ratings Relationships Reviews

NSDL Data Repository – How? Data as the asset –Structured core data model –Digital objects –Relationships –Augmented with unstructured and semi-structured Expose knowledge base via core service API NDR Technologies –Fedora repository – 2 million digital objects –Kowari RDF triplestore – 160 million triples –Services – both SOAP and REST

Conclusions: Practical steps, ongoing challenges…

Choosing technology to evolve with user needs… now Think in terms of flexible service frameworks Define fundamental services for libraries Repositories as web services Support for complex digital objects –Local and remote content –Mixed genre  documents, data, images, everything… –Dynamic views –XML expressions (esp. for ingest/export and migration) Model common entities with ontology-based metadata –Hope for interoperability via semantics –Relationships among objects are key

Ongoing Challenges Low barrier to entry –Simple protocols (e.g., like OAI) –Light-weight (REST vs. SOAP?) –Simple tools to create overlays Service matching (object-to-service) –Ontologies to expose objects with formats and semantics –OWL-S for semantic service description –Matching-making algorithms Security and Trust –Authentication and trust among repositories and services –Interoperability of authorization policy Preservation –Distributed and dynamic digital objects a challenging reality

Thank You! Questions and Comments…