From Creation to Dissemination A Case Study in the Library of Congress’s use Open Source Software DLF Spring Forum Corey Keith

Slides:



Advertisements
Similar presentations
Enabling Technology Innovation using Open Source Software
Advertisements

WDL Technical Architecture Working Group (TAWG) June 2010 Achievements and Recommendations Co-chaired by Noha Adly, Bibliotheca Alexandrina Babak Hamidzadeh,
Carlo Tarantola Senior Director EMEA Mobile and Wireless Center of Expertise Warsaw, Poland Oracle Corporation.
Websydian Anne-Marie Arnvig Manager, Websydian Communications & Relations.
Websydian products.
XML-based Network Management Rob Enns
METS Creation in a production environment METS Opening Day Corey Keith
1 Actuate Corporation © 2010 THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE BIRT COMPANY THE.
Chapters 14 & 15 Internet Databases. E-Commerce  Bringing new products, services, or ideas to market, supporting and enhancing business operations 
ARCHIMÈDE Presented by Guy Teasdale Directeur, Services soutien et développement Bibliothèque de l’Université Laval CARL Workshop on Institutional Repositories.
Toolbox Mirror -Overview Effective Distributed Learning.
Wikis And Blogs Their role in the Corporate Environment Kevin Baker Nelson Brodyck Adam Garten-Zutter Darren Makovichuk Cameron Smith.
Developing the NSDL User Portal Dean Krafft, Cornell University
Software Developer Career. ◦ Desktop Program development ◦ Web Program Development ◦ Mobile Program Development.
PHP and MySQL Week#1  Course Plan.  Introduction to Dynamic Web Content.  Setting Up Development Server Eng. Mohamed Ahmed Black 1.
DSpace XML UI Project Texas A&M University Digital Initiatives, Research and Technology Scott Phillips, Cody Green, Alexey Maslov, Adam Mikeal, Brian Surratt,
Spring Roo CS476 Aleksey Bukin Peter Lew. What is Roo? Productivity tool Allows for easy creation of Enterprise Java applications Runs alongside existing.
Project Proposal: Academic Job Market and Application Tracker Website Project designed by: Cengiz Gunay Client: Cengiz Gunay Audience: PhD candidates and.
Alfresco – An Open Source Content Management System - Bindu Nayar, Bhavana Mohanraj.
M. Taimoor Khan * Java Server Pages (JSP) is a server-side programming technology that enables the creation of dynamic,
PREMIS Tools and Services Rebecca Guenther Network Development & MARC Standards Office, Library of Congress NDIIPP Partners Meeting July 21,
Joel Bapaga on Web Design Strategies Technologies Commercial Value.
1 Understanding Cocoon2 Pankaj Kumar May 24, 2001 Web Services Organization HPMD, SSO, HP.
TeraGrid Information Services John-Paul “JP” Navarro TeraGrid Grid Infrastructure Group “GIG” Area Co-Director for Software Integration and Information.
Oracle Application Express 3.0 Joel R. Kallman Software Development Manager.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
SDPL 2002Notes 7: Apache Cocoon1 7 XML Web Site Architecture Example: Apache Cocoon, a Web publishing architecture based on XML technology
CIS 375—Web App Dev II Microsoft’s.NET. 2 Introduction to.NET Steve Ballmer (January 2000): Steve Ballmer "Delivering an Internet-based platform of Next.
BZUPAGES.COM Presentation on Content Management System (CMS) Presented to. Sir Ahmad Kareem.
Web based METS creation Ralf Stockmann case study.
IUScholarWorks is a set of services to make the work of IU scholars freely available. Allows IU departments, institutes, centers and research units to.
Z-Geoinfo Inc. Capability Briefing June 21, 2011.
Revolutionizing enterprise web development Searching with Solr.
Overview of IU Digital Collections Search Hui Zhang Jon Dunn Indiana University Digital Library Program IU Digital Library Brown Bag October 19, 2011.
Introduction to PHP and MySQL Kirkwood Center for Continuing Education By Fred McClurg, © Copyright 2015, Fred McClurg, All Rights.
Web Server Programming 3. Server side includes. HTML+ (‘hole’) technologies Uses ‘HTML+”, HTML (or XML) extended with code. Very many different varieties.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
0 Apex as Your Integration Platform: Is It Possible? Copyright 2012 Fujitsu America, Inc.
Nate Trail Network Development & MARC Standards Office 8/1/2006 With help from Sydney Olive How to Build, Display and Find METS Objects.
CONTENT DISCOVERY, SERVICES, AND SUSTAINED ACCESS Timothy Cole, William Mischo, Beth Sandore, Sarah Shreeves ~ University of Illinois Library
METS Dissemination METS Opening Day Corey Keith
The Fedora Project April 28-29, 2003 CNI, Washington DC Thornton Staples University of Virginia Sandy Payette Cornell Information Science NOTE: CSG
INTRODUCTION TO WEB APPLICATION Chapter 1. In this chapter, you will learn about:  The evolution of the Internet  The beginning of the World Wide Web,
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
Apache Jakarta Project. What is Jakarta’s mission Jakarta is a Project of the Apache Software Foundation, charged with the creation and maintenance of.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
AxKit A member of the Apache XML project Ryan Maslyn Kyle Bechtel.
IBM Software Group ® Managing Reusable Assets Using Rational Suite Shimon Nir.
Metadata and Meta tag. What is metadata? What does metadata do? Metadata schemes What is meta tag? Meta tag example Table of Content.
© FPT SOFTWARE – TRAINING MATERIAL – Internal use 04e-BM/NS/HDCV/FSOFT v2/3 JSP Application Models.
Modern Programming Language. Web Container & Web Applications Web applications are server side applications The most essential requirement.
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
Cocoon An XML Web Publishing Framework From the Apache Project Roland Schweitzer.
Rendering Syndicated Library Content in an Institutional Portal: Integrating MyLibrary into uPortal John Fereira: Cornell University Eric Lease Morgan:
Institutional Repositories July 2007 DIGITAL CURATION creating, managing and preserving digital objects Dr D Peters DISA Digital Innovation South.
Basic web application development with Apache Cocoon Basic web application development with Apache Cocoon 2.1 Jasha Joachimsthal Jeroen.
By Jonathan Smith. Road Map Introduction Company Information Project Overview Java Web Design and Development Summary Relation to IUP Acknowledgments.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
Welcome to Open Source Technology An Overview of Software By Afroz Hippargi, CIT, YASHADA, Pune.
Apache Cocoon – XML Publishing Framework 데이터베이스 연구실 박사 1 학기 이 세영.
MarkLogic The Only Enterprise NoSQL Database Presented by: Aashi Rastogi ( ) Sanket Patel ( )
Lecture Transforming Data: Using Apache Xalan to apply XSLT transformations Marc Dumontier Blueprint Initiative Samuel Lunenfeld Research Institute.
Expertsfromindia for Joomla Development. Introduction Joomla is an open source and free content management system (CMS) for publishing content on the.
IT Governance And Why FOSS Matters? The Institution of Engineering and Technology Monthly Forum March 29th, 2007 Faculty of Engineering University of Peradeniya.
J2EE Platform Overview (Application Architecture)
Document & Web Content Management
Introduction, Features & Technology
PHP / MySQL Introduction
A Match Made In (Ethereal) Heaven
Presentation transcript:

From Creation to Dissemination A Case Study in the Library of Congress’s use Open Source Software DLF Spring Forum Corey Keith

Introduction  What is OSS  Why OSS  OSS used at LC  XML is a favorite tool

Purpose  Share information on available tools –Especially those outside the library community  Promote using OSS  Promote developing OSS

Open Source Initiative  Open source promotes software reliability and quality by supporting independent peer review and rapid evolution of source code.

What is OSS?  Software with source code attached  Access to underlying code  Derivative works –Add functionality –Enhance functionality –Remove functionality  Redistribution (Gotcha)

Why OSS?  Low experimentation cost –Technologies Rapid evolution Move to commercial solution if needed  Not profit driven community

What types of OSS?  Low Level –Programming Libraries –Databases –Languages –Operating Systems –Servers –Internet Infrastructure  High Level

Selecting OSS for use  Size –Active developers  Alive/Dead projects  Reliable Support  Sponsored Projects –Apache Foundation  Licenses

Benefits of OSS  Peer reviewed  Less defects –“Many eyes makes all bugs shallow”  Increased security

Standards  Promote standards through software development  Collaboration by development

Benefits for Software Producers  Development speed –Interesting projects attract developers –Less likely to switch once started  Lower overhead  Smaller institutions can handle larger projects  If you release OSS, hire from OSS community

Benefits for Software Consumers  No vendor lock in  License maintenance  Own vs rent

Costs for Producers  Supporting the project –Goal: Self sustaining  Preparing for open source distribution

Costs for Consumers  Finding technical support  Services –Free software –$$$ Services  Possible increased outsourcing costs

LC Case Study  I Hear America Singing  Digital Production –METS Making  Dissemination –METS Transformation  Project Management

Philosophies  Get data to XML as early as possible  Flexible manipulation of XML with XSLT  Aggregation of XML from variety of sources  Pipelined XSLT

Digital Production  Aggregate from different sources –MySQL DB Descriptive Data –Filesystem for Structure  Apache Cocoon

Apache Cocoon  XML Glue  XML Pipeline  XML Publishing Framework  Separation of content and style

Cocoon Pipeline  Generate XML –From files (or URLS) –From databases –From code (XSP, JSP, PHP) –From MARC21 records  Transform XML –XSLT  Serialize –XML, HTML, PDF, SVG, JPEG  Caching, Logging  Pass parameters to XSLT’s  Maps URL to Generate/Transform/Serialize  Command line interface

MySQL Filesystem Directory Generator ESQL XSP Generator DB to MODS XSLT Aggregator METS Making XSLT METS Digital Production

Dissemination  Apache – Web Server  Tomcat – Application Server  Cocoon – XML Transformation  Lucene – Searching/Indexing

METS File Generator Aggregator Render IHAS Page XSLT Header.xhtml File Generator Footer.xhtml File Generator Behavior XSLT (contactSheet.xsl) Dissemination

Apache Lucene  Text Indexing API  Flexible  Fast  Hierarchical  Cross-collection  Depends how you build indexes

METS File Generator METS to Index XSLT Index XML Index Builder (Java) Lucene Index Cocoon CLI Render Search XSLT XSP Generator Searches Lucene Cocoon

Project Management  CVS  Bugzilla  Ant

CVS  Version Control  Manage multiple document versions –Schemas –Documentation –Code –Libraries  Multiple developer projects

Bugzilla  Web-based defect tracking  Avoids flurry of s  Products – Components  Accountability  Reporting

Ant  Automated build tool  Packaging  Deployment

What you can do to support OSS  Demand (well documented) source code when outsourcing –Release this code  Release code your internal projects

Why this is hard  Clean-up code –Embarrassing  Documentation  Preparing for release  Fears of supporting the code  Hosting  Marketing  Generalizing for others use

Summary  OSS Software –Benefits –Costs  Uses at LC –I Hear America Singing  Questions