Progress in Access Technologies: NLM Video Search Jennifer Marill Chief, Technical Services Division Edward Luczak Systems Architect, Office of Computer.

Slides:



Advertisements
Similar presentations
Support.ebsco.com Small Business Reference Center Tutorial.
Advertisements

Permanent Hosting, Archiving and Indexing of Digital Resources and Assets Raman Ganguly Computer Center University of Vienna.
E-Content Service Group Virtual Meeting Digital Preservation: How to Get Started.
The Advanced, Enterprise Publishing Environment for Cross-media Output to Print & Web.
Samsung Digital Signage
Office 2010 Software Applications in Office 2010 & Files.
Techniques for Creating Accessible, Closed Captioned Web-Based Video California State University - Northridge 22nd Annual International Technology and.
Concepts & Techniques for Accessible, Closed Captioned Web-Based Video 10th Annual Accessing Higher Ground: Accessible Media, Web and Technology Conference.
Vital Implementation Update Vital Implementation Update 11 th January 2006 Paul Bevan – Glen Robson –
Millions of people have been blocked from entering the digital world… Millions of people have been blocked from entering the digital world…
Goals for RUcore o Flexible, extensible cyberinfrastructure for Rutgers University o Integrating platform for legacy information systems o Support preservation.
Gonghua Liu, Ph.D. Instructional Designer & Technologist Woodruff Health Sciences Center Library, Emory University.
V | © OverDrive, Inc | Page 1 Browse, Check Out, Download! Learn how to browse, check out, and download digital titles from [YOUR LIBRARY]
DIGITIZATION OF LOCAL HISTORY COLLECTIONS IN PUBLIC LIBRARY “VLADISLAV PETKOVIC DIS” IN CHACHAK: DIGITIZATION OF THE NEWSPAPER “THE VOICE OF CHACHAK” Bogdan.
JSTOR User Services l February 2009 Using the JSTOR Interface User Services, February 2009.
Tara Guthrie, 2012 Types of Resources: Electronic.
R.Jantz, August 31, Two-day forum on PREMIS Preservation Metadata and the Trusted Digital Repositories August 31, September 1 National Library of.
CONTENT: A model for collaborative database building Trevor Bond Alan Cornish Washington State University Libraries.
WMS: Democratizing Data
Greenstone Digital Library Usage and Implementation By: Paul Raymond A. Afroilan Network Applications Team Preginet, ASTI-DOST.
Presented by Mina Haratiannezhadi 1.  publishing, editing and modifying content  maintenance  central interface  manage workflows 2.
Technology Bootcamp January 18, 2014 Large-Scale Digital Libraries Digitization Process Krystyna K. Matusiak, Ph.D. Assistant Professor Library & Information.
A Digital Preservation Repository for Duke University Libraries Jim Coble Digital Repository Developer Open Repositories 2013.
® Copyright 2010 Adobe Systems Incorporated. All rights reserved. ADOBE® ACCESSIBILITY Video Accessibility in Adobe Flash Andrew Kirkpatrick Adobe Systems.
A New Jersey Statewide Video Portal Based on Open Source Technologies Isaiah Beard Digital Standards & Workflow Manager - SCC Repository Architects: Ron.
ETD Repositories Using DSpace Software Andrew Penman The Robert Gordon University 27 th September 2004.
DuraCloud A service provided by Sandy Payette and Michele Kimpton.
SobekCM’s Community Ecosystems & Socio-Technical Practices Presented by Mark V. Sullivan June 10 th, 2014 Sobek image created by Jeff Dahl and is shared.
Adventures in Digital Asset Management: Fedora at the National Library of Wales Glen Robson National Library of Wales
Web-based workflow software to support book digitization and dissemination The Mounting Books project books.northwestern.edu Open Repositories 2009 Meeting,
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
“Old Style” Libraries, Digital Libraries: Convergences, Divergences, And the Troubles in Between.
Erin Kinney, Wyoming State Library. Motivation #1 priority that came out of 2004 statewide digitization meeting WSL received many reference questions,
NLM Digital Repository Server Architecture January 18, 2011.
Multimedia Digital Library Marcia Johnson. Collection 25 text documents 25 text documents In HTML, PDF, TXT formats (source: Project Gutenberg) In HTML,
Web based METS creation Ralf Stockmann case study.
Project Builder and MediaMatrix: Redefining Access in the Digital Age Dean Rehberger and Michael Fegan MERLOT August 7-10, 2006 New Orleans, LA.
NLM Digital Collections Update for DCFedoraUsersGroup January 22, 2013 John Doyle National Library of Medicine.
1 Helping communities access and explore their newspaper heritage. Rose Holley – Manager Newspaper Digitisation Program
TECHNOLOGY SUPPORT FOR ESSSS Progress, Issues, and Challenges Marshall Breeding Director for Innovative Technology and Research Vanderbilt University Library.
An Iterative Approach to Building Sustainable Repository Services on Fedora Open Repositories 2009, May 19, 2009.
GPO’s Federal Digital System August 17, 2010 U.S. Government Printing Office.
1 Using Digital Technologies to unlock history for researchers. Rose Holley – Manager Newspaper Digitisation Program Australian Academy of the Humanities.
Digital Archives at the National Library of Medicine A presentation at the MLA Session Lighting the Path: Digital Repositories in the Real World May 24,
EVIA Digital Archive New Tools William G. Cowan Mike Durbin Digital Library Program EVIA Digital Archive DLP Brown Bag 20 September 2006.
Presented By: By: By: Web Address: Topic Number: Topic Number: Date: Date:

Digitization Programmes National Library of the Czech Republic Adolf Knoll
The Mint Mapping tool The MoRe aggregator Vassilis Tzouvaras, Dimitris Gavrilis National Technical University of Athens Digital Curation Unit - IMIS, Athena.
Collecting History: Profiles in Science Alexa T. McCray National Library of Medicine Bethesda, MD Stanford University August 21, 1999.
A Multi-Tiered Architecture for Distributed Data Collection and Centralized Data Delivery Stacy Kowalczyk and James Halliday April 28, 2008.
Nikola Tesla Museum Clipping Library Saša Malkov Nenad Mitić Žarko Mijajlović 3 rd SEEDI Int.Conf. Cetinje, Montenegro 14. September 2007.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
Permanent Hosting, Archiving and Indexing of Digital Resources and Assets Markus Höckner Computer Center University of Vienna.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
1 « Luxembourg, 18 April 2007 « Virtual Library of Official Statistics « Dissemination Working Group.
Digitizing Historical Newspapers South Carolina Digital Newspaper Program's participation with the Library of Congress' Chronicling America: Historic American.
RSC Learning Resources Conference 8 th November 2012, Manchester Andrew Bevan (EDINA)
NLM Update and Still Image Serving April 27, 2016 John Doyle, Doron Shalvi, TA Nguyen National Library of Medicine.
ITL conference 2003 Putting Your Content on a Diet Using rich online media without download woes.
CONTENTdm A proven solution September A complete digital collection management software solution Stores, manages and provides access for all digital.
Ball State University Digital Media Repository …a project of the University Libraries Customization, Web Services, and Storage at Ball State using CONTENTdm.
EVIA Digital Archive Technical Overview EVIA Digital Archive DLP Brown Bag: 7 December 2005.
WV DOT Scanning Project
Building Search Systems for Digital Library Collections
Experiences of the Digital Repository of Ireland
Library Technology Conference: Building Exhibits
DIGITAL LIBRARY.
Lesson 5: Multimedia on the Web
NLM Digital Repository The Search for a New Book viewer
Presentation transcript:

Progress in Access Technologies: NLM Video Search Jennifer Marill Chief, Technical Services Division Edward Luczak Systems Architect, Office of Computer and Communications Systems (contractor)

NLM Digital Collections NLM digital repository launched September 27, 2010 NLM digital repository launched September 27, Focus on “Digital Library” functionality: ingest, store, access, and preserve digital assets Focus on “Digital Library” functionality: ingest, store, access, and preserve digital assets Currently two content types: print and video Currently two content types: print and video Based on Fedora-Commons and other open source software Based on Fedora-Commons and other open source software NLM-developed Video Player with Search NLM-developed Video Player with Search 2

Public Domain Books Cholera Online Collection Cholera Online Collection –546 monographs ( ) –TIFF master images, OCR, METS and ALTO files Medicine in the Americas Collection Medicine in the Americas Collection –NLM’s contribution to Medical Heritage Library project –5,500+ books, 1 million+ pages ( ) –Being digitized in-house using Kirtas scanner –JPEG masters, OCR, METS, ALTO, PDF Ingest into NLM Digital Repository: Ingest into NLM Digital Repository: –Created JPEG2000 derivative images for web access –Book and Page objects contain metadata and content 3

Films and Videos Collection U.S. military and public health films ( ) U.S. military and public health films ( ) ‒ 29 films from HMD audiovisual collection (5-52 min) Previous reformatting: Previous reformatting: –Transferred from 16 mm film to Betacam SP –Digitized from Betacam SP to DVD as circulation copy Ingest into NLM Digital Repository: Ingest into NLM Digital Repository: –MPEG-2 (from DVD) used as master –Several derivative video formats (H.264, MPEG-4, …) –Transcripts and captions, preview image and clip 4

Public User Interface Browse & Search (Muradora) Browse & Search (Muradora) ‒ Supports multiple collections, diverse content –Resource display page: metadata, datastreams Book Viewer (NWU) Book Viewer (NWU) –Open source software from Northwestern University –Open source JPEG2000 server (Djatoka) Video Player with Search (NLM) Video Player with Search (NLM) –Started as IT research project and prototype –Features video transcript search and play-head jump 5

6 System Architecture NWU BookViewer NLM Video Player with Search Muradora 1.4b Fedora Solr GSearch CentOS Linux Virtual server, 3 CPUs, 24 GB RAM Djatoka MySQL 5.0 Tomcat Fedora Managed Storage External Storage Solr Index Resource Index Application ServerDatabase ServerFile Server

7 Films and Videos Collection: Requirements Collection should be searchable Collection should be searchable –Repository-wide search to find relevant videos: search catalog metadata and full video transcript –Video search to find and jump to locations within a selected video where a search word occurs –Accurate video transcript should be displayed, with search words highlighted –Accurate, complete video transcript needed Section 508 accessibility requirements Section 508 accessibility requirements –All videos must have accurate captions

8 NLM Video Search Software Development approach Development approach –Researched tools available for video search (e.g., Autonomy Virage) –Developed in-house prototype –Refined and promoted to production –Sharing within Dept. of HHS as open source software

9 NLM Video Search Software Characteristics Characteristics –Developed in Adobe Flash using ActionScript-3 –Plays H.264 video file retrieved from video object in repository (Progressive download) –User can view captions and transcript –Time-tagged captions / transcript file used to search within video (formatted in W3C DXFP XML) –Search hits listed, and also shown as yellow dots on timeline (hover to see context) –Click yellow dot to jump to location in video

10 Creating Captions and Transcripts Attempted speech recognition of audio track Attempted speech recognition of audio track –Adobe Soundbooth and Premiere CS4 –Low accuracy due to poor audio quality, background music “Echo” speech recognition (parroting) useful “Echo” speech recognition (parroting) useful –Dragon NaturallySpeaking 10 MAGpie (WGBH) caption editor (free) MAGpie (WGBH) caption editor (free) –Manual text entry and caption timing –Creates text transcript and DFXP XML caption files Summer students can be very helpful! Summer students can be very helpful!

11 Future Plans HTML5 HTML5 Improve search by using Apache Solr Improve search by using Apache Solr Audio-only version Audio-only version –Playback and search of audio histories

Demonstration 12