Case Study: Report from the Front Lines of Digital Asset Management at CNN Kathy Christensen CNN News Archives August 2001.

Slides:



Advertisements
Similar presentations
IBM WebSphere Everyplace Access for Multiplatforms Managing the e-business Customer Experience.
Advertisements

Software Requirements
IT Works so Uwork(s): Letting Technology work for You!!!
ELibrary Elementary The user-friendly general reference solution for your elementary school 2008.
ELibrary The user-friendly general reference solution 2008.
Chapter 1: The Database Environment
Taxonomy & Ontology Impact on Search Infrastructure John R. McGrath Sr. Director, Fast Search & Transfer.
Putting the Pieces Together Grace Agnew Slide User Description Rights Holder Authentication Rights Video Object Permission Administration.
Mirror Mirror on the wall does your repository reflect it all? Peter West and Timothy Miles-Board EPrints Services University of Southampton Southampton,
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
ESDS Qualidata: Qualitative Data Preparation and Use John Southall ESDS 26 November 2003.
Yammer Technical Solutions Overview
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Grow your business with your head in the cloud. What is Cloud Computing ? Internet-based computing, whereby shared resources, software and information.
Business Development Suit Presented by Thomas Mathews.
1 Never too much of a good thing. Brand new customer support framework Make the most of the worlds most powerful pre- processing software for additive.
Adaptive Solutions Inc. ADAPTIVE SOLUTIONS, INC. Mobile, Alabama Proudly Presents Technology for individuals with Special Needs.
Cleopatra Enterprise Workshop
Software Requirements
Distributed search for complex heterogeneous media Werner Bailer, José-Manuel López-Cobo, Guillermo Álvaro, Georg Thallinger Search Computing Workshop.
Communications Solutions for Hotels/Motels
23-Nov-2000/Janne Saarela Business opportunities on the semantic Web Janne Saarela.
LeadManager™- Internet Marketing Lead Management Solution May, 2009.
Interoperability Scenarios All Working Groups Meeting May, Rome, Italy.
Building an EMS Database on a Company Intranet By: Nicholas Bollons Sally Goodman.
© 2014 QUAD ONE | | CONFIDENTIAL 1 CLM Application Deck Paleti Sainath Business Analyst.
Information Professionals and Learning Object Repositories … more than just metadata quality … Sarah Currier Stòr Cùram Project Librarian JISC X4L Repository.
Digital Archiving Solutions for the Entertainment Industry August 2010.
® Executive Overview August 2007 Expertise within Reach.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Advanced Distributed Learning. Conditions Before SCORM  Couldn’t move courses from one Learning Management System to another  Couldn’t reuse content.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Internet Resources Discovery (IRD) IBM DB2 Digital Library Thanks to Zvika Michnik and Avital Greenberg.
Gerald Schmidt Learning and Teaching Solutions The Open University Embedding automated accessible outputs in open educational resources.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
Metadata Presentation by Rick Pitchford Chief Engineer, School of Communication COM 633, Content Analysis Methods Fall 2009.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
Planning a digital library How to Build a Digital Library Ian H. Witten and David Bainbridge.
We’ve Developed Insights. Now, how do we commercialize them across the organization and retail customers with speed? Objective: Share how Georgia-Pacific.
Spoken dialog for e-learning supported by domain ontologies Dario Bianchi, Monica Mordonini and Agostino Poggi Dipartimento di Ingegneria dell’Informazione.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Presentation Path  Introduction to Ved Consultancy and OpenText  Current Challenges  The Valued Customers and Sectors  Our Solutions  Demo. Together,
JENN RILEY METADATA LIBRARIAN IU DIGITAL LIBRARY PROGRAM Introduction to Metadata.
Metadata Strategy Case Study Bill Rosenblatt GiantSteps Media Technology Strategies (212)
Search Update April 1-3, 2009 Joshua Ganderson Laura Baalman.
Planning a digital library How to Build a Digital Library Ian H. Witten and David Bainbridge.
CBSOR,Indian Statistical Institute 30th March 07, ISI,Kokata 1 Digital Repository support for Consortium Dr. Devika P. Madalli Documentation Research &
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.
Introduction to metadata
Introduction to Metadata Jenn Riley Metadata Librarian IU Digital Library Program.
METRO Science Librarians Chris Forbes, CEO Knovel Corporation January 21, Experience Knovel’s Virtual Technical Library with Analysis.
SMX Madrid 2008 Uncovering the Algorithm A Peek Inside How Google Evaluates and Ranks Pages.
TSS Database Inventory. CIRA has… Received and imported the 2002 and 2018 modeling data Decided to initially store only IMPROVE site-specific data Decided.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
IBM Software Group ® Managing Reusable Assets Using Rational Suite Shimon Nir.
Ask a Librarian: The Role of Librarians in the Music Information Retrieval Community Jenn Riley, Indiana University Constance A. Mayer, University of Maryland.
Gerald Schmidt Learning and Teaching Solutions The Open University Producing DAISY talking books without manual intervention.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
Santi Thompson - Metadata Coordinator Annie Wu - Head, Metadata and Bibliographic Services 2013 TCDL Conference Austin, TX.
Introduction to metadata for IDAH fellows Jenn Riley Metadata Librarian Digital Library Program.
ETERE NUNZIO The ultimate end-to-end solution for your NewsRoom.
NA Sales Training 2007 The Digital Marketing Space.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Digital Video Library - Jacky Ma.
Visual Information Retrieval
AZ.PBSLearningMedia.org Next Generation Digital Content from Eight – Arizona PBS FREE to educators and families I am excited to share with you a free.
Presentation transcript:

Case Study: Report from the Front Lines of Digital Asset Management at CNN Kathy Christensen CNN News Archives August 2001

2 CNN Background Multiple products: CNN, Headline News, CNN International, CNN.com et al, CNN/SI, CNNfn, CNN en Espanol, Airport Network, Inflight CNN Library as central resource –Information research –Archive –Footage licensing

3 Whats in the CNN archive? Type of material –10%: programs (Larry King, Crossfire, etc) –90% is raw footage & edited cut items (pkgs, sots, vos) Volume –150,000+ hours of footage in Atlanta plus additional footage in bureaus –1,000,000+items in Atlanta central catalog plus 600,000 across bureau catalogs Growth –2000 items archived per week in Atlanta culled from many times more incoming items 1/3 of items per day are cut (3 hrs) 2/3 of items per day are raw (90 hrs) –30,000 hours archived in 2000

4 Who are the archive clients? CNN –daily news - TV and Interactive –documentary - TV and Interactive –other (Sales, Marketing, PR, Legal, etc) AOL-TW companies (TNT, TBS, Warner Bros) External customers (Imagesource clients)

5 The Archive Project (aka core of CNNs digital future) Purpose –Preserve assets –Extend usage of assets –Create efficiencies –Facilitate new business opportunities –Create media management framework for the digital CNN

6 Pre-Digital Scenario

7 Digital Scenario

8 System goals and challenges –Multiple resolutions captured simultaneously - to serve broadcast, edit and Internet –Generate as much meaningful cataloging data automatically as possible - technology continuing to improve –Support the necessary human cataloging with powerful tools –Support retrieval needs of diverse user communities

9 Our Approach –Assemble a diverse internal team with multidisciplinary expertise R&D, Engineering, IT, Library Science, Users –Co-developers with Sony and IBM Key Principles –Custom solution not desired –Focus on interoperability and standards –Phased development get started and build on it

10 Users drive cataloging & search requirements Production usually demands video of versus stories about –Automatically captured narrative track excellent for finding about but often misses the of-- what do we see in the footage? –Special challenge of raw video -- b-roll often has no track to capture High-pressure, fast turn-around, 24-hour environment requires highly precise results, extremely quickly Long-term documentary production can tolerate more browsing but still requires reliably comprehensive retrieval News domain requires reliance on accuracy of editorial metadata - bad data and inadequate search systems equal journalistic problems

11 Enablers of accuracy, precision, speed, thoroughness Controlled vs Free-form Data Entry - build data entry aids which support consistent entry Adequate size for keyword and video description fields Controlled classification terms with a mechanism for dynamically updating the classifications Fielded Tags for –best of video –about but not seen –natural sound Flexibility in search approaches - free-text, controlled vocabulary, field-specific, user control over precision vs fuzziness, user control over tracks to include, user control over weighting and display of results

12 Technology strengths supplement human weaknesses Automatic capture of closed-caption text improves retrieval of small, specific portions of programming about something -- a viewer need which is not easily met now. Voice-to-text transcription even at 60% accuracy fills a not-easily met need to find specific soundbites in raw speeches, interviews, hearings, etc. Video to video matching supports identification of permutations of the same video piece across the catalog

13 Technology strengths supplements human strengths Making sense of images, putting them into editorial context, and attaching words so they may be retrieved –Automatic scene change detection facilitates speedy review of item by human cataloger –Face recognition software may not know who a particular face is, but can know that the video contains a face which a human can then identify

14 Technology strengths also supplement technology weaknesses Speech-to-text weakness - some of the data most likely to be search on… names of people, companies, places –Phonetic-based search strengths can cover speech-to-text search weakness Phonetic track useful for searching but doesnt provide textual cataloging data –Speech-to-text transcription useful as representation of the content of the asset

15 Food for thought … Responsibilities –to the parent company –to the user communities –to the rightsholders –to posterity??? This means thinking about –Physical integrity of the content (quality, lossless conversions, standards, migration) –Intellectual integrity of content…ethics