Content Challenges for Open Government Dale Waldt Sr. Analyst / Consultant

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Data Publishing Service Indiana University Stacy Kowalczyk April 9, 2010.
Presentation by Priyanka Sawarkar
© 2007 IBM Corporation Enterprise Content Management Integrating Content, Process, and Connectivity for Competitive Advantage Malcolm Holden October 2007.
Iowa Code and Rules Easy Navigation and Search Scope Analysis &Planning Phases Completed Request for Execution Funding.
WHY CMS? WHY NOW? CONTENT MANAGEMENT SYSTEM. CMS OVERVIEW Why CMS? What is it? What are the benefits and how can it help me? Centralia College web content.
Visibility Information Exchange Web System. Source Data Import Source Data Validation Database Rules Program Logic Storage RetrievalPresentation AnalysisInterpretation.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Systematic Review Data Repository (SRDR™) The Systematic Review Data Repository (SRDR™) was developed by the Tufts Evidence-based Practice Center (EPC),
11© 2011 Hitachi Data Systems. All rights reserved. HITACHI DATA DISCOVERY FOR MICROSOFT® SHAREPOINT ® SOLUTION SCALING YOUR SHAREPOINT ENVIRONMENT PRESENTER.
IAEA International Atomic Energy Agency ICSTI 2013 Annual Members’ Meeting March 2013.
® IBM India Research Lab © 2006 IBM Corporation Challenges in Building a Strategic Information Integration Infrastructure Mukesh Mohania IBM India Research.
Microsoft Office 4/16/2017 © 2012 Microsoft Corporation. All rights reserved. Microsoft, Windows, and other product names are or may be registered trademarks.
Sakai Overview ITS Teaching and Learning Interactive Aurora Collado January 10, 2008.
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Microsoft Office Open XML Formats Brian Jones Lead Program Manager Microsoft Corporation.
Versus: A Web Repository Daniel Gomes, João P. Campos, Mário J. Silva XLDB Research Group University of Lisbon [dcg, jcampos, Versus is.
Renaud Comte [MVP]
Microsoft Office Sharepoint Server 2007 (MOSS) Overview Momentum Microsoft November 15, 2007.
Software Documentation Written By: Ian Sommerville Presentation By: Stephen Lopez-Couto.
Midwest Documentum User Group Harley-Davidson Documentum WCM 10/10/2006.
January 2013 CDMI: An Introduction. Big Data Complexity Volume Speed “Big Data” refers to datasets whose size is beyond the ability of typical tools to.
Federated Search: True Enterprise Search Abe Lederman, President and CTO Deep Web Technologies Search Engine Meeting – April 28-29, 2008.
Content Management Interoperability Services (CMIS)
IAEA International Atomic Energy Agency Agenda item 2.6 INIS Collection Search 36 th Consultative Meeting of INIS Liaison Officers 4-5 October 2012, Vienna,
Powered by Employment Security Department WorkSource Integrated Technology Solution.
SharePoint MOSS Platform Server-based Excel spreadsheets and data visualization, Report Center, BI Web Parts, KPIs/Dashboards Enterprise.
Web Publisher. Rinaldo De Paolis General Manager – Qualitem & Connected Systems.
- 1 - Roadmap to Re-aligning the Customer Master with Oracle's TCA Northern California OAUG March 7, 2005.
Presentation Outline (hidden slide) Technical Level: 100 Intended Audience: TDMs, ITPros, ITDMs, BI specialists Objectives (what do you want the audience.
Fundamentals of XML Management Greg Alexopoulos Systems Engineer Documentum.
material assembled from the web pages at
Sept 19,  Provides a common set of terminology and definitions  A framework for describing resources and processes  Enables computer based interoperability.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
© 2008 IBM Corporation ® IBM Cognos Business Viewpoint Miguel Garcia - Solutions Architect.
Delivering business value through Context Driven Content Management Karsten Fogh Ho-Lanng, CTO.
Themes Architecture Content Metadata Interoperability Standards Knowledge Organisation Systems Use and Users Legal and Economic Issues The Future.
GPO’s Federal Digital System August 17, 2010 U.S. Government Printing Office.
@ 2008 Copyright NIC I Do not distribute without permission E-Services for Transforming to the Next Generation Government “A Case Study of India” Suchitra.
1 A Very Large Digital Library Technology Demonstration William Y. Arms Cornell University.
“Confidential –Internal Halliburton Use Only. © 2004 Halliburton. All Rights Reserved.” Portal Brief OracleAS Portal A component of Oracle Application.
Virtual techdays INDIA │ august 2010 ENTERPRISE CONTENT MANAGEMENT WITH SHAREPOINT 2010 Naresh K Satapathy │ Solution Specialist, Microsoft Corporation.
Delivering Fixed Content to Oracle Portal Doug Daniels & Ken Barrette Quest Software.
DSpace vs Fedora Ralph LeVan OCLC Research. What Do You Want From a Repository? How do you create your metadata? How do you assemble your objects? How.
Create Content Capture Content Review Content Edit Content Version Content Version Content Translate Content Translate Content Format Content Transform.
The Mint Mapping tool The MoRe aggregator Vassilis Tzouvaras, Dimitris Gavrilis National Technical University of Athens Digital Curation Unit - IMIS, Athena.
1 © Xchanging 2010 no part of this document may be circulated, quoted or reproduced without prior written approval of Xchanging. MOSS Training – UI customization.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
The library is open Digital Assets Management & Institutional Repository Russian-IUG November 2015 Tomsk, Russia Nabil Saadallah Manager Business.
 A content management system ( CMS ) is a system providing a collection of procedures used to manage work flow in a collaborative environment. These.
Enteprise Content Management from Microsoft. 20% structured 80% unstructured 90% of unstructured data is unmanaged Volume of data is increasing ~36%/year.
Business Data Integration with MOSS 2007 Naveedullah Khan PMP, MCAD.NET Senior Consultant.
Introduction to SQL Server 2000 Reporting Services Jeff Dumas Technical Specialist Microsoft Corporation
Workflow based Knowledge Management Solutions 1 InfoAxon Technologies Limited Confidential.
MasterCard Global Marketing Center An Alfresco Case Study Jay Mandel, MasterCard International Mike Vertal, Rivet Logic Corporation 15 November 2012.
June 30, 2005 Public Web Site Search Project Update: 6/30/2005 Linda Busdiecker & Andy Nguyen Department of Information Technology.
GEOSS Common Infrastructure (GCI) The GEOSS Common Infrastructure allows Earth Observations users to search, access and use the data, information, tools.
© 2006 Epiance, Inc. Confidential and Proprietary 1.
Moving on : Repository Services after the RAE
Joseph JaJa, Mike Smorul, and Sangchul Song
Accessing and Surfacing LOB Data in SharePoint 2010
Software Documentation
Flexible Extensible Digital Object Repository Architecture
Flexible Extensible Digital Object Repository Architecture
Easy TMF Introduction & Demo for QED Clinical Services
Reportnet 3.0 Database Feasibility Study – Approach
Presentation transcript:

Content Challenges for Open Government Dale Waldt Sr. Analyst / Consultant

2 Content Challenges for Open Government High volume / aggregation Complexity / heterogeneous formats Complex content integration and delivery Timing / updating / currency of information

3 High-Volume / Heterogeneous Content A federal agency in the US maintains an extremely large records archive Petabytes of content, constantly updated from hundreds of sources Diverse formats / document types Mix of structured / unstructured content HTML, PDF, Word, CSV, Binaries, RDMS, etc. Not feasible / allowed to "normalize" into consistent format for storage, indexing, searching, delivery

4 High-Volume / Heterogeneous Content 100 PB DB Search Indexing Index Metadata Crawler Metadata & References 100 GB Query Handler

5 High-Volume / Heterogeneous Content Challenges Difficult to index for search Diverse data formats / lack of transparency Opportunities X-Query provides flexible access into diverse content Crawlers harvest metadata / build index into content Web applications access standard metadata / WS API

6 Content Integration / Delivery Mass.gov 300,000 pages 240+ contributors 8+ centralized production team Specialized audience views Urgent news / news feed Task-specific information Content by agency

7 Content Integration / Delivery Metadata enrichment to automate creation of views Easy to enforce a taxonomy Feeds automated search/query processes Aids dynamic assembly

8 Content Integration / Delivery Challenges Maintaining consistent look and feel, navigation Manual build lists maintained for each "view" not scalable Infrequent contributors cannot master complex editing tools Opportunities Metadata to support dynamic assembly / search views Controlled vocabulary to organize information

9 Real-Time Updating Iowa Legislature Bills/amendments/ laws/statutes content Tracking info / dates Real-time updates Links to related content Historical information & versions

10 Real-Time Updating How a Bill Becomes a Law…

11 Real-Time Updating Challenges Automated processes needed to support volume / real-time updates Aging tools need updating Opportunities Metadata to integrate related content Workflow designed to capture / report actions / content versions Query tools for accessing real-time reporting information / content

12 Lessons Learned Robust data architecture enables robust information delivery Legacy data / systems need updating Search tools need metadata for custom views Process needs automation for scalability Users need simple tools that produce rich content

13 The Role of Standards Data models for content processing & validation Taxonomies for classification & reorganization Interoperability of shared content repositories Transformation & rendering of content Processes & policies for consistency & governance "Standards leverage and communicate the work of others to reduce development time and increase accuracy of content."

14 Resources Gilbane publications Enabling the Promise of Open Government: Addressing Large-Scale Integration, Storage, and Access of Complex Information Content Management Interoperability Services (CMIS): Addressing Contemporary Requirements for Content Integration Download for Free

15 Questions?