Really Strategies: It's all about the content
XML That Pays Off for Your Content Database
Lisa Bos, www.reallysi.com

Presentation transcript:

XML That Pays Off for Your Content Database
"It's all about the content."
Lisa Bos

This year's theme: Context
• This year's Knowledge Management track focuses on context.
  - XML, together with a database, is an excellent way to capture content context.
  - Choosing when to use XML, and choosing among the tools for managing XML, is also about context:
    - Content lifecycle context
    - Organizational context

Audience survey
• Hands-on experience with XML?
• Theoretical knowledge of XML?
• Little or no familiarity with XML?
• Organization using XML now?

XML basics
• What it looks like
  - InfoToday …
• Hierarchical
• DTDs/schemas (tags and attribute rules)
• Basic concepts
  - Meaningful names
  - Rules-based (= consistency)
  - Format-independent and predictable
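
The slide's own XML sample did not survive in this transcript, so here is a minimal sketch of the three ideas it lists: hierarchy, meaningful names, and predictable structure. The element and attribute names (`article`, `section`, `status`, and so on) are hypothetical and are not taken from the original presentation.

```python
import xml.etree.ElementTree as ET

# Hypothetical document: all element and attribute names are illustrative only.
SAMPLE = """\
<article id="a1" status="draft">
  <title>Quarterly Report</title>
  <body>
    <section>
      <heading>Introduction</heading>
      <para>Content comes first.</para>
    </section>
  </body>
</article>
"""

def outline(element, depth=0):
    """Return one indented line per element, showing the hierarchy."""
    lines = ["  " * depth + element.tag]
    for child in element:
        lines.extend(outline(child, depth + 1))
    return lines

root = ET.fromstring(SAMPLE)
tree_lines = outline(root)
print("\n".join(tree_lines))   # nested element names, one per line
print(root.get("status"))      # attribute values are addressable by name
```

Because the names say what the content is rather than how it looks, the same file can feed a print layout, a web page, or a search index.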

Introduction
• If you're confused about how or whether to combine XML with databases, you're not alone.
• Today, few people are confused about the value of relational databases.
• Getting there with XML is more difficult because relational databases and XML are both complementary and overlapping.

Documents and data
• Documents:
  - Longer
  - Internal structure
  - Usually meant to be read from top to bottom
  - Often used to support data
• Data:
  - Shorter ("fields")
  - Little if any internal structure
  - Often used to support documents (metadata)
  - Sometimes a collection of data presented as a document

Data and documents
• Your documents might contain data
  - Financial reports
• Your data might contain mini-documents
  - Long descriptions
• Reality: a continuum of content
• Labels are for our convenience

Why differentiate data & documents?
• Labels imply tools and functionality
  - Think about what you do with Excel versus what you do with Word
• Especially important to:
  - Storage and searching
  - Editing tools (forms and document editors)
  - Interchange (sharing content among systems and organizations)
• Yesterday's and most of today's tools are optimized for one type of functionality
[Diagram labels: Storage; Interchange (data loading, publishing, integration, …); Editing]

Uh oh
• But what about this continuum of content? How do I handle that?
• Figure out the best way for you (today) to support the functionality you need in each of the three main areas mentioned in the previous slide: storage, editing, and interchange.

Generally speaking: When to use XML?
• XML is often a good choice for document capture.
  - In general, it's easier to model document structures hierarchically, if you need to model them at all.
• XML is sometimes a good choice for data storage.
  - In general, it's easier to model data relationally.
• Context (lifecycle stage) is critical in determining architecture.

Storage & searching options for XML/data
1. Relational databases
   - Optimization for complex data queries
   - Mature
   - Most databases or development platforms have some XML "awareness"
   - Some ability to search XML
   - Can write software to deconstruct XML and store it as fields in a database, and then to reconstruct for output as XML
   - Performance okay
   - Not as easy as it sounds
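
The "deconstruct and reconstruct" option can be sketched as follows. This is an illustration only, assuming a hypothetical flat `product` record and an in-memory SQLite database; the table, the element names, and the `shred` and `rebuild` helpers are invented for the example and do not come from the presentation.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical input: a flat, data-like XML record.
SOURCE = "<product sku='P-100'><name>Widget</name><price>9.50</price></product>"

def shred(conn, xml_text):
    """Deconstruct one <product> element into a relational row."""
    elem = ET.fromstring(xml_text)
    conn.execute(
        "INSERT INTO product (sku, name, price) VALUES (?, ?, ?)",
        (elem.get("sku"), elem.findtext("name"), float(elem.findtext("price"))),
    )

def rebuild(conn, sku):
    """Reconstruct the XML for one product from its row."""
    name, price = conn.execute(
        "SELECT name, price FROM product WHERE sku = ?", (sku,)
    ).fetchone()
    elem = ET.Element("product", sku=sku)
    ET.SubElement(elem, "name").text = name
    ET.SubElement(elem, "price").text = f"{price:.2f}"
    return ET.tostring(elem, encoding="unicode")

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE product (sku TEXT PRIMARY KEY, name TEXT, price REAL)")
shred(conn, SOURCE)
rebuilt = rebuild(conn, "P-100")
print(rebuilt)
```

This is the easy case: every field appears once and in a fixed order. Documents with mixed content, repeated elements, or significant ordering need extra tables and sequence columns, which is one reason the approach is "not as easy as it sounds".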

Storage and searching options for XML/data (cont'd)
2. XML databases
   - Optimized for searching XML hierarchies
   - Some ability to handle more "relational" content
   - Might mean you choose to embed metadata within your XML documents
   - Less mature
3. Combination approaches
   - Relational database and an XML database
   - XML "chunks" in a relational database
4. File system
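
One of the combination approaches, XML "chunks" in a relational database, might look like the sketch below. It assumes a hypothetical `doc` table in SQLite with a relational `status` column and the document body kept whole as XML text; the `find_titles` helper and the sample rows are made up for illustration.

```python
import sqlite3
import xml.etree.ElementTree as ET

conn = sqlite3.connect(":memory:")
# Metadata lives in ordinary columns; the document itself stays as one XML chunk.
conn.execute("CREATE TABLE doc (id INTEGER PRIMARY KEY, status TEXT, body_xml TEXT)")
conn.executemany(
    "INSERT INTO doc (status, body_xml) VALUES (?, ?)",
    [
        ("published", "<doc><title>Pricing</title><para>Rates rise in May.</para></doc>"),
        ("draft", "<doc><title>Rates</title><para>Nothing yet.</para></doc>"),
        ("published", "<doc><title>Staffing</title><para>Two new hires.</para></doc>"),
    ],
)

def find_titles(conn, status, word):
    """Filter rows relationally, then search inside each chunk's <title> only."""
    hits = []
    rows = conn.execute(
        "SELECT id, body_xml FROM doc WHERE status = ? ORDER BY id", (status,)
    )
    for doc_id, body_xml in rows:
        title = ET.fromstring(body_xml).findtext("title", default="")
        if word.lower() in title.lower():
            hits.append((doc_id, title))
    return hits

matches = find_titles(conn, "published", "pric")
print(matches)
```

Here the database filters on metadata and application code searches the element. An XML database, or a relational database with XML-aware queries, would push the element search into the query itself instead of parsing each chunk in application code.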

Editing options for XML/data
1. Forms
   - Custom
   - Can include text boxes
   - Can include text boxes with XML support (functional limitations)
   - Very easy to make available over the Web
2. XML editors
   - Provide both a document and a forms presentation
   - Built-in document editing and XML handling features
   - More difficult to make available over the Web
   - More expensive
3. Word processors: Lots of customization
4. Combo: Different tools for different content

Interchange options
• Lots of options. For example:
  - Database replication
  - Data as XML (files or via software) is loaded into another database that understands more about the data relationships than is reflected in the XML
  - Application-level interchange that doesn't involve XML at all
  - …
• Tailor approach to the needs of the systems involved

Summary
• Do you need to control document elements?
  - XML for documents worth considering.
• Is editing/presenting subsets of a document important?
  - XML for documents, XML databases worth considering.
• Is searching specific document elements important to you?
  - XML database worth considering.
• Do you have complex data relationships?
  - Use a relational database (with an XML database?)
• Is your organization risk-averse?
  - Stick with relational databases.

Aside #1: Wait a minute!
• XML for documents? Do I have to?
• No, you don't.

Aside #2: Transformation
• To move between the three areas discussed (storage, editing, and interchange), content must be transformed
• This is more work than you might expect
• It's easier when your content is well-modeled
• Look very closely at the tools for transformation in prospective systems
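
As a rough illustration of what a transformation step involves, the sketch below renames the elements of a stored XML document to produce HTML for publishing. The `TAG_MAP` mapping and the `to_html` helper are hypothetical; production systems more often use a dedicated transformation language such as XSLT, which the Python standard library does not include.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping from content element names to HTML element names.
TAG_MAP = {"article": "div", "title": "h1", "para": "p"}

def to_html(elem):
    """Recursively copy an element tree, renaming tags via TAG_MAP."""
    out = ET.Element(TAG_MAP.get(elem.tag, "span"))  # unmapped tags fall back to <span>
    out.text = elem.text
    out.tail = elem.tail
    for child in elem:
        out.append(to_html(child))
    return out

source = ET.fromstring(
    "<article><title>Context</title><para>Model first.</para></article>"
)
html = ET.tostring(to_html(source), encoding="unicode", method="html")
print(html)
```

The mapping stays this short only because the source elements have consistent, meaningful names. Every inconsistency in the content model becomes a special case in the transformation code.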

Final Words
• Model your content first.
• Actively decide what the line is between data and documents in your environment, or decide that there isn't a firm line.
• Be pragmatic: you probably need to choose an approach based on what's possible, affordable, and acceptable (risk), not just architecturally appealing.
• Experiment and learn before making final choices.
• If you get the content right, you can change your implementation later.

Thank you.