Presentation is loading. Please wait.

Presentation is loading. Please wait.

Open XML Formats Fabio Santini.NET Developer Evangelist Microsoft Italy.

Similar presentations


Presentation on theme: "Open XML Formats Fabio Santini.NET Developer Evangelist Microsoft Italy."— Presentation transcript:

1

2 Open XML Formats Fabio Santini.NET Developer Evangelist Microsoft Italy

3 New community formed to help bring developers together Currently sponsored by almost 40 institutions from around the world Community and website for information exchange Free of Charge: Available to everyone that wants to participate; encourage development on all platforms Be one of the first to join the community! http://openxmldeveloper.org

4 FF301 Agenda Overview of the new formats Demo: New Office document Additional benefits of the new formats Role of XML in Office documents Structural details of the new formats Custom defined schema support File format benefits for developers Solution capabilities

5 Open XML Formats New XML file formats for Word, Excel and PowerPoint New formats will be the default file formats, with new file type extensions (.docx;.pptx;.xlsx) Fully 100% compatible with existing formats Open, transparent format improves interoperability XML - Transparent, XML format enables new integration scenarios for documents and LOB systems ZIP container - allows for standard compression on all files without user effort Licensing - Removed need for license by providing a Covenant that says we won’t enforce IP against folks implementing the format (100% royalty free) Standardization Ecma International - created TC45 to fully document the Open XML formats Members include: Apple, Barclays Capital, BP, the British Library, Essilor, Intel Corporation, NextPage Inc., Statoil ASA and Toshiba Current spec is already over 2000 pages

6 Office Open XML Formats

7 Microsoft Office Open XML Formats Added Benefits: compact and robust ZIP container allows for standard compression on all files without user effort (Dramatic file size improvements) Significantly more robust files to help minimize data loss Backward Compatible: Office 2000, Office XP, Office 2003 will all support the new formats Patches for compatibility available by launch Open, edit and save new formats Legacy support: Current Office 97-2003 binary file formats supported Support for XML formats from Office 2003, Office XP continued Developers: Endless potential for developers Build solutions to read, write, and modify Office files (without the need to run Office APIs)

8 Benefits of the Office Open XML Formats

9 Evolution Of File Formats Office 2000 Early Innovation XML document properties Office 97 Existing binary file formats designed in 1994, launched in Office 97 Office XP First XML Format Spreadsheet XML Office 2003 Breakthrough XML Support WordML, SpreadsheetML Custom-defined schema Office 2007 New XML Formats XML file format default XML PowerPoint format “Wave 12”

10 The Role Of XML With Documents ScenarioExample Document Assembly Server-based or user-assisted construction of documents from archived content or database content Create sales reports from financial and forecast data stored in a CRM system Content Reuse Much easier to move content between documents, including different document types Apply content stored in Word documents to Web pages quickly and efficiently Content Tagging Add domain-specific metadata to document content to enable custom solutions Tag presentations using a specific taxonomy to improve knowledge management efficiency Document Interrogation Query document repositories based on custom data, content types or document metadata Search for all documents containing a specific company name or sales contact Document Sanitization Remove unwanted content like comments or embedded code from your document when appropriate Remove all tracked changes and comments from a Word document before it is published

11 Open XML Formats Architecture User view: single Office “file” Questionnaire. docx Document Parts Most parts are XML Each XML part is a discreet, compressed component Can add, extract and modify individual parts without using Office programs Corruption or absence of any part would not prohibit the file from being opened Developer view: modular file

12 Modifying An Excel Spreadsheet

13 Components Of The New Formats Package – ZIP Container Part – The “files” inside the ZIP Content Types – Each part has a content type that is enforced on open Relationships – Any part that references another part must do so via a relationship

14 Create A Document From Scratch

15 The Role Of XML Reference and custom-defined schemas Custom-defined Schemas Data-oriented (e.g., Price, Invoice) Represents the business information stored in the document Enable System Integration XML Reference Schemas Display-oriented (e.g. Bold, Italics, Tables, Paragraphs, Styles) Open Document Format Enable Archival & File Formats Interoperability

16 The Role Of XML Reference and custom-defined schemas XML Reference Schemas Display-oriented (e.g. Bold, Italics, Tables, Paragraphs, Styles) Open Document Format Enable Archival & File Formats Interoperability <w:p> John Doe John Doe Health Agency Health Agency </w:p>

17 3/24/2004 Health Agency 100 25% … Custom-defined Schemas Data-oriented (e.g., Price, Invoice) Represents the business information stored in the document Enable System Integration The Role Of XML Reference and custom-defined schemas

18 Custom Defined Schema

19 Developing With The Formats More Reliable Solutions Third-party tools were main cause of document corruptions Fully Documented Formats Freely available for download with a royalty free license Office file format schemas - Used to validate content for a given part Samples, samples, samples In the form of code “snippets” for easier use and integration into your VSTO solutions WinFx Package APIs Access/maintain parts and relationships within a file Takes care of all ZIP level functionality XPath Navigation within content XML DOM Manipulating content Office Open XML Resource Kit Tools for constructing and deconstructing the new file formats Design time Validation tool Parses a file and reports on schema, relationship errors and warnings Runtime serialization tool Flattens package into a single file for ease of development in simple construction scenarios

20 Real World Example Schema documentation Initial submission to Ecma International TC45 was about 2500 pages ~25 namespaces ~94 XSD files 622 simple types 1444 complex types 3299 attributes 3371 enumerations 3464 elements Obviously, there is a huge amount of documentation to be done

21 Sample Solution Scenarios Data interoperability Content manipulation Content sharing and reuse Document assembly Document security Managing sensitive information Document styling Document profiling

22 Resources Office Preview Site http://www.microsoft.com/office/preview/ Brian Jones’s Blog http://blogs.msdn.com/Brian_Jones/ Kevin Boske’s Blog http://blogs.msdn.com/KevinBoske/ Office 2003 Reference Schema Information http://www.microsoft.com/office/xml/

23 © 2006 Microsoft Corporation. All rights reserved. This presentation is for informational purposes only. Microsoft makes no warranties, express or implied, in this summary.

24 Demo Title Name Title Group


Download ppt "Open XML Formats Fabio Santini.NET Developer Evangelist Microsoft Italy."

Similar presentations


Ads by Google