Structured-Document Processing Languages (3 cu), Spring 2001 Pekka Kilpeläinen University of Kuopio Department of CS & Applied Math

Slides:



Advertisements
Similar presentations
DOCUMENT TYPES. Digital Documents Converting documents to an electronic format will preserve those documents, but how would such a process be organized?
Advertisements

XML: Extensible Markup Language
Processing of structured documents Spring 2003, Part 1 Helena Ahonen-Myka.
1 XML Data Management Course Outline and Organisation Werner Nutt.
Introduction to CS170. CS170 has multiple sections Each section has its own class websites URLs for different sections: Section 000:
Querying XML Documents and Data CBU Summer School (2 ECTS) Prof. Pekka Kilpeläinen Univ of Kuopio, Dept of Computer Science
Web Design! John Paxton Montana State University - Bozeman CCSC Northwestern Region Saturday, October 8 th, 2004.
©Silberschatz, Korth and Sudarshan10.1Database System Concepts W3C Activities HTML: is the lingua franca for publishing on the Web XHTML: an XML application.
CSCE156: Introduction to Computer Science II Instructor Stephen Scott Website
XML A brief introduction ---by Yongzhu Li. XML --- a brief introduction 2 CSI668 Topics in System Architecture SUNY Albany Computer Science Department.
The Semantic Web Week 1 Module Content + Assessment Lee McCluskey, room 2/07 Department of Computing And Mathematical Sciences Module.
Media: Text “Words and symbols in any form, spoken or written, are the most common system of communication.” ~ unknown.
Overview of the MS Program Jan Prins. The Computer Science MS Objective – prepare students for advanced technical careers in computing or a related field.
Technical Report Writing and Presentation Skills Course Outline 1.
+ Connecting to the Web Week 7, Lecture A. + Midterm Basics Thursday February 28 during Class The lab Tuesday, February 26 is optional review Class on.
Lecture 1, 7/21/2005Natural Language Processing1 CS60057 Speech &Natural Language Processing Autumn 2005 Lecture 1 21 July 2005.
Prof. dr Slobodanka Đorđević-Kajan Dr Dragan Stojanović
C O M P U T E R G R A P H I C S Guoying Zhao 1 / 16 Computer Graphics Course Introduction.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke1 XML Taken from Chapter 7.
CS 2104 Prog. Lang. Concepts Dr. Abhik Roychoudhury School of Computing Introduction.
Structured-Document Processing Languages Spring 2011 Course Review Repetitio mater studiorum est!
School of Computing and Management Sciences © Sheffield Hallam University To understand the Oracle XML notes you need to have an understanding of all these.
Digital Systems 1 Digital Systems Research Presentation Mohammad Sharifkhani ©
OPERATING SYSTEMS AND LANGUAGE TRANSLATORS CIS 2380 TERM 2 – LANGUAGE TRANSLATORS Lee McCluskey – 23/09/20151.
CSS 404 Internet Concepts. XP Objectives Developing a Web page and a Website Working with CSS (Cascading Style Sheets) Web Tables Web Forms Multimedia.
XML 1 Enterprise Applications CE00465-M XML. 2 Enterprise Applications CE00465-M XML Overview Extensible Mark-up Language (XML) is a meta-language that.
1 XML Data Management Course Outline and Organisation Werner Nutt.
UFCEUS-20-2 Web Programming Prakash Chatterjee Room 3P16 Lecture 1 : Introduction & Course Outline.
Structured-Document Processing Languages Spring 2005 Course Review Repetitio mater studiorum est!
CMSC 345, Spring CMSC 345 Software Design and Development Spring 2003 Section 0101 Ms. Susan Mitchell “Welcome to the School of Hard Knocks”
Course Introduction Software Engineering
CST 229 Introduction to Grammars Dr. Sherry Yang Room 213 (503)
Structured-Document Processing Languages (3 cu), Spring 2002 Pekka Kilpeläinen University of Kuopio Department of CS & Applied Math
Intro. to XML & XML DB Bun Yue Professor, CS/CIS UHCL.
Lei Bu Preliminary Introduction to the Theory of Computation.
February 25, ICE 1341 – Programming Languages (Lecture #1) In-Young Ko Programming Languages (ICE 1341) Lecture #1 Programming Languages (ICE 1341)
SDPL 2001Notes 4: Intro to Stylesheets1 4. Introduction to Stylesheets n Discussed recently: –Programmatic manipulation of (data-oriented) documents n.
1 Database Management for Electronic Commerce and EBusiness Walt Scacchi, Ph.D. GSM 274/FEMBA 274 Spring 2002.
XML A web enabled data description language 4/22/2001 By Mark Lawson & Edward Ryan L’Herault.
1 Technical & Business Writing (ENG-315) Muhammad Bilal Bashir UIIT, Rawalpindi.
UFCEUS-20-2 Web Programming Lecture 1 Module Introduction & Outline.
Course Overview Stephen M. Thebaut, Ph.D. University of Florida Software Engineering Foundations.
CMSC 345, Fall CMSC 345 Software Design and Development Fall 2003 Section 0101 Ms. Susan Mitchell Leave Your Ego at the Door!
Introduction to XML This presentation covers introductory features of XML. What XML is and what it is not? What does it do? Put different related technologies.
1 Overview of XSL. 2 Outline We will use Roger Costello’s tutorial The purpose of this presentation is  To give a quick overview of XSL  To describe.
King Saud UniversityCSC112 - First Semester CSC 112 Java Programming I Introduction.
Structured-Document Processing Languages (3 cu), Spring 2004 Pekka Kilpeläinen University of Kuopio Department of CS & Applied Math
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 4 1COMP9321, 15s2, Week.
Structured-Document Processing Languages Spring 2007 Course Review Repetitio mater studiorum est!
SDPL 2002Notes 4: Intro to Style Sheets1 4. Introduction to Style Sheets n Discussed recently: –Programmatic manipulation of documents n Now a more human-oriented.
SDPL 20064: Introduction to Style Sheets1 4. Introduction to Style Sheets n Discussed recently: –(APIs for) procedural manipulation of documents n Now.
Structured-Document Processing Languages Spring 2004 Course Review Repetitio mater studiorum est!
©Silberschatz, Korth and Sudarshan10.1Database System Concepts W3C - The World Wide Web Consortium W3C - The World Wide Web Consortium.
UFIEUS-20-2 Web Programming Prakash Chatterjee / Dan Dixon Room 3P16 Lecture 1 : Introduction & Course.
Structured-Document Processing Languages (3 cu/6 ECTS cp), Spring 2007 Pekka Kilpeläinen University of Kuopio Department of Computer Science
Sheet 1XML Technology in E-Commerce 2001Lecture 0 XML Technology in E-Commerce Klaas van den Berg & Ivan Kurtev 2000/2001 – trimester 3.
King Saud University1 CSC 112 Java Programming I Introduction.
SDPL 2004Notes 4: Intro to Style Sheets1 4. Introduction to Style Sheets n Discussed recently: –Programmatic manipulation of documents n Now a more human.
Structured-Document Processing Languages (5 cp), Spring 2011 Pekka Kilpeläinen University of Eastern Finland School of Computing
Course Overview Stephen M. Thebaut, Ph.D. University of Florida Software Engineering.
I Copyright © 2004, Oracle. All rights reserved. Introduction.
Scott C. Johnson Lecturer Rochester Institute of Technology Spring 2016.
ECE/CS 352 Digital System Fundamentals1 ECE/CS 352 Digital Systems Fundamentals Spring 2001 Introduction Charles R. Kime.
CS140 – Computer Programming 1 Course Overview First Semester – Fall /1438 – 2016/2017 CS140 - Computer Programming 11.
CSE202 Database Management Systems
Structured-Document Processing Languages
Logic and Computer Design Fundamentals
Introduction to Internet Programming
XML Data Introduction, Well-formed XML.
CS 456 Interactive Software.
Presentation transcript:

Structured-Document Processing Languages (3 cu), Spring 2001 Pekka Kilpeläinen University of Kuopio Department of CS & Applied Math

SDPL 2001Notes 1: Introduction2 1 Introduction First: Overview and Arrangements What this course is about? 1.1 Structured Documents Review of basic notions

SDPL 2001Notes 1: Introduction3 Goals of the Course n Sufficient familiarity with the most important models and languages for –manipulating –representing –transforming and –querying structured documents (or XML) n Emphasis on active formalisms (for describing processes on documents) instead of describing documents/data

SDPL 2001Notes 1: Introduction4 NOT an Exhaustive Survey n Bias in selecting course topics: –estimated usefulness/value »centrality (implying longer-lasting value) »maturity: Stable specifications? Existing implementations? –Lecturer up-to-date?

SDPL 2001Notes 1: Introduction5 Preliminary Outline 1 Introduction Overview and Arrangements 1.1 Structured Documents 2 Document Grammars and Instances 2.1 Grammars and Parse Trees 2.2 Review of XML basics 3 Programmatic Manipulation of Structured Documents (XML APIs) 3.1 SAX 3.2 DOM

SDPL 2001Notes 1: Introduction6 Preliminary Outline (2) 4 Styling Structured Documents I 4.1 Essentials of Cascading Style Sheets 5 Transforming Structured Documents 5.1 Addressing: XPath 5.2 XSLT 6 Styling Structured Documents II: XSL 7 XML Web-Site Architectures 8 Querying Structured Documents 8.1 Region Algebra and sgrep 8.2 XML Query Languages

SDPL 2001Notes 1: Introduction7 Methodological Goals n Some central professional skills –consulting of technical specifications –experimenting with SW implementations n Ability to think…? –to find out relationships –to apply knowledge in new situations n ("Pidgin English" for scientific communication)

SDPL 2001Notes 1: Introduction8 Administration n An elective graduate-level (laudatur) special course –suitable for all specialisation lines (esp. CS/SWE) n 3 cu (  120 hours of work) n Lectures Mar 7 - May 2, Microteknia MT2 –Lecturer: n Assistant:

SDPL 2001Notes 1: Introduction9 Administration: Exercises n Exercises Mar 22 - May 3, MT2; May 11, MT1 –essential for familiarising with the subject –mainly normal homework assignments, solutions discussed in-class –1 or 2 groups, depending on attendance n + a few (1-3) "mini-projects" »reading and summarising tasks? »hands-on experimentation? »to be handed-in to lecturer –credited like other exercises (scaled based on quality by a factor in [0, 1.5])

SDPL 2001Notes 1: Introduction10 Administration: Grading n Course examination on Wed, May 23, in Auditorium –minimum of 50% of exam points to pass the course Grade = (32*Exam/MaxExam + 12*HomeWork/MaxHomeWork - 8)/3 Grade = (32*Exam/MaxExam + 12*HomeWork/MaxHomeWork - 8)/3 n Opportunity to retake the exam –June 7 (  50% to pass, grade with/without homework credits)

SDPL 2001Notes 1: Introduction11 Material n No single textbook n Reports, articles n Course home page – –lecture notes, exercises, reference material, announcements, … n Recommended (but not required) text: Deitel, Deitel, Nieto, Lin & Sadhu: XML - How to Program. Prentice Hall, 2001.

SDPL 2001Notes 1: Introduction12 Background Check n Basic knowledge of structured documents and document standards –Course "Document standards"? –HTML? n Programming languages and concepts –OO programming, Java? –Unix/Linux \ Windows? n Formal language theory –Theory of Computation / "Ohjelmoinnin ja laskennan teoria"? –regular expressions, automata? –context-free grammars, parse trees?

SDPL 2001Notes 1: Introduction13 Course Expectations?

SDPL 2001Notes 1: Introduction Structured Documents n Document: –a structured representation of information on some medium (  message) –normally for a human reader »memos, manuals, articles, books, … –also application-to-application messages »EDI (electronic data interchange) –"prose-oriented XML" vs "data-oriented XML" –possibly non-permanent, dynamically generated –processable or conceivable as a unit »(a web page vs a web site)

SDPL 2001Notes 1: Introduction15 Text-Based Documents n We concentrate on textual or text-based documents –character data major constituent of information content –as opposed to, say multimedia documents n Next: Presentation vs Structure

SDPL 2001Notes 1: Introduction16 Presentation vs Structure n Presentation informs the human reader about the meaning of text and the role of its parts n Markup: indicating the presentation or the meaning of different parts of text –originally hand-written annotations for the typesetter –nowadays primarily codes embedded in digital documents

SDPL 2001Notes 1: Introduction17 Markup n Procedural markup –formatting commands (start boldface, produce an empty line, indent 5 mm,...) –proprietary word processor formats, nroff, TeX,... n Descriptive or generic markup –indicating the logical structure of text using chosen names –LaTeX: \begin{abstract}... \end{abstract} –HTML:.... –HTML:.... n Markup language –a fixed set of markup notations (e.g. nroff, TeX, HTML, SVG, …)

SDPL 2001Notes 1: Introduction18 Structured documents? Most liberally, any document is structured (punctuation, words, sentences, fields, …) but especially descriptively marked-up documents... especially if they adhere to a rigorous specification of structure.

SDPL 2001Notes 1: Introduction19 Structure in documents n Hierarchy or nesting is ubiquitous –chapters of books, warnings in maintenance manuals,... n Linear order essential in prose documents –less important in documents representing data objects n Hypertext and cross-references n We'll be mainly dealing with manipulation of hierarchical, or tree-like document structures Next: How these are modelled?