Documents, Text Editors, Text Retrieval, and Web Pages Class 3 LBSC 690 Information Technology.

Agenda Questions Unix Survival Guide Document Creation (Word Processing and HTML) Document Retrieval Project Overview

Unix Survival Guide WAM account Directory structure (mkdir, cd, .., /) How much space is used (du, ls -l) Eliminating unneeded files (rm) Managing mail (pine, attachments) Moving files (mv, cp, ftp) Editing files (pico, more) Web anywhere (lynx)

Document Creation Editors Word Processors Desktop Publishing Structured Documents HTML/SGML/XML

Editors (Text Editing vs. Word Processing) Purpose –Create and modify ASCII text Examples –pico, axe, and emacs on WAM Advantages –Compatible with virtually everything (VT-100) Disadvantages –Limited format control, sometimes no mouse

Word Processors Purpose –Create documents intended for human readers Examples –Microsoft Word and WordPerfect in OWL Advantages –Good format control –WYSIWYG (“What You See is What You Get”) Disadvantages –No (universal) standard interchange format

Desktop Publishing Purpose –Produce documents for wide (paper) distribution Examples –Adobe PageMaker in the WAM labs Advantages –Allows very detailed layout control Disadvantages –Requires fairly extensive user expertise

Structured Documents Purpose –Specify logical structure of the documents Examples –HTML, LaTeX, SGML/XML Advantages –Allows easy reformatting for different displays Disadvantages –Hard to read unless “rendered” before viewing

Hyper-Text Markup Language (HTML) Purpose –Structured document language for web pages Advantages –Adapts easily to different display capabilities –Widely available rendering software (browsers) Disadvantages –Direct control over layout is limited –The HTML “standard” is still evolving

First Steps in HTML Find a web page you like Select “Document Source” in “View” menu Compare HTML code with rendered version –Observe how to achieve each effect Select “Save As” in “File” menu FTP the file to ~/../pub/ on WAM Edit the file using pico

HTML Document Structure Markup tags (open and close) bracket content Title shows up in the Web browser’s frame Headers show up in the page itself For each link, specify the URL and link text Inline graphics can replace the link text
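
The markup on this slide did not survive in the transcript, so the following is only a minimal sketch of the structure it describes, not the original example. It holds a small page as a Python string and uses the standard-library `html.parser` module to pull out the title, the headers, and the link URLs. The page text, URLs, and file names (`photos.html`, `camera.gif`) are hypothetical placeholders, and `StructureParser` is a name invented for this illustration.

```python
from html.parser import HTMLParser

# Hypothetical page: the title, header, URLs, and file names are placeholders.
PAGE = """<html>
<head><title>My Home Page</title></head>
<body>
<h1>Welcome</h1>
<p>Visit the <a href="http://www.example.com/">example site</a>.</p>
<p><a href="photos.html"><img src="camera.gif" alt="Photos"></a></p>
</body>
</html>"""


class StructureParser(HTMLParser):
    """Collects the title, the h1 headers, and the link targets of a page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.headers = []
        self.links = []
        self._current = None  # tag whose text is currently being collected

    def handle_starttag(self, tag, attrs):
        if tag in ("title", "h1"):
            self._current = tag
        elif tag == "a":
            # Each link names its URL in the href attribute.
            self.links.append(dict(attrs).get("href"))

    def handle_endtag(self, tag):
        if tag == self._current:
            self._current = None

    def handle_data(self, data):
        if self._current == "title":
            self.title += data
        elif self._current == "h1":
            self.headers.append(data)


parser = StructureParser()
parser.feed(PAGE)
print(parser.title)    # text between the opening and closing title tags
print(parser.headers)  # text of each h1 header
print(parser.links)    # URL of each link, in document order
```

In the second link of the sample page, an inline image stands in for the link text, which is the last case the slide mentions.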

Designing Web Pages Key design issues: –Content: What do you want to publish? –Style: How do you want to present it? –Syntax: How can you achieve that presentation? Sources of information –Online tutorials (Yahoo points to lots of these) –Technical materials (e.g., the HTML 3.0 spec)

Style Guidelines Design for generic browsers –And test on every version you wish to support Provide appropriate access points –User needs and navigation strategies differ Design useful navigational aids –A web search may lead to the middle of a site Include some indication of currency –Date of last update, “new” icons, etc.

HTML Editors Goal is to create web pages, not learn HTML! Several are available –In Explorer, “Edit-Page” for FrontPage Express –In Netscape, “File-Edit Page” for Composer You may still need to edit the HTML file –Some editors use browser-specific features –Some HTML features may be missing entirely –File names may be butchered by FTP

SGML/XML Generalized Markup Languages –SGML - Standard Generalized Markup Language (for paper documents) –XML - eXtensible Markup Language (for Web documents) (see W3C) These allow people to design –DTDs - Document-type definitions A Document also needs: –DSSSL - Document Stylesheet Specification Language
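
As a rough illustration of what designing a document type looks like, the sketch below embeds a tiny DTD in an XML document and reads it with Python's standard-library `xml.etree.ElementTree`. The `memo` document type and its element names are invented for this example. Note that ElementTree parses the document but does not validate it against the DTD; a validating parser would be needed for that.

```python
import xml.etree.ElementTree as ET

# Hypothetical document type: "memo" and its element names are invented
# for illustration. The DTD sits in the DOCTYPE's internal subset.
MEMO = """<!DOCTYPE memo [
  <!ELEMENT memo (to, subject, body)>
  <!ELEMENT to (#PCDATA)>
  <!ELEMENT subject (#PCDATA)>
  <!ELEMENT body (#PCDATA)>
]>
<memo>
  <to>LBSC 690 students</to>
  <subject>Project specification</subject>
  <body>Please submit one to two pages.</body>
</memo>"""

# Parse the document (no DTD validation is performed here).
root = ET.fromstring(MEMO)
print(root.tag)
for child in root:
    print(child.tag, "=", child.text)
```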

Document Retrieval Making documents is often easier than finding them! Hypertext vs. Cataloging vs. Searching –Yahoo vs. AltaVista Lots of applications –Chasing down citations in papers you read –Web search engines –Managing your personal files Two basic approaches to searching –Explicit queries (“information retrieval”) –“Watch what I do” (“adaptive filtering”)

Ways of Searching for Text Controlled vocabulary –Manual indexing based on named concepts Free text –Characterize documents by the words they contain Social filtering –Exchange and interpret personal ratings

“Exact Match” Retrieval Find all documents with some characteristic –Indexed as “Presidents -- United States” –Containing the words “Clinton” and “Peso” –Read by my boss A set of documents is returned –Each is as likely to be useful as any other –Usually listed in date or alphabetical order
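
The "containing the words" case above can be sketched with a small inverted index: a mapping from each word to the set of documents that contain it. This is an illustrative sketch only. The three documents are made up, and `build_index` and `exact_match` are hypothetical helper names, not part of any library.

```python
# Hypothetical three-document collection; the texts are made up.
DOCS = {
    "doc1": "Clinton discusses the peso crisis with advisers",
    "doc2": "Clinton visits Maryland",
    "doc3": "The peso falls against the dollar",
}


def build_index(docs):
    """Map each lowercased word to the set of documents containing it."""
    index = {}
    for doc_id, text in docs.items():
        for word in text.lower().split():
            index.setdefault(word, set()).add(doc_id)
    return index


def exact_match(index, words):
    """Return the documents containing every query word, in alphabetical order."""
    postings = [index.get(word.lower(), set()) for word in words]
    if not postings:
        return []
    # A document is returned only if it appears in every word's set.
    return sorted(set.intersection(*postings))


INDEX = build_index(DOCS)
print(exact_match(INDEX, ["Clinton", "Peso"]))
```

The result is an unranked set: every returned document satisfies the query equally, so the only ordering applied is alphabetical by document name.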

Ranked Retrieval Put most useful documents near top of a list –Put possibly useful documents lower in the list No need to exclude any documents –Just list those least likely to be useful last Two basic techniques –Similarity-based –Probability-based

Similarity-Based Retrieval Assume “most useful” = most similar to query Lots of clues to meaning –Repeated words are good cues to meaning –Rarely used words make searches more selective Easily combined –Compute a “weight” for each term –Add up the weights for query terms in a document
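
One common way to turn these two clues into term weights is TF-IDF: a term counts for more when it is repeated within a document and when it appears in few documents overall. The slide does not name a specific weighting formula, so treat the formula below (term count times the log of inverse document frequency) as an assumption chosen for illustration. The documents and the `rank` function are likewise hypothetical.

```python
import math
from collections import Counter

# Hypothetical collection; the texts are made up.
DOCS = {
    "doc1": "peso peso crisis clinton",
    "doc2": "maryland weather report",
    "doc3": "clinton clinton speech",
}


def rank(docs, query):
    """Score every document by summing the weights of the query terms it contains."""
    n_docs = len(docs)
    counts = {doc_id: Counter(text.lower().split()) for doc_id, text in docs.items()}

    # Document frequency: in how many documents does each term occur?
    doc_freq = Counter()
    for term_counts in counts.values():
        doc_freq.update(term_counts.keys())

    scores = {}
    for doc_id, term_counts in counts.items():
        score = 0.0
        for term in set(query.lower().split()):
            if term in term_counts:
                # Repeated terms raise the weight; rare terms raise it further.
                score += term_counts[term] * math.log(n_docs / doc_freq[term])
        scores[doc_id] = score

    # No document is excluded; the least similar ones simply sort last.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


for doc_id, score in rank(DOCS, "clinton peso"):
    print(doc_id, round(score, 3))
```

Here `doc1` ranks first because it repeats the rare term "peso", `doc3` follows on the strength of the more common term "clinton", and `doc2` is still listed, at the bottom, with a score of zero.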

Project Overview Goal: Solve a practical problem –One which is fairly complex You choose the technology –Make a set of web pages (a web “site”) –Make a database (optional for summer 690) –Do something else that is equally complex Multimedia presentation, Java program, … Suggest two-person groups

Web Projects Have significant content! (see “What is a Book” web site under CLIS Dean’s Award) Multiple access points –Taxonomy, search engine, map, etc. Be creative (in a useful way)! For example: –Choose a novel application –Engage the user with an interactive approach –Adopt an innovative organization –Implement a creative layout

Database Projects (very ambitious for Summer 690) Your focus should be on scalability –What if the IRS decided to use your database? The user interface is important –Designed to be used without taking 690 first! Include enough content to allow testing –But focus on organization, not on content The same creativity issues as web projects

Project Timeline and Deliverables (summer 690) Project specification (1-2 pages) Should include User Manual (FAQ) and Test Plan components Project demonstrations last week of class –Scheduled individually –All two/three team members get the same grade