Search. Search issues How do we say what we want? –I want a story about pigs –I want a picture of a rooster –How many televisions were sold in Vietnam.

Slides:



Advertisements
Similar presentations
Introduction to Web Design XHTML. The Basics Elements and Tags are the basics of any webpage.
Advertisements

Searching & Saving Web Resources ADE100- Computer Literacy Lecture 23.
Procedure for Developing a Multimedia Presentation 6.02 Apply procedures to develop multimedia presentations used in business.
UEC 01 : Computer Skills & Programming Concepts I 1PUA – Computer Engineering Department – UEC01 – Dr. Mona Abou - Of Lecture 10: Presentation Graphic.
Tries Standard Tries Compressed Tries Suffix Tries.
. Website and file organization. How websites work.
Algorithms (Contd.). How do we describe algorithms? Pseudocode –Combines English, simple code constructs –Works with various types of primitives Could.
Searching The Web Search Engines are computer programs (variously called robots, crawlers, spiders, worms) that automatically visit Web sites and, starting.
Copyright 2003 The McGraw-Hill Companies, Inc CHAPTER Application Software computing ESSENTIALS    
S OFTWARE AND M ULTIMEDIA Chapter 6 Created by S. Cox.
Start -> All Programs -> Classes -> Web Expressions -> Dreamweaver.
WMES3103 : INFORMATION RETRIEVAL INDEXING AND SEARCHING.
Search engines fdm 20c introduction to digital media lecture warren sack / film & digital media department / university of california, santa.
Software and Multimedia
AN ULTIMATE PORTAL TO GOOGLE’S SUCCESS BY: BENJAMIN NGUYEN & BRITTANY HAHN.
Did someone say Music? November 7, Adding MUSIC to a web page: 1. Find your music! 2. A good website is 3. You will.
Searching the Internet Using Google Tips and Tricks.
Lecture #32 WWW Search. Review: Data Organization Kinds of things to organize –Menu items –Text –Images –Sound –Videos –Records (I.e. a person ’ s name,
ULI101 – XHTML Basics (Part II) What is Markup Language? XHTML vs. HTML General XHTML Rules Block Level XHTML Tags XHTML Validation.
Basic Web Applications 2. Search Engine Why we need search ensigns? Why we need search ensigns? –because there are hundreds of millions of pages available.
Click on surfer mouse to catch a wave. The Internet is a worldwide network of _______ that are connected by wires and cables. Click the picture below.
Task 2.3: Stand Up – a presentation to encourage people to stand up to cyberbullying You will be working together on this activity with the same people.
Computer Vocabulary Acceptable Use Policy conduct expected from someone using a computer.
 The World Wide Web is a collection of electronic documents linked together like a spider web.  These documents are stored on computers called servers.
Unit 2, cont. September 12 More HTML. Attributes Some tags are modifiable with attributes This changes the way a tag behaves Modifying a tag requires.
Software. Generic Software  e.g. word processing, spreadsheet and database. – This simply implies that any of the dozens of spreadsheet packages, for.
Creating webpages in Google Sites. 1- Create a Gmail account.
Understand business uses of presentation software and methods of distribution.
LOGO Searching the Web CHAPTER 2 Eastern Mediterranean University School of Computing and Technology Department of Information Technology ITEC229 Client-Side.
Search Engines. Search Strategies Define the search topic(s) and break it down into its component parts What terms, words or phrases do you use to describe.
استاد : مهندس حسین پور ارائه دهنده : احسان جوانمرد Google Architecture.
Quiz # 1 Chapters 1,2, & 3.
Lecture # 30 Data Organization and Binary Search.
Return to Outline Copyright © 2011 by Maribeth H. Price 3-1 Labeling and annotation.
The World Wide Web: Information Resource. Hock, Randolph. The Extreme Searcher’s Internet Handbook. 2 nd ed. CyberAge Books: Medford. (2007). Internet.
Authoring and Multimedia Data. Data Sources Data Types Data Compression Techniques Data Security.
Text and Advanced HTML. Review - Basic data Numbers Characters (ASCII & UNICODE) Records (collection of named things) Arrays (collection of indexed things)
By Mrs. Houle  It is time to visit the Internet. You are going to jump onto Google Chrome to learn how to use the web browser tools to get around. Along.
Lawrence Snyder University of Washington, Seattle © Lawrence Snyder 2004.
The World Wide Web: Information Resource. How a Search Engine works… How Search Works - YouTube
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
Software AS Module Heathcote Ch. 22. Importance of Information  Information technology is fundamental to the success of any business  The information.
Lesson 6 Links. Creating Folders  For every web site/page, you need to create a separate folder  The computer cannot find links if they are not stored.
Procedure for Developing a Multimedia Presentation Apply procedures to develop multimedia presentations used in business.
CS100 Final Review Study the quizzes Find out what you missed on the midterms.
Setting up a search engine KS 2 Search: appreciate how results are selected.
Creating a Google Site For a Digital Portfolio Purpose.
Web Design Terminology Unit 2 STEM. 1. Accessibility – a web page or site that address the users limitations or disabilities 2. Active server page (ASP)
Chapter 8 Adding Multimedia Content to Web Pages HTML5 & CSS 7 th Edition.
1 Chapter 5 (3 rd ed) Your library is an excellent resource tool. Your library is an excellent resource tool.
Third Grade Home Directory/H-Drive The location on the server where individual users can save their work. This directory is named the same as the username.
Client-Side Internet and Web Programming
Information Architecture
Procedure for Developing a Multimedia Presentation
Procedure for Developing a Multimedia Presentation
Software and Multimedia
Software and Multimedia
Procedure for Developing a Multimedia Presentation
Procedure for Developing a Multimedia Presentation
Procedure for Developing a Multimedia Presentation
Procedure for Developing a Multimedia Presentation
Procedure for Developing a Multimedia Presentation
Files Handling In today’s lesson we will look at:
Technology Vocabulary
Procedure for Developing a Multimedia Presentation
Procedure for Developing a Multimedia Presentation
Computer Terms 1 Terms Internet Terms 1 Internet Terms 2 Computer
Internet Vocabulary Terms
Procedure for Developing a Multimedia Presentation
Presentation transcript:

Search

Search issues How do we say what we want? –I want a story about pigs –I want a picture of a rooster –How many televisions were sold in Vietnam during 2000? –Find a movie like this one How does the computer find what we said?

Things to search for Records Text Images Audio Video

Records Car –Price: $5,000 –Miles: 20,000 –Year: 1994 –Make: Toyota –Doors: 2 Queries Price < 6000 & Miles< Make == Toyota & Year > 1993

Queries Make == Toyota & Year >1993

Queries Make == Toyota & Year >1993

Queries Year >1993 or Price < $3,000

Queries Year >1993 or Price < $3,000

Databases Large collections of records Accessed by queries

Things to search for Records  Text Images Audio Video

Text searching How do I say what I want? –Type some phrase I want a story about pigs How will the computer match this? –What is text? An array of characters –What can can a computer do with text? Match characters

Text searching People think in words not characters How do I convert an array of characters into an array of words? –Collect together sequences of letters –How do I know if character C is a letter? C>=“a” & C =“A” & C<=“Z”

Convert to words Because people think in words

Every document is an array of words I want a story about pigs How will I find the right documents? –Find all documents that have the word “pigs”

Searching text How will I find pigs fast? –Hint: the “URL Lookup” assignment –Create an index of all words With each word store the name or address of each document that contains that word –Search the index for “pigs” Return the list of documents Use a binary search on the word list (50,000 words)

Problems What if a document has the word “Pig” but not “pigs”? Normalize –Case - make all words lower case Pig -> pig –Stemming - remove all suffixes and prefixes before putting a word into the index pigs -> pig piggy -> pig

Problems I want a story about pigs? –How does the computer know to search for pigs? It doesn’t –How does the computer know what a story is? It doesn’t

Searching I want a story about pigs Pick out the important words and search for them –Which words are important? –D = number of times a word appears in a document –A = average number of times a word appears in all documents –Importance = D/A Why?

How do we create an index of all documents on the Web? Try = a list of URLs Seen = all URLs from Seen While (Try is not empty) {Page = take a URL from Try Words = all the “important” words in Page add Page to the index using all of Words Links = all URLs in Page for every Link that is not in Seen add Link to Try and to Seen }

Other ways to find important words and important documents A Document is important if many other documents point to it A word is important in document D if that word occurs frequently in documents that link to document D.

Images What will I say when searching for an image? –I want a rooster picture –Draw a picture of a rooster?

Search by picture? ?

What’s in a picture? Computers don’t understand the contents of images To a computer an image is an array of colors

I want a picture of a rooster Label all of the pictures How does Google do it? –File name of the picture “rooster-crossingSt.jpg” –Words around the picture in the HTML

Audio Talking –Use speech recognition to convert audio to text –With each recognized word keep track of where in the audio it was recognized. Build an index using the recognized text –Normalize based on how words sound rather than are spelled.

Video Where in “Casablanca” does Bogart say “Play it again Sam” ? –he never does, he just says “play it” How can the computer find that? –Transcribe the audio –Speech recognition on the audio

Video Does Woody ever kiss Bo Peep? Exactly what color is a kiss?

Video Does Woody ever kiss Bo Peep? Annotate every frame with who is in the frame and search for frames with both Woody and Bo Peep.

So what’s with this?

Or this?

Is Woody cheating?

Search Records –Queries = And Or Text –Normalized words (case, stemming, thesaurus) Images –Add words Audio –Transcribe or recognize as words Video –Transcribe –Annotate