2003.08.26 - SLIDE 1IS 202 - Fall 2003 Course Introduction Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 am.

Slides:



Advertisements
Similar presentations
Chapter 5: Introduction to Information Retrieval
Advertisements

Integrating Educational Technology into the Curriculum
Ying Wang EDN 303 Fall Objectives Define curriculum-specific learning Explain the difference between computer, information, and integration literacy.
1 i206: Distributed Computing Applications & Infrastructure 2012
SLIDE 1IS 257 – Fall 2007 Codes and Rules for Description: History 2 University of California, Berkeley School of Information IS 245: Organization.
Information Retrieval in Practice
Search and Retrieval: More on Term Weighting and Document Ranking Prof. Marti Hearst SIMS 202, Lecture 22.
Models for Information Retrieval Mainly used in science and research, (probably?) less often in real systems But: Research results have significance for.
SLIDE 1IS 202 – FALL 2004 Lecture 29: Final Review Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00.
Oct 31, 2000Database Management -- Fall R. Larson Database Management: Introduction to Terms and Concepts University of California, Berkeley School.
SLIDE 1IS 202 – FALL 2004 Lecture 13: Midterm Review Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am -
8/28/97Information Organization and Retrieval Metadata and Data Structures University of California, Berkeley School of Information Management and Systems.
SLIDE 1IS Fall 2002 Course Introduction Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 am.
Systems Analysis and Design for Electronic Commerce, Networked Business Processes, and Virtual Enterprises Walt Scacchi, Ph.D. GSM 271 and FEMBA 271 Spring.
10/23/2001Information Organization and Retrieval Information Structures and Metadata University of California, Berkeley School of Information Management.
A metadata-based approach Marti Hearst Associate Professor BT Visit August 18, 2005.
SLIDE 1IS 202 – FALL 2002 Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 am Fall 2002
8/31/2000Information Organization and Retrieval What is Information? The Nature, Growth and Characteristics of Information University of California, Berkeley.
© Anselm SpoerriInfo + Web Tech Course Information Technologies Info + Web Tech Course Anselm Spoerri PhD (MIT) Rutgers University
The Subject Librarian's Role in Building Digital Collections: Where Information Management and Subject Expertise Meet Ruth Vondracek Oregon State University.
SLIDE 1IS 202 – FALL 2003 Lecture 26: Final Review Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00.
ISP 433/633 Week 7 Web IR. Web is a unique collection Largest repository of data Unedited Can be anything –Information type –Sources Changing –Growing.
SLIDE 1IS Fall 2004 Course Introduction Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 am.
10/24/2000Information Organization and Retrieval Information Structures and Metadata University of California, Berkeley School of Information Management.
Computer comunication B Information retrieval Repetition Retrieval models Wildcards Web information retrieval Digital libraries.
8/28/2001Information Organization and Retrieval SIMS 202 Information Organization and Retrieval Prof. Ray Larson & Prof. Warren Sack UC Berkeley SIMS Tues/Thurs.
SIMS 202 Information Organization and Retrieval Prof. Marti Hearst and Prof. Ray Larson UC Berkeley SIMS Tues/Thurs 9:30-11:00am Fall 2000.
CS580: Building Web Based Information Systems Roger Alexander & Adele Howe The purpose of the course is to teach theory and practice underlying the construction.
Overview of Search Engines
The Integration of Embedded Librarians at Tuskegee University Juanita M. Roberts Director Library Services Ford Motor Company Library/Learning Resources.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Library 150 Information Literacy & Research Skills E. Chisato Uyeki Fall 2006: Week 1 September 22, 2006.
Lecture 1 Page 1 CS 111 Summer 2015 Introduction CS 111 Operating System Principles.
Put it to the Test: Usability Testing of Library Web Sites Nicole Campbell, Washington State University.
Information Retrieval CENG 555 Spring Course Web Page Authoritative source of administrivia In-class announcements generally reflected on Web.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Information Retrieval and Web Search Lecture 1. Course overview Instructor: Rada Mihalcea Class web page:
Personal Information Management Vitor R. Carvalho : Personalized Information Retrieval Carnegie Mellon University February 8 th 2005.
1 Database Management for Electronic Commerce and EBusiness Walt Scacchi, Ph.D. GSM 274/FEMBA 274 Spring 2002.
Proposal for Term Project J. H. Wang Mar. 2, 2015.
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
Overviews of ITCS 6161/8161: Advanced Topics on Database Systems Dr. Jianping Fan Department of Computer Science UNC-Charlotte
Advanced Database Course (ESED5204) Eng. Hanan Alyazji University of Palestine Software Engineering Department.
+ Introduction to Class IST210 Class Lecture. + Course Objectives Understand the importance of data, databases, and database management Design and implement.
1 Technologies for Electronic Commerce and EBusiness Walt Scacchi, Ph.D. FEMBA 290 Winter 2003.
The Structure of Information Retrieval Systems LBSC 708A/CMSC 838L Douglas W. Oard and Philip Resnik Session 1: September 4, 2001.
Information Retrieval Techniques Israr Hanif M.Phil QAU Islamabad Ph D (In progress) COMSATS.
Digital Libraries Lillian N. Cassel Spring A digital library An informal definition of a digital library is a managed collection of information,
SIMS 202 Information Organization and Retrieval Prof. Marti Hearst and Prof. Ray Larson UC Berkeley SIMS Tues/Thurs 9:30-11:00am Fall 2000.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Information Retrieval CSE 8337 Spring 2007 Introduction/Overview Some Material for these slides obtained from: Modern Information Retrieval by Ricardo.
Information Retrieval and Web Search Course overview Instructor: Rada Mihalcea.
Information Retrieval
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Prof. James A. Landay Computer Science Department Stanford University Winter 2016 dt+UX 2 : USER EXPERIENCE DESIGN PROJECT Introduction & Course Overview.
1 Module 8 Reporting Results. 2 Learning Objectives At the end of this session participants will:  Understand key points to effectively present results.
Search and Retrieval: Finding Out About Prof. Marti Hearst SIMS 202, Lecture 18.
Information Retrieval CIS-462 Dr. Samir Tartir 2013/2014 First Semester.
B. Prabhakaran1 Multimedia Systems Reference Text “Multimedia Database Management Systems” by B. Prabhakaran, Kluwer Academic Publishers. – Kluwer bought.
INFORMATION STROAGE AND RETRIEVAL SYSTEM By Ms. Preeti Patel Lecturer School of Library And Information Science DAVV, Indore
IMS 4212: Course Introduction 1 Dr. Lawrence West, Management Dept., University of Central Florida ISM 4212 Dr. Larry West
“Babeş-Bolyai” University Faculty of Economics and Business Administration Second semester 1st year, English line of study Business IT Introductive course.
Organization of Information LSIS Summer II (2005)
SIMS 202, Marti Hearst Final Review Prof. Marti Hearst SIMS 202.
Information Retrieval in Practice
Information Storage and Retrieval Fall Lecture 1: Introduction and History.
Proposal for Term Project
University of California, Berkeley
Information Retrieval CIS-462
Presentation transcript:

SLIDE 1IS Fall 2003 Course Introduction Prof. Ray Larson & Prof. Marc Davis UC Berkeley SIMS Tuesday and Thursday 10:30 am - 12:00 am Fall 2003 SIMS 202: Information Organization and Retrieval Credits to Marti Hearst for some of the slides in this lecture

SLIDE 2IS Fall 2003 Today Introductions Course Overview Administrivia

SLIDE 3IS Fall 2003 Today Introductions Course Overview Administrivia

SLIDE 4IS Fall 2003 IS202 Teaching Team Professor Ray Larson Professor Marc Davis TA Mayjane Co TA Maria Lawrence

SLIDE 5IS Fall 2003 Who Am I? Professor and Associate Dean at SIMS Here from the founding of SIMS, faculty member of the “previous school”

SLIDE 6IS Fall 2003 What Do I Do? Research –Design, development and evaluation of information retrieval systems and digital libraries –Cheshire II and III –Bibliometrics of the WWW –Geographic information retrieval (GIR) –Distributed search and retrieval –Applications of Grid computing to (large-scale) IR Teaching –Information Retrieval –Database Management

SLIDE 7IS Fall 2003 Who Am I? Assistant Professor at SIMS (School of Information Management and Systems) Background 1980 – 1984B.A. from Wesleyan University in the College of Letters 1984 – 1987M.A. from the University of Konstanz in Literary Theory and Philosophy 1990 – 1995Ph.D. from MIT Media Laboratory in Media Arts and Sciences 1993 – 1998Member of the Research Staff and Project Coordinator at Interval Research Corporation 1999 – 2002Chairman and CTO of Amova

SLIDE 8IS Fall 2003 What Do I Do? Create technology and applications that will enable daily media consumers to become daily media producers Research and teaching in the theory, design, and development of digital media systems for creating and using media metadata to automate media production and reuse –Research Director of the Garage Cinema Research group Executive Committee member and co-founder of the Center for New Media –Teaching Multimedia Information Digital Media Design Studio

SLIDE 9IS Fall 2003 Student Introductions Who are you? –Name –Undergrad degree –Special areas of expertise and interest Why are you here? –What you want to learn from the course

SLIDE 10IS Fall 2003 Today Introductions Course Overview Administrivia

SLIDE 11IS Fall 2003 Goals of the Course Learn about –Design, development, and use of information organization and retrieval systems –Practical and theoretical foundations of information organization and analysis –Evaluation of information access systems –Cognitive and user-centric considerations –Hands-on experience with information systems

SLIDE 12IS Fall 2003 Two Main Themes Information Organization and Design Information Retrieval and the Search Process

SLIDE 13IS Fall 2003 Information Organization and Retrieval To organize is to (1) furnish with organs, make organic, make into living tissue, become organic; (2) form into an organic whole; give orderly structure to; frame and put into working order; make arrangements for. Knowledge is knowing, familiarity gained by experience; person’s range of information; a theoretical or practical understanding of; the sum of what is known. To retrieve is to (1) recover by investigation or effort of memory, restore to knowledge or recall to mind; regain possession of; (2) rescue from a bad state, revive, repair, set right. Information is (1) informing, telling; thing told, knowledge, items of knowledge, news. The Oxford English Dictionary, cf. Rowley

SLIDE 14IS Fall 2003 (Approximate) Course Schedule Organization –Overview –Categorization –Knowledge Representation –Metadata Introduction –Controlled Vocabularies Introduction –Thesaurus Design and Construction –Multimedia Information Organization and Retrieval –Metadata for Media –Database Design –XML

SLIDE 15IS Fall 2003 Information Properties Information can be communicated electronically –Broadcasting –Networking Information can be easily duplicated and shared –Problems of ownership –Problems of control Adapted from ‘Silicon Dreams’ by Robert W. Lucky

SLIDE 16IS Fall 2003 Information Hierarchy Wisdom Knowledge Information Data

SLIDE 17IS Fall 2003 Information Hierarchy Data –The raw material of information Information –Data organized and presented by someone Knowledge –Information read, heard, or seen and understood Wisdom –Distilled and integrated knowledge and understanding

SLIDE 18IS Fall 2003 Information Where is the Life we have lost in living? Where is the wisdom we have lost in knowledge? Where is the knowledge we have lost in information? -- T.S. Eliot, “The Rock” Where is the information we have lost in data?

SLIDE 19IS Fall 2003 Information Life Cycle Creation UtilizationSearching Active Inactive Semi-Active Retention/ Mining Disposition Discard Using Creating Authoring Modifying Organizing Indexing Storing Retrieval Distribution Networking Accessing Filtering

SLIDE 20IS Fall 2003 Authoring/Modifying Converting data+information+knowledge to new information Creating information from observation, thought Editing and publication Gatekeeping

SLIDE 21IS Fall 2003 Organizing/Indexing Collecting and integrating information Affects data, information, and metadata “Metadata” describes data and information –More on this later Organizing information –Types of organization? Indexing

SLIDE 22IS Fall 2003 Storing/Retrieving Information storage –How and where is information stored? Retrieving information –How is information recovered from storage? –How do we find needed information? –Linked with accessing/filtering stage

SLIDE 23IS Fall 2003 Distribution/Networking Transmission of information –How is information transmitted? Networks vs. broadcast

SLIDE 24IS Fall 2003 Accessing/Filtering Using the organization created in the O/I stage to: –Select desired (or relevant) information –Locate that information –Retrieve the information from its storage location (often via a network)

SLIDE 25IS Fall 2003 Using/Creating Using information Transformation of information to knowledge Knowledge to new data and new information

SLIDE 26IS Fall 2003 Key Issues in This Course How to describe information resources in ways so that they may be effectively used by those who need to use them –Organizing How to find the appropriate information resources for someone’s (or your own) needs –Retrieving

SLIDE 27IS Fall 2003 Key Issues Creation UtilizationSearching Active Inactive Semi-Active Retention/ Mining Disposition Discard Using Creating Authoring Modifying Organizing Indexing Storing Retrieval Distribution Networking Accessing Filtering

SLIDE 28IS Fall 2003 (Approximate) Course Schedule Retrieval –Introduction to Search Process –Boolean Queries and Text Processing –Statistical Properties of Text and Vector Representation –Probabilistic Ranking and Relevance Feedback –Evaluation –Web Search Issues and Architecture –Interfaces for Information Retrieval Organization –Overview –Categorization –Knowledge Representation –Metadata Introduction –Controlled Vocabularies Introduction –Thesaurus Design and Construction –Multimedia Information Organization and Retrieval –Metadata for Media –Database Design –XML

SLIDE 29IS Fall 2003 Web Search Questions What do people search for? How do people use search engines? –How often do people find what they are looking for? –How difficult is it for people to find what they are looking for? How can search engines be improved?

SLIDE 30IS Fall 2003 What Do People Search for on the Web? Study by Spink et al., Oct 98 – –Survey on Excite, 13 questions –Data for 316 surveys

SLIDE 31IS Fall 2003 What Do People Search for on the Web? Topics Genealogy/Public Figure:12% Computer related:12% Business:12% Entertainment: 8% Medical: 8% Politics & Government 7% News 7% Hobbies 6% General info/surfing 6% Science 6% Travel 5% Arts/education/shopping/images 14% Something is missing…

SLIDE 32IS Fall 2003 What Do People Search for on the Web? 4660 sex 3129 yahoo 2191 internal site admin check from kho 1520 chat 1498 porn 1315 horoscopes 1284 pokemon 1283 SiteScope test 1223 hotmail 1163 games 1151 mp weather maps 1036 yahoo.com 983 ebay 980 recipes 50,000 queries from excite 1997 Most frequent terms:

SLIDE 33IS Fall 2003 Why Do These Differ? Self-reporting survey The nature of language –Only a few ways to say certain things –Many different ways to express most concepts UFO, flying saucer, space ship, satellite How many ways are there to talk about history?

SLIDE 34IS Fall the a to of and in s for on this is by with or at all are from e you be that not an as home it i have if new t your page about com information Source: What is on the Web?

SLIDE 35IS Fall 2003 Intranet Queries (Aug 2000) 3351 bearfacts 3349 telebears 1909 extension 1874 schedule+of+classes 1780 bearlink 1737 bear+facts 1468 decal 1443 infobears 1227 calendar 989 career+center 974 campus+map 920 academic+calendar 840 map 773 bookstore 741 class+pass 738 housing 721 tele-bears 716 directory 667 schedule 627 recipes 602 transcripts 582 tuition 577 seti 563 registrar 550 info+bears 543 class+schedule 470 financial+aid

SLIDE 36IS Fall 2003 Intranet Queries Summary of sample data from 3 weeks of UCB queries –13.2% Telebears/BearFacts/InfoBears/BearLink (12297) –6.7% Schedule of classes or final exams (6222) –5.4% Summer Session (5041) –3.2% Extension (2932) –3.1% Academic Calendar (2846) –2.4% Directories (2202) –1.7% Career Center (1588) –1.7% Housing (1583) –1.5% Map (1393) Average query length over last 4 months: 1.8 words This suggests what is difficult to find from the home page

SLIDE 37IS Fall 2003 IR Issues in the Course What metadata is collected How the indexes are created How queries are formed How documents are ranked How shortest paths are computed How the system is built –… among other things! –This is just an introduction! Much more on these issues in the second half of the course

SLIDE 38IS Fall 2003 Course Format Most classes will be lecture/discussion sessions –Lecture ~60 minutes –Discussion ~20 minutes For each class students will prepare discussion questions for each reading and help lead discussion Some classes will be working sessions –Information Organization Summary and Phone Project Update –Phone Project Presentations –Final Review Some classes will be exams –In Class Midterm Exam –Final Exam

SLIDE 39IS Fall 2003 IS202 Course Project

SLIDE 40IS Fall 2003 Phone Project Goals Experience the actual process of information organization and retrieval –Especially as regards mobile media metadata creation, sharing, and (re)use Work in small, focused teams performing a variety of tasks –Image capture, cataloging, and application design Explore and design new applications for an emerging information organization and retrieval platform Develop an ongoing resource for SIMS (an annotated photo database) for –Internal research and teaching –External promotional and informational purposes

SLIDE 41IS Fall 2003 Phone Project Requirements Create engaging and useful application scenarios and photos Create a shared, reusable resource of annotated photos –All photos will be stored in one directory –Design your metadata So that all photos would be accessible from all applications Not only for the needs of your particular application, but also for the reusability of your photos and metadata

SLIDE 42IS Fall 2003 Assignments and Exams Approximately 12 assignments –Most due within one week to ten days –Many related to the Phone Project –Sometimes “checked”, sometimes graded Final exam (during finals week) Grading –Assignments: 60% Not evenly weighted –Final: 25% –Class Participation: 15%

SLIDE 43IS Fall 2003 Today Introductions Course Overview Administrivia

SLIDE 44IS Fall 2003 Readings Course reader –Will be available in about a week (will announce) –Textbooks Modern Information Retrieval, Baeza-Yates and Ribiero-Neto (Eds.), Addison Wesley, 1999 The Organization of Information, Arlene G. Taylor, Libraries Unlimited, 1999,

SLIDE 45IS Fall 2003 Homework (!) Read the handouts –Borges, Dennett, and Reddy Write one or two paragraphs on –What is information, according to your background or area of expertise? Due in class this Thursday, Aug 29

SLIDE 46IS Fall 2003 Next Time More information about the Phone Project More on what is information? And how much of it is out there?