CNI 2003/Herlocker, Jung, and Webster1 Collaborative Filtering: Possibilities for Digital Libraries Jon Herlocker Janet Webster Seikyung Jung Oregon State.

Slides:



Advertisements
Similar presentations
Describing Grey Literature Again: A Survey of Collection Policies Heather Lehman Information School, University of Washington Janet Webster Guin Library,
Advertisements

Recommender Systems & Collaborative Filtering
1 Scholarly Publishing Initiatives in ARL Libraries: a Penn State Perspective Nancy L. Eaton, Dean University Libraries and Scholarly Communications The.
While You Were Out: How Students are Transforming Information and What it Means for Publishing Kate Wittenberg The Electronic Publishing Initiative at.
Designing and Evaluating Mobile Interaction: Challenges and Trends Authors: Marco de Sa and Luis Carrico.
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
CS 431 The Semester in Elevator Speak Carl Lagoze – Cornell University May 5, 2004.
TDL Labs Partnerships for Exploration Luis Francisco-Revilla, Unmil P. Karadkar School of Information The University of Texas at Austin.
Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements: A Data Scientist Perspective Dr. Vicki Lynn Ferrini.
Funding provided by the National Science Foundation DLI-Phase 2 NSF Award # A Digital Library of Reusable Science and Math Resources for Undergraduate.
Co-Directors: Yigal Arens USC / Information Sciences Institute Judith Klavans Columbia University.
Andy Gorman - Center for LifeLong Learning and Design6/14/01 S P I D E R Sharing Pertinent Information in Dynamically Evolving Repositories Projects generate.
2/7/2001 Presentation at the University of Kansas Digital Libraries – Meeting the Challenges Beth Forrest Warner.
WikiConversation Scotty Allen Phong Le. Goal Support joint document production asynchronously via localized comment capability In context of different.
Digital Library Service Integration (DLSI) --> Looking for Collections and Services to be DLSI Testbeds
John OckerbloomDec. 6, 2002 Supporting learning at the library Towards integrating LMS and digital library technology at Penn John Mark Ockerbloom CNI.
Information Access Douglas W. Oard College of Information Studies and Institute for Advanced Computer Studies Design Understanding.
Nnadi & Bieber, NJIT © Lightweight Integration of Documents and Services (Digital Library Integration Infrastructure) Nkechi Nnadi and Michael Bieber.
Center for the Study of Digital Libraries Texas A&M University College Station, TX.
Welcome! Chicago Seminar Anton Hristov Sitefinity Product Strategy & Learn more at sitefinity.com Content Management System.
AERO Meeting | September 24, 2009 EthicShare: Building an Inter-Institutional Scholarly Research Community Kate McCready Cecily Marcus.
QuestionPoint and the Library of Congress FEDLINK Fall OCLC Users Group Meeting Linda J. White Public Service Collections Library of Congress FEDLINK Fall.
Cataloging in digital age Li Sun Asian Languages Cataloger Metadata Librarian Cataloging and Metadata Services Rutgers University Libraries CEAL Annual.
The Natural Resources Digital Library Needs, Partners, and Challenges Bonnie Avery, Janine Salwasser, & Janet Webster Oregon State University.
Schipholweg 99 | P.O. box 876 | 2300 AW Leiden | The Netherlands | tel + 31 (0)71– | fax +31 (0)71– ELAG UPDATE Michel G Wesseling 17.
Presenter: Karla Strieb Assistant Executive Director Transforming Research Libraries June 3, 2010 Supporting E-science: Progress at Research Institutions.
Effective User Services for High Performance Computing A White Paper by the TeraGrid Science Advisory Board May 2009.
` Tangible Interaction with the R Software Environment Using the Meuse Dataset Rachel Bradford, Landon Rogge, Dr. Brygg Ullmer, Dr. Christopher White `
1 Recommender Systems and Collaborative Filtering Jon Herlocker Assistant Professor School of Electrical Engineering and Computer Science Oregon State.
BETH Bologna (Italy) What is known must be shared Building on the insights from OCLC Research.
1 CNI 2005 Fall BriefingTechLens TechLens: Exploring the Use of Recommenders to Support Users of Digital Libraries Joseph A. Konstan, Nishikant Kapoor,
NDIIPP The Next Phase Meg Williams Associate General Counsel The Library of Congress.
Libraries in Web 2.0 environment Mihaela Banek Zorica University of Zagreb, Faculty of Humanities and Social Sceinces, Department of information sciences.
1 The Digital Public Library for Flanders A strategic look into the future Jan Braeckman Based on consultancy by ONE Agency Vlaams Centrum voor Openbare.
The Information Challenge Exponential growth of resources New researchers with new needs Multiple communication options New expectations and opportunities.
Lecture 2 Jan 13, 2010 Social Search. What is Social Search? Social Information Access –a stream of research that explores methods for organizing users’
NBDL (National Biology Digital Library) A NSDL Core Integration System Project PI: Su-Shing Chen n University of Missouri-Columbia n National Computational.
Teachers’ Domain: An Accessible Digital Library for Education Bryan Gould and Trisha O’Connell WGBH National Center for Accessible Media
CONCLUSION & FUTURE WORK Given a new user with an information gathering task consisting of document IDs and respective term vectors, this can be compared.
Promoting the Sustainability of a Digital Initiatives Project User-Centered Assessment and Testing of Aerial Photographs of Colorado Holley Long, Kathryn.
Shaping the scientific evolution of technology enhanced learning noe-kaleidoscope.org #90 contractors — 23 countries 1100 researchers (2/3) and PhD students.
Freelib: A Self-sustainable Digital Library for Education Community Ashraf Amrou, Kurt Maly, Mohammad Zubair Computer Science Dept., Old Dominion University.
Overview of Elementary Media Center Collection Development Stacy Darwin LSIS 5505-OL1 Dr. Cogdell October 22, 2010.
Information in the Digital Environment Information Seeking Models Dr. Dania Bilal IS 530 Spring 2005.
DriveSense’14 NSF Workshop on Large-Scale Traffic and Driving Activity Data DriveSense’14, Oct 30-31, Norfolk, VA.
The National Center for Professional and Research Ethics Presents Ethics CORE Michael C. Loui University of Illinois at Urbana-Champaign.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
A Leap in the Dark A Pilot Project for an Electronic-Only Engineering Collection Laurel Kristick and Margaret Mellinger Oregon State University Libraries.
Introduction to Information Retrieval Example of information need in the context of the world wide web: “Find all documents containing information on computer.
Image Classification for Automatic Annotation
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Individualized Knowledge Access David Karger Lynn Andrea Stein.
User Modeling and Recommender Systems: Introduction to recommender systems Adolfo Ruiz Calleja 06/09/2014.
National Science Foundation Science of Learning Centers RESEARCH EDUCATIONWORKFORCE.
Digital Data Collections ARL, CNI, CLIR, and DLF Forum October 28, 2005 Washington DC Chris Greer Program Director National Science Foundation.
Augmenting (personal) IR Readings Review Evaluation Papers returned & discussed Papers and Projects checkin time.
Preliminary Findings Baseline Assessment of Scientists’ Data Sharing Practices Carol Tenopir, University of Tennessee
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
Digital Data Collections in Biology Collaborative Expedition Workshop November 8, 2005 Arlington, Virginia Chris Greer Program Director National Science.
A Shared Commitment to Digital Preservation and Access.
IAUTL June 2002 Michelle Cadoree, Library of Congress Virtual Reference: Making it Work For You.
Proactive Analytic Workspaces fro Heterogeneous Data
Summon® 2.0 Discovery Reinvented
Jarek Nabrzyski Director, Center for Research Computing
NSDL: A New Tool for Teaching and Learning.
Automating Support for CMMI Level 5 Organizational Improvement
Briefing to ARL Membership
Haystack: an Adaptive Personalized Information Retrieval System
Presentation transcript:

CNI 2003/Herlocker, Jung, and Webster1 Collaborative Filtering: Possibilities for Digital Libraries Jon Herlocker Janet Webster Seikyung Jung Oregon State University

CNI 2003/Herlocker, Jung, and Webster2 Current search engines are insufficient.

CNI 2003/Herlocker, Jung, and Webster3 Two important search engine problems They don’t understand: –Quality –Context

CNI 2003/Herlocker, Jung, and Webster4 But First: Our Context Why are we standing up here? We think we can improve the digital library experience.

CNI 2003/Herlocker, Jung, and Webster5 Today’s Context 1. Research questions & hypotheses 2. Collaborative filtering 3. Our approach to CF in the Library 4. Challenges of collaborative filtering for library search 5. Initial lessons learned

CNI 2003/Herlocker, Jung, and Webster6 The Librarian’s Questions As electronic information increases in amount and value, how to provide access to it? How to change digital libraries from disconnected collections to integrated systems? How to integrate the expertise of librarians into the development process? How to adapt traditional library values to new opportunities?

CNI 2003/Herlocker, Jung, and Webster7 The Computer Scientist’s Questions What is the next big leap in document search technology? How to overcome the limitations of software’s ability to understand language? How can we build a search engine that learns by observing searchers?

CNI 2003/Herlocker, Jung, and Webster8 Our Research Hypotheses Enabling the entire community to participate in organizing and recommending information will add value to the digital library In other words: Collaborative Filtering will increase the value of a digital library

CNI 2003/Herlocker, Jung, and Webster9 What is Collaborative Filtering? Communities of people sharing their evaluations of content Recommendations are transferred between people of like interest Examples: –MovieLens.org –Epinions.com –Launchcast (launch.yahoo.com) –Amazon.com

CNI 2003/Herlocker, Jung, and Webster10 CF and Libraries Search is central to user experience of digital library Collaborative Filtering: –Could overcome the limitations of current search technology –CF already exists in libraries. Not search, but cataloguing (OCLC) Adapting CF for document searching is not trivial. –Information needs are dynamic.

CNI 2003/Herlocker, Jung, and Webster11 Our Approach OSU Libraries Recommender System – Perform at CF at query level Match similar queries in addition to similar users – Generate results based on past user recommendations – Infer recommendations from user behavior – Integrate with existing library systems and traditions

CNI 2003/Herlocker, Jung, and Webster12

CNI 2003/Herlocker, Jung, and Webster13

CNI 2003/Herlocker, Jung, and Webster14

CNI 2003/Herlocker, Jung, and Webster15 The Benefits of CF Quality is considered. –Recommendations are based on human evaluations. Context is considered. The system gets better as it’s used. Doesn’t require significant, centralized human resources

CNI 2003/Herlocker, Jung, and Webster16 CS Challenges How to collect evaluations? How to identify the “useful” element of recommendations? How to represent the information needs of searchers? How to rank results? How to design the interface?

CNI 2003/Herlocker, Jung, and Webster17 Library Challenges How to balance privacy with personalization & involvement? How to maintain authority of recommended information? How to deal with timeliness of information? How to integrate with existing library systems? How to fund research in the library setting?

CNI 2003/Herlocker, Jung, and Webster18 What We’ve Learned Weakness of “old” search technology affects perception of new Wrapper technology minimizes IT commitment Existing internal data can be used to jumpstart system Controlled experiments show –Increased performance –Increased perception of non-tangibles

CNI 2003/Herlocker, Jung, and Webster19 CF and Digital Libraries Helps handle more electronic information Improve search results Shapes direction of digital libraries Supports collaboration on many levels Nothing ventured, nothing gained.

CNI 2003/Herlocker, Jung, and Webster20 Funding OSU Libraries Gray Chair for Innovative Technologies National Partnership for Advanced Computing Infrastructure (NSF) Georgia Pacific HMSC internship

CNI 2003/Herlocker, Jung, and Webster21 More information –Silence of the Sleeper Malcom Gladwell, The New Yorker, October 4th, 1999 (gladwell.com) –System for Electronic Recommendation Filtering Prototype (SERF) for OSU Libraries

CNI 2003/Herlocker, Jung, and Webster22 Contacts Janet Webster –Oregon State University Libraries, Hatfield Marine Science Center Jon Herlocker –Oregon State University, School of Electrical Engineering & Computer Science