Bieber et al., NJIT ©2005 - Slide 1 Lightweight Integration and Recommendation of Documents and Services ------- Digital Library Service Integration, IntegraL.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Catherine Worrall Slide Library Co-ordinator, University College Falmouth.
Digital Library Service Integration Senior Projects Professors Bieber, Im and Wu Information Systems Department College of Computing Sciences New Jersey.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
DLESE and NSDL The role of the Digital Library for Earth System Education* (DLESE) in the National SMETE Digital Library Presented by Dave Fulker Director.
Web Mining Research: A Survey Authors: Raymond Kosala & Hendrik Blockeel Presenter: Ryan Patterson April 23rd 2014 CS332 Data Mining pg 01.
Funding provided by the National Science Foundation DLI-Phase 2 NSF Award # A Digital Library of Reusable Science and Math Resources for Undergraduate.
BC Integration of Systems and Resources MetaLib at Boston College Theresa Lyman Digital Resources Reference Librarian Boston College Libraries.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
OntoBlog: Linking Ontology and Blogs Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of Informatics, Japan 2 Asian.
Information Retrieval in Practice
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Bieber, Catanio & Zhang, NJIT © Ubiquitous Metainformation and the W Y W W Y W I Principle Michael Bieber*, Joe Catanio*, Li Zhang** *Information.
Interfaces for Selecting and Understanding Collections.
Integrating Hypermedia Functionality into Database Applications Anirban Bhaumik * +, Deepti Dixit *, Roberto Galnares *, Manolis Tzagarakis **, Michalis.
Bieber, Catanio & Zhang, NJIT © Ubiquitous Metainformation and the W Y W W Y W I Principle Michael Bieber*, Joe Catanio*, Li Zhang** *Information.
Bieber et al., NJIT © Slide 1 Lightweight Integration of Documents and Services Digital Library Service Integration, IntegraL and IntLib.
Digital Library Service Integration (DLSI) --> Looking for Collections and Services to be DLSI Testbeds
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy and Mark A. Musen.
Bieber et al., NJIT © Slide 1 Digital Library Integration Masters Project and Masters Thesis Summer and Fall 2005 CIS 786 / CIS Fall.
Nnadi & Bieber, NJIT © Lightweight Integration of Documents and Services (Digital Library Integration Infrastructure) Nkechi Nnadi and Michael Bieber.
Recognizing User Interest and Document Value from Reading and Organizing Activities in Document Triage Rajiv Badi, Soonil Bae, J. Michael Moore, Konstantinos.
Dynamic Hypermedia Engine Professor Michael Bieber
BUILDING A WORKFORCE OF INFORMATION PROFESSIONALS FOR 21ST CENTURY GLOBAL INFORMATION ACCESS (IMLS AWARD ) Jung-ran Park, Chris Yang, Yuji Tosaka,
Using ProQuest Databases Jackson Community College Atkinson Library.
Bieber et al., NJIT © Digital Library Service Integration Michael Bieber, Il Im, Yi-Fang Wu Xin Chen, Dong-ho Kim, Nkechi Nnadi Vikas Achhpiliya.
Personalized Ontologies for Web Search and Caching Susan Gauch Information and Telecommunications Technology Center Electrical Engineering and Computer.
Overview of Search Engines
Artificial Intelligence Research Centre Program Systems Institute Russian Academy of Science Pereslavl-Zalessky Russia.
Educause October 29, 2001 A GEM of a Resource: The Gateway to Educational Materials Copyright Nancy Virgil Morgan, This work is the intellectual.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
What we learned while building DLESE Katy Ginger Metadata Architect, Meteorologist, Instructional Designer.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
The Digital Library for Earth System Education: A Community Resource
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
Making the Connections An E-Matrix for Managing Resources in the Dis-integrated Library System David Goldsmith and Greg Raschke, NCSU Libraries ACRL, April.
OpenURL Link Resolvers 101
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
Topical Crawlers for Building Digital Library Collections Presenter: Qiaozhu Mei.
Metadata Lessons Learned Katy Ginger Digital Learning Sciences University Corporation for Atmospheric Research (UCAR)
Markup and Validation Agents in Vijjana – A Pragmatic model for Self- Organizing, Collaborative, Domain- Centric Knowledge Networks S. Devalapalli, R.
The Internet 8th Edition Tutorial 4 Searching the Web.
The Digital Library for Earth System Science: Contributing resources and collections Meeting with GLOBE 5/29/03 Holly Devaul.
CONCLUSION & FUTURE WORK Normally, users perform search tasks using multiple applications in concert: a search engine interface presents lists of potentially.
4 1 SEARCHING THE WEB Using Search Engines and Directories Effectively New Perspectives on THE INTERNET.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
JISC/NSF PI Meeting, June Archon - A Digital Library that Federates Physics Collections with Varying Degrees of Metadata Richness Department of Computer.
May 26-28ICNEE 2003 ARCHON: BUILDING LEARNING ENVIRONMENTS THROUGH EXTENDED DIGITAL LIBRARY SERVICES Hesham Anan, Kurt Maly, Mohammad Zubair,et al. Digital.
Oct 12-14, 2003NSDL Challenges in Building Federation Services over Harvested Metadata Kurt Maly, Michael Nelson, Mohammad Zubair Digital Library.
NSDL & Access Management David Millman Columbia University Jan ‘02.
Overviews of the Library of Texas & ZLOT Project Dr. William E. Moen Principal Investigator.
User Modeling and Recommender Systems: Introduction to recommender systems Adolfo Ruiz Calleja 06/09/2014.
Information Visualization, Human-Computer Interaction, and Cognitive Psychology: Domain Visualizations Kevin W. Boyack Sandia National Laboratories.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
Web 2.0: Making the Web Work for You, Illustrated Unit A: Research 2.0.
Collaborative Query Previews in Digital Libraries Lin Fu, Dion Goh, Schubert Foo Division of Information Studies School of Communication and Information.
Managing ETDs with Associated Complex Digital Objects Gabrielle V. Michalek Director, Scholarly Publishing, Archives and Data Services Carnegie Mellon.
Discovery and Metadata March 9, 2004 John Weatherley
Information Retrieval in Practice
Search Engine Architecture
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Federated & Meta Search
Introduction to Information Retrieval
Presentation transcript:

Bieber et al., NJIT © Slide 1 Lightweight Integration and Recommendation of Documents and Services Digital Library Service Integration, IntegraL and IntLib Projects Michael Bieber *, Il Im *, Vincent Oria ** Richard Sweeney ***, Yi-Fang Wu * * Information Systems Department *** Robert Van Houten Library ** Computer Science Department College of Computing Sciences New Jersey Institute of Technology April 2005

Bieber et al., NJIT © Slide 2 Outline Motivation Illustrations Structural Relationships 3 Types of Integration Personalizing Links Federated Metasearch Recommendations Contributions and Vision Call for Collaboration Project Details

Bieber et al., NJIT © Slide 3 Challenges for Library Users Need to know what resources to use before they can access them Finding related information outside current system Need to leave current page to do related tasks Why? Library resources aren’t integrated well ==> Project Goal: –Bring relevant resources directly to the user Library resources: databases (e.g., EBSCOhost, ACM Digital Library), external digital libraries, on-line catalog, special collections, library services (e.g., interlibrary loan)...

Bieber et al., NJIT © Slide 4 Integration through Linking automatically generate link anchors on elements recognized based on: –structural relationships –lexical relationships automatically generate links –to related information –to relevant services ==> lightweight integration of –documents containing links and –documents/services the links point to

Bieber et al., NJIT © Slide 5 Prototype Services for a launch-date element: - search by launch date - search by month and year - search by year

Bieber et al., NJIT © Slide 6 Prototype Services for a document element: - open - summarize in 3 sentences

Bieber et al., NJIT © Slide 7 Mock-up for a library database Services from multiple systems (customized to user tasks/preferences)

Bieber et al., NJIT © Slide 8 Benefits of Integration for a system (collection/service) Users: direct access to related systems –enlarges a system’s feature set Links leads users to a system –systems gain wider use Users become aware of other systems –systems gain wider awareness Direct access to a system’s features –streamlined access (bypassing menus)

Bieber et al., NJIT © Slide 9 structural elements and links lexical elements and links Two Types of Links: (1) structural based on element type * title, author, source (2) lexical (found in a glossary)

Bieber et al., NJIT © Slide 10 Structural Relationships Links generated based on application structure, not search or lexical analysis –You cannot do a search on the display text “$127,322.12” to find related information… –But you can find relationships for the element Sales[2002] $85,101.99$127, Expenses2002 Sales

Bieber et al., NJIT © Slide 11 Outline Motivation Illustrations Structural Relationships 3 Types of Integration Personalizing Links Federated Metasearch Recommendations Contributions and Vision Call for Collaboration Project Details

Bieber et al., NJIT © Slide 12 Three Types of Integration: (1) for documents to receive anchors and links (2) to provide services (which become links) (3) to provide glossaries for content analysis Require a document schema mapper to recognize structural elements: -wrapper -fixed template -XML markup -etc.

Bieber et al., NJIT © Slide 13 Three Types of Integration: (1) for documents to receive anchors and links (2) to provide services (which become links) (3) to provide glossaries for content analysis Linking Rules represent * every service * that a system can provide * for each kind of element.

Bieber et al., NJIT © Slide 14 Three Types of Integration: (1) for documents to receive anchors and links (2) to provide services (which become links) (3) to provide glossaries for content analysis Linking Rules represent * every service * that a system can provide * for each kind of element. Example ==>

Bieber et al., NJIT © Slide 15 Example Linking Rule from the AskNSDL system –a) element type (“concept”) –b) link display label (“Ask an expert about this”) –c) relationship metadata –d) destination collection or service (“Ask NSDL”) –e) the exact command to send to the destination system (logs the user into AskNSDL, opens question template, fills in the element instance (i.e., “physics teaching”) as the subject, and places the cursor in the question area) –f) any relevant conditions for including this relationship

Bieber et al., NJIT © Slide 16 Three Types of Integration: (1) for documents to receive anchors and links (2) to provide services (which become links) (3) to provide glossaries for content analysis Lexical analysis by: NJIT Noun Phrase Extractor NJIT Ontology Developer

Bieber et al., NJIT © Slide 17 Each system is integrated independently: (1) Schema mappers for individual systems (2) Linking rules are plugged in” independently for each service (3) Glossaries and thesauri can be independent of other systems

Bieber et al., NJIT © Slide 18 Outline Motivation Illustrations Structural Relationships 3 Types of Integration Personalizing Links Federated Metasearch Recommendations Contributions and Vision Call for Collaboration Project Details

Bieber et al., NJIT © Slide 19 Personalizing the Links Customize the list of links according to: Collaborative Filtering –Matching user’s “click stream” to other users’ time spent at each destination asking users to rate links user task information

Bieber et al., NJIT © Slide 20 Federated Metasearch Searches, merges & ranks

Bieber et al., NJIT © Slide 21 Federated Metasearch Searches, merges & ranks Clusters results by concept

Bieber et al., NJIT © Slide 22 Federated Metasearch: Clustering by Concept concept hierarchy search results are clustered by concept

Bieber et al., NJIT © Slide 23 Outline Motivation Illustrations Structural Relationships 3 Types of Integration Personalizing Links Federated Metasearch Recommendations: General Recommendation Engine Contributions and Vision Call for Collaboration Project Details

Bieber et al., NJIT © Slide 24 Integration With Partner Libraries Minimum integration More complex integration

Bieber et al., NJIT © Slide 25

Bieber et al., NJIT © Slide 26 General Recommendation Engine Research Goals Integrate three major recommendation technologies – Collaborative filtering (CF), Content-based (CB), and Knowledge-based (KB) recommendation Automatically identify users ’ current task (search mode) Study the impacts of the recommendations on information search

Bieber et al., NJIT © Slide 27 Collaborative Filtering (CF) Recommendations based on similarities of people Traditional CF requires direct user inputs Clickstream-based CF (CCF) does not require direct user inputs Works well for preference goods Does not work so well for information-intensive items

Bieber et al., NJIT © Slide 28 Content-based Filtering (CB) Recommendations based on similarities of contents –titles, authors, abstracts, or full texts Information retrieval (IR) techniques are used –e.g., tf.idf value Documents similar in content are recommended Demo:

Bieber et al., NJIT © Slide 29 Knowledge-based Recommendation (KB) CF and CB lack a holistic view –why a document is relevant for a user KB recommends items based on a certain knowledge structure (ontology) KB requires knowledge engineering Goal: to build a automated (or semi- automated) ontology engine based on the ‘ Self- organizing tree ’ algorithm ( Khan and Luo, 2002)

Bieber et al., NJIT © Slide 30 Automatic User Profile Extractor Records a user ’ s recent documents The user ’ s profile is represented by a set of keywords from those documents As the user visits more documents, his/her profile is updated

Bieber et al., NJIT © Slide 31 Outline Motivation Illustrations Structural Relationships 3 Types of Integration Personalizing Links Federated Metasearch Recommendations Contributions and Vision Call for Collaboration Project Details

Bieber et al., NJIT © Slide 32 Contributions straightforward, sustainable approach for integrating documents and services –Lightweight integration through linking combining structural links with content-based links next-generation collaborative filtering federated metasearch next-generation recommendations integrating traditional and digital libraries widespread dissemination

Bieber et al., NJIT © Slide 33 Vision A nationwide virtual library to and from –your local library –other physical libraries –digital libraries incorporating –traditional library resources –digital library resources Bringing relevant resources directly to the user!

Bieber et al., NJIT © Slide 34 Looking for Collaboration Additional document systems, digital library collections, services and glossaries to integrate Physical library partners Digital library partners Web services to integrate Other suggestions welcome!

Bieber et al., NJIT © Slide 35 Additional Slides

Bieber et al., NJIT © Slide 36 Outline Motivation Illustrations Structural Relationships 3 Types of Integration Personalizing Links Federated Metasearch Contributions and Vision Call for Collaboration Project Details

Bieber et al., NJIT © Slide 37 Digital Library Service Integration NSF National Science Digital Library Award #DUE ; Tasks Develop Integration Infrastructure Integrate digital library collections and services Collaborative filtering Evaluation Partners NASA GSFC Library AskNSDL Earth Science Picture of the Day System Atmospheric Visualization Collection Metis Workflow (University of Colorado, Boulder) University of Arizona

Bieber et al., NJIT © Slide 38 IntLib Institute of Museum and Library Services Award #LG ; Tasks - to integrate: EBSCOhost Gale’s Discovery Collection ProQuest On-line Catalog Systems New Jersey Digital Highway The IntLib Project focuses on integrating the resources of public libraries primarily (and university libraries secondarily) with digital libraries. Additional Partner Newark Public Library

Bieber et al., NJIT © Slide 39 IntegraL NSF National Science Digital Library Award #DUE ; Tasks - to integrate: ACM Digital Library Elsevier Science Direct (permission pending) NJIT Electronic Thesis collection JerseyClicks StartingPoint Digital Library for Earth Science Education (DLESE) NSDL Core Integration features an on-line bookstore The IntegraL project focuses on integrating specific resources of college libraries with those of the NSDL. Additional Partners Cumberland C.C. Ramapo College Olin College of Engineering

Bieber et al., NJIT © Slide 40 General Recommendation Engine NSF National Science Digital Library Award; Tasks - to integrate: Collaborative filtering recommendations Content-based recommendations Knowledge-based recommendations Partners Digital Library for Earth Science Education (DLESE) Eisenhower National Clearinghouse for Mathematics and Science Education Computer Vision Education Digital Library

Bieber et al., NJIT © Slide 41 Which additional services would you suggest?

Bieber et al., NJIT © Slide 42 Ph.D. Research Opportunities Integration through linking Customizing services to work in many domains Service chaining: creating new services by chaining existing ones –e.g., translate and summarize documents Collaborative Filtering for digital libraries Lexical analysis –Automatic summarization of returned documents –including maintaining multiple glossaries/thesauri Full Virtual Community support –analysis, tools, processes, evaluation

Bieber et al., NJIT © Slide 43 User’s Web Browser AskNSDL Schema Mapper AVC Schema Mapper NSSDC Schema Mapper CI Search Service Schema Mapper Service Schema Mapper (i) AskNSDL AVC N’l Space Science Data Center NSDL CI Search Service Service (i) ME Link Mapping Engine ME Broker ME Desktop Metainformation Engine ME Lexical Analysis existing system or Web service uses Java, XML, XPath, etc. Internal Architecture