Searching uPortal with a third party Search Engine Katya Sadovsky University of California, Irvine Administrative Computing Services



Agenda
- Our goals
- Our current setup
- Built-in vs. Third Party Search Engine
- Dynamic vs. Static Content
- Issues in combining uPortal with a search engine
- Demonstration
- Questions & Answers

Our goals
- Use the portal as a “gateway” to information
- Allow users to search for pertinent portal content
- Present users with integrated search results (portal and non-portal content)
- Aid the search engine in weighing the results (meaningful page title, metadata, etc.)

Our current setup
- uPortal
- Verity Ultraseek Search Engine (formerly Inktomi)
- Tomcat 4.0.6

Built-in vs. Third Party Search Engine
- Pros of using a built-in search engine:
  - Ensures generation of correct links to content
  - Presents users with customized (user-specific) result sets
  - Can fully utilize channel metadata
  - Employs the portal’s authorization infrastructure

Built-in vs. Third Party Search Engine
- Pros of using a third party search engine:
  - Well-tested, mature functionality
  - Well-developed dictionary and thesaurus
  - Ability to search content beyond uPortal and present users with integrated search results
  - URL filtering capabilities
  - Useful but optional: nice administrative GUI, quick link definitions

Dynamic vs. Static Content
- uPortal generates dynamic content that depends on the user's preferences, security level, browser and operating system
- Most search engines are designed to work with static content:
  - Search engines index content periodically and use a cached/stored index to present users with search results
  - Search results are not user-specific
  - Only public content is indexed

Issues/Areas of difficulty
- User Agent setting
- Filtering out certain URLs
- Deciding what to search:
  - Search index/start page
  - Searchable vs. non-searchable content
- Generating links to channels using:
  - global (published) vs. instance (subscribed) ID
  - functional names
- Page title used in search results

User Agent
- Issues:
  - uPortal needs to know the mapping between a user agent and a MIME type/output type
  - When the user agent is not recognized, uPortal displays a screen that lets users choose a profile to use
- Solutions:
  - If you know the user agent reported by the search engine, add a mapping to the UP_USER_UA_MAP table
  - Choose a search engine that allows you to specify a user agent
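The user-agent lookup described above can be sketched in Python. This is only an illustration, not uPortal code: the in-memory dictionary stands in for the UP_USER_UA_MAP table, and the substring matching, the `ExampleCrawler` agent string, and the profile names are all assumptions.

```python
from typing import Optional

# Hypothetical stand-in for rows of UP_USER_UA_MAP:
# user-agent fragment -> profile name. All entries are invented.
UA_PROFILE_MAP = {
    "Mozilla": "html-browser",
    "ExampleCrawler": "html-browser",  # assumed search-engine user agent
}


def resolve_profile(user_agent: str, ua_map: dict) -> Optional[str]:
    """Return the profile mapped to the first matching user-agent fragment.

    None means the agent is unrecognized, which is the case where the
    portal would show the profile-selection screen instead of content.
    """
    for fragment, profile in ua_map.items():
        if fragment in user_agent:
            return profile
    return None
```

With such a mapping in place, a crawler that identifies itself as `ExampleCrawler/1.0` resolves to a profile and receives indexable content, while an unmapped agent gets `None` and would land on the profile chooser.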

Example: setting a search engine user agent

Filtering out certain URLs
- Issues:
  - A search engine may follow a link that includes a channel option or command
  - uPortal URL tags:
    - Dynamically generated for each URL hit
    - Tags other than 'idempotent' make search results meaningless
    - While indexing content, a search engine may enter a loop, referencing the same page with different tags

Filtering out certain URLs (cont’d)
- Solutions:
  - Acquire a search engine that allows URL filtering and filter out all “offending” URLs
  - If available with the search engine, use advanced URL “de-duping”
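As a rough illustration of URL filtering and de-duping, the Python sketch below keeps only URLs whose tag is 'idempotent', drops URLs carrying channel commands, and removes exact duplicates. The URL shape (`/tag.<value>.render...`), the blocked parameter names, and the host `portal.example.edu` are assumptions made for the example; a real search engine would express the same rules in its own filter syntax.

```python
import re

# Assumed URL shape: ".../tag.<value>.render.userLayoutRootNode.uP?..."
TAG_PATTERN = re.compile(r"/tag\.([^./]+)\.")

# Hypothetical query parameters treated as channel commands or options.
BLOCKED_PARAMS = ("uP_remove_target", "uP_edit_target", "uP_help_target")


def should_index(url: str) -> bool:
    """Reject URLs with a non-idempotent tag or a channel command."""
    match = TAG_PATTERN.search(url)
    if match and match.group(1) != "idempotent":
        return False
    return not any(param + "=" in url for param in BLOCKED_PARAMS)


def filter_urls(urls):
    """Return indexable URLs in their original order, without duplicates."""
    seen = set()
    kept = []
    for url in urls:
        if should_index(url) and url not in seen:
            seen.add(url)
            kept.append(url)
    return kept
```

Rejecting every dynamically generated tag also breaks the crawl loop described earlier, because the same page can no longer reappear under a new tag value.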

Example: Filtering out certain URLs

Example: using URL filters

What to search: index/start page
- Issues:
  - A user layout may not be used as a starting point for a search engine: a typical layout doesn't contain all the channels
  - Need a page with 'idempotent' links to all the searchable channels
- Solutions:
  - Searchable Channel Index channel

What to search: searchable vs. non-searchable content
- Issue:
  - Not all channels need to be included in the search
- Solution:
  - Added a 'searchable' attribute to all the channels
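The idea of a start page that lists only searchable channels can be sketched in Python as follows. This is not the CSearchRegistry implementation: the channel records, the `BASE_URL` link format, and the function name are hypothetical, chosen only to show filtering on a 'searchable' attribute and emitting 'idempotent' links.

```python
from html import escape

# Hypothetical channel records; only the 'searchable' flag comes from the slides.
CHANNELS = [
    {"fname": "campus-news", "title": "Campus News", "searchable": True},
    {"fname": "my-bookmarks", "title": "My Bookmarks", "searchable": False},
    {"fname": "hr-forms", "title": "HR Forms", "searchable": True},
]

# Assumed shape of an 'idempotent' link to a channel.
BASE_URL = "tag.idempotent.render.userLayoutRootNode.uP?uP_fname="


def build_search_index(channels) -> str:
    """Build an HTML list linking to every channel marked searchable."""
    items = [
        '<li><a href="{}{}">{}</a></li>'.format(
            BASE_URL, escape(ch["fname"], quote=True), escape(ch["title"])
        )
        for ch in channels
        if ch.get("searchable")
    ]
    return "<ul>\n" + "\n".join(items) + "\n</ul>"
```

Pointing the crawler at such a page gives it one stable entry point that does not depend on any individual user's layout.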

CSearchRegistry channel

CSearchRegistry: stylesheet

Generating links to channels
- Problem: channel instance (subscribed) IDs vary from user to user, so the search result links are inconsistent
- Solutions: link to channels using
  - global (published) IDs (involves code changes)
  - functional names (fname): new functionality, available in CVS (Concurrent Versions System)

Linking to channels via their published IDs: implementation plan
- Modify org/jasig/portal/UserInstance.java to recognize that the user is asking for a published channel that may not be in the user’s layout
- Create a temporary hidden folder in the user’s layout to store “temporary” channels (make sure to delete this folder before the layout is saved to the database)
- Add XML channel definitions to this hidden folder
- Proceed to render as usual
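The plan above can be illustrated with a small Python model. It is a sketch of the control flow only, not the UserInstance.java change: layouts are plain dictionaries, and `TEMP_FOLDER`, `layout_for_request`, and `layout_for_saving` are hypothetical names.

```python
import copy

TEMP_FOLDER = "temp-hidden"  # hypothetical name for the hidden folder


def layout_for_request(layout, published, channel_id):
    """Return a working layout in which channel_id can be rendered.

    If the channel is published but absent from the user's layout, it is
    placed in a temporary hidden folder; the stored layout is not modified.
    """
    working = copy.deepcopy(layout)
    in_layout = any(channel_id in ids for ids in working["folders"].values())
    if not in_layout and channel_id in published:
        working["folders"].setdefault(TEMP_FOLDER, []).append(channel_id)
    return working


def layout_for_saving(working):
    """Return a copy of the layout with the temporary folder removed."""
    saved = copy.deepcopy(working)
    saved["folders"].pop(TEMP_FOLDER, None)
    return saved
```

Keeping the removal in a single save-time function mirrors the warning in the plan: the hidden folder exists only for rendering and must never reach the database.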

Page titles used in search results
- Issues:
  - Out of the box, uPortal has a statically set page title (no matter what channel is viewed)
  - Search engines generally use page titles (or other metadata) for:
    - search result titles
    - result ranking
    - de-duping
  - Users have to be trained to enter meaningful page titles when creating documents/channels (e.g. do not start each page title with UCIrvine)

Page titles used in search results
- Solution: when channels are rendered in 'focused' or 'detached' mode, add the channel title to the default page title (following is a fragment of webpages/stylesheets/org/jasig/portal/layout/tab-column/nested-tables/nested-tables.xsl):
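The title rule can be sketched in Python (this is an illustration of the logic, not the XSL from nested-tables.xsl). The default title string, the separator, and the ordering of the two parts are assumptions.

```python
from typing import Optional

DEFAULT_TITLE = "Example Portal"  # hypothetical default page title


def page_title(mode: str, channel_title: Optional[str] = None,
               default_title: str = DEFAULT_TITLE) -> str:
    """Append the channel title in 'focused' or 'detached' mode.

    In any other mode, or when no channel title is available, the
    default page title is returned unchanged.
    """
    if mode in ("focused", "detached") and channel_title:
        return f"{default_title}: {channel_title}"
    return default_title
```

Because each focused or detached channel then gets a distinct title, the search engine has something meaningful to show in result lists and to use for ranking and de-duping.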

Example: page titles

Conclusions
- There are tradeoffs when using either a built-in or a third-party search engine
- We have yet to address the following issues:
  - searching restricted content
  - creating META tags to help the search engine with content ranking
- Overall, our portal project could not succeed without a search function

Links
- UC Irvine’s uPortal installation (SNAP):
- This presentation:

Demo

Questions?