© 2012 Deep Web Technologies, Inc. Swetswise Searcher Powered by Explorit Research Accelerator By Abe Lederman President and CTO Copenhagen, Denmark 11.

Slides:



Advertisements
Similar presentations
Open Source Intelligence: Presented by Abe Lederman, President and CTO Deep Web Technologies, LLC IOP 06 Sheraton Premier, Tysons Corner, Virginia January.
Advertisements

Resource Navigator Discovering, delivering and managing your information resources.
IAEA International Atomic Energy Agency ICSTI 2013 Annual Members’ Meeting March 2013.
© 2009 Deep Web Technologies, Inc. Federated Search: A Tool for Knowledge Discovery iGroup Online Education Conference Presented by Abe Lederman Founder.
Not All Federated Search Engines are Created Equal Abe Lederman, President and CTO Deep Web Technologies, Inc. Next Generation Library Technologies, May.
Information Retrieval in Practice
1 Do More Searching in Less Time Fall Term 2010 Helen B. Josephine
Google Tools and your Library - the Possibilities are Exponential Google CSE Google CSE Google Scholar Google Scholar Google My Library Google.
Overview of Search Engines
Web Searching. Web Search Engine A web search engine is designed to search for information on the World Wide Web and FTP servers The search results are.
Federated Search: True Enterprise Search Abe Lederman, President and CTO Deep Web Technologies Search Engine Meeting – April 28-29, 2008.
Global Discovery: Turning Vision into Reality Presented by Abe Lederman, President and CTO Deep Web Technologies, LLC Symposium: Global Discovery on the.
Buyer Advertising & UMass Boston Navigating the Changing Landscape of Recruitment Communications Presented to: November 18, 2014.
Abe Lederman, President and CTO Deep Web Technologies 2008 STIP Working Meeting, April 23, 2008 Federated Search: The Technology For Making Global Discovery.
Divide and Conquer: Challenges in Scaling Federated Search Presented by Abe Lederman, President and CTO Deep Web Technologies, LLC SearchEngine Meeting.
HOW SEARCH ENGINE WORKS. Aasim Bashir.. What is a Search Engine? Search engine: It is a website dedicated to search other websites and there contents.
African Public Libraries Summit, September 19-21, 2012 Ahmed Al-Awah Head, Information Systems & Technical Services Library and Information Management.
Databases and Library Catalogs Global Index Medicus/Global Health Library PubMed Source Bibliographic Database: International Health and Disability.
Alberto Isoardo Seminario autunnale CIBER Novembre 2007 ROMA.
© 2011 Deep Web Technologies, Inc. By Abe Lederman President and CTO June 26, 2011 Understanding Differences Between Federated Search and Discovery Services.
© 2012 Deep Web Technologies, Inc. 03 December 2012 By Abe Lederman, CEO Deep Web Technologies Show and Tell Presentation to.
LIBRARY RESOURCE DISCOVERY PRODUCTS: COMMERCIAL AND OPEN SOURCE OPTIONS Web Manager’s Academy Marshall Breeding Director for Innovative Technology and.
Five Years InterLab ’07 Los Alamos, New Mexico October 1–3, 2007 Valerie S. Allen, MSLIS U.S. Department of Energy Office of Scientific and.
Science Research: Journey to 10,000 Sources Presented by: Abe Lederman, President and Founder Deep Web Technologies, Inc. Special Libraries Association.
Using xSearch Provided by Deep Web Technologies and Stanford University Libraries and Academic Information.
© 2010 Deep Web Technologies, Inc. By Abe Lederman President and CTO Explorit Federated Search.
© 2009 Deep Web Technologies, Inc. Federated Search Presentation Explorit Research Accelerator Focus Deep. Get Results.
© 2013 Deep Web Technologies, Inc. Abe Lederman President and CTO Deep Web Technologies ANKOS 2013 Annual Meeting April 26, 2013 Federated Search: A Discovery.
Web Scale Discovery Service Vs Federated Search NIKESH NARAYANAN
Applying Grid Computing Research to Commercial IR Applications Presented by Carl Sylvia, SBIR Project Manager Deep Web Technologies, LLC GGF-14 – June.
Not All Federated Searches are Created Equal Abe Lederman, President and CTO Deep Web Technologies Thomson Scientific Government Event, April 10, 2008.
© 2012 Deep Web Technologies, Inc. SwetsWise Medical Searcher Powered by Explorit Research Accelerator By Abe Lederman President and CTO July 15, 2012.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
NCSU Libraries Kristin Antelman NCSU Libraries June 24, 2006.
ORBIS & PORTALS E-Journal Workshop Michael Markwith, TDNet Inc. Reed College Library May 9, 2002.
Abe Lederman, President and CTO Deep Web Technologies, Inc. ScienceEducation.gov Meeting National Academy of Sciences, March 18, 2009 A Look at the Technology.
29-30 October, 2006, Estonia 1 IST4Balt Information analysis using social bookmarking and other tools IST4Balt Information analysis using social bookmarking.
DISCOVERY PRODUCTS AND SERVICES: Introduction and current trends Marshall Breeding Director for Innovative Technology and Research Vanderbilt University.
© 2009 Deep Web Technologies, Inc. Federated Search for Academic Libraries Explorit Research Accelerator Focus Deep. Get Results.
EBSCO Discovery Service. Discovery Background –Quickly –By small development teams –Using rudimentary relevance algorithms built around searching article.
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
Sharon M. Jordan Assistant Director for Program Integration U.S. DOE Office of Scientific & Technical Information Vantage Point: Government R&D Results.
© 2009 Deep Web Technologies, Inc. Federated Search for Government Agencies Explorit Research Accelerator Focus Deep. Get Results.
Uniting Global Information with Federated Search Abe Lederman, President, Deep Web Technologies Dr. Rosanne Hessmiller, CEO, Ferguson-Lynch Presentation.
1 OSTI - Accelerating Science Information Dr. Walter L. Warnick Director U.S. Department of Energy Office of Scientific and Technical Information Federal.
Search Engines By: Faruq Hasan.
COPYRIGHT © 2007 MUSEGLOBAL, INC. ALL RIGHTS RESERVED PAGE 1 Turn Content into Insight From Silos to Solutions: How Advanced Content Integration Creates.
OARE Module 4: Summon Searching. What is Summon? Summon is a Google-like search engine that provides fast, relevancy-ranked results: Enter the search.
Deep Web Technologies Presentation to Gale for PowerSearchPlus Abe Lederman, President and Founder Maxine Swisa, Vice President of Engineering May 18,
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
Federated Search: The Good and the Bad Abe Lederman, President and CTO Deep Web Technologies, Inc. APLA May 9, 2008.
Dr. Walter L. Warnick Director Office of Scientific and Technical Information Office of Science ARPA-E June 24, 2010 Innovative Web Resources Can Advance.
© 2010 Deep Web Technologies, Inc. Taking the Library Back from Google Abe Lederman, President and CTO Deep Web Technologies May 12, 2010.
Leveraging Publisher’s Search Engines to Deliver Relevant Results to Users Presented by Abe Lederman, President and CTO Deep Web Technologies, LLC 28 th.
Advancing Science: OSTI’s Current and Future Search Strategies Jeff Given IT Operations Manager Computer Protection Program Manager Office of Scientific.
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
Behrooz ChitsazLorrie Apple Johnson Microsoft ResearchU.S. Department of Energy.
EVERY CONNECTION has a starting point. A compelling end user environment: OCLC’s view Marianne Klomp Product Manager OCLC EUSIDIC 2008 London, UK.
Saving Time with Federated Search Abe Lederman, President, Deep Web Technologies Terry Colby, Director of Sales, Deep Web Technologies Websearch University,
Taking the Library Back from Google Abe Lederman, President and CTO October 18-20, 2007.
WebScan: Implementing QueryServer 2.0 Karl Geiger, Amgen Inc. BRS NA UG August 1999.
DISCOVERY SYSTEMS: SOLUTIONS A USER COULD LOVE OVERVIEW OF DISCOVERY SYSTEMS Marshall Breeding Director for Innovative Technology and Research Vanderbilt.
Week-6 (Lecture-1) Publishing and Browsing the Web: Publishing: 1. upload the following items on the web Google documents Spreadsheets Presentations drawings.
Turn Content into Insight
By Abe Lederman President and CTO June 26, 2011
Uniting Global Information with Federated Search
Uniting Global Information with Federated Search
Summon - HINARI Search (Basic Course: Module 7 Part A)
Access to Quality, Deep Web Research Content
Presentation transcript:

© 2012 Deep Web Technologies, Inc. Swetswise Searcher Powered by Explorit Research Accelerator By Abe Lederman President and CTO Copenhagen, Denmark 11 June 2012

© 2012 Deep Web Technologies, Inc. 2 About Deep Web Technologies... Founded by Abe Lederman in 2002 – A co-founder of Verity, acquired by Autonomy – BS & MS Degrees in Computer Science from MIT – 25 years experience in Information Retrieval 20 person company based in Santa Fe, New Mexico Over $5M in DOE SBIR Grants ( ) Pioneer/trailblazer in federated search

© 2012 Deep Web Technologies, Inc. 3 Customers Include... Academic: Stanford University George Mason University Texas Medical Center University College of Cork Tennessee Community College Consortia Public Portals: WorldWideScience.org Science.gov Biznar Mednar ScienceResearch.com Government: Defense Technical Info Center (DTIC) Office of Sci. & Tech. Info (DOE-OSTI) UNECA European Space Agency Corporate: Boeing BASF Intel HP P&G

© 2012 Deep Web Technologies, Inc. 4 What is the Deep Web? The Deep Web is a collection of internet information sources that are generally not accessible to web spiders or crawlers and can not, therefore, be indexed for search by popular search engines such as Google, Yahoo! or Bing (the Surface Web). It is estimated that there is more than 500 times more content in the Deep Web than the Surface Web.

© 2012 Deep Web Technologies, Inc. 5 What is “Federated Search”? “Federated Search is an application or service that allows users to submit a real-time search in parallel to multiple, distributed information sources and retrieve aggregated, ranked and de-duplicated results.”

© 2012 Deep Web Technologies, Inc. 6 Public Web Sources Public Web Sources One Search, Many Sources Blogs eBooks Enter Your Search… Begin Search Internal Databases Internal Databases Journals Wikis Subscription Sources Subscription Sources

© 2012 Deep Web Technologies, Inc. 7 Why Federated Search? 4 Big Reasons… 1. Provides greater efficiency than searching sources one by one 2. Returns the most current information because sources are searched in real-time 3. Eliminates learning disparate publisher interfaces 4. Simplifies discovery of the most relevant results

© 2012 Deep Web Technologies, Inc. 8 Best Science-Focused Engines 5 of 9 created by DWT Science.gov WorldWideScience.org ScienceResearch.com ScienceAccelerator Scitopia.org

© 2012 Deep Web Technologies, Inc. 9 Science.gov (2002)

© 2012 Deep Web Technologies, Inc. 10 WorldWideScience.org (2007)

© 2012 Deep Web Technologies, Inc. 11 Science Accelerator (2006)

© 2012 Deep Web Technologies, Inc. 12 ScienceResearch.com (2005)

© 2012 Deep Web Technologies, Inc. 13 Scitopia.org ( )

© 2012 Deep Web Technologies, Inc. 14 Presentation available at: Presentation available at:

© 2012 Deep Web Technologies, Inc. 15 It is too slow Connectors break Brings back too few results from each source Brings back too many results Unable to rank results well (meta- data differences, lack of info) Federated Search Has Gotten a Bad Reputation

© 2012 Deep Web Technologies, Inc. SW Searcher vs. Discovery Services SwetsWise SearcherDiscovery Service Real-time search of multiple collections Multiple collections are indexed to one database Initial results returned in 3-4 seconds – Remaining results incrementally returned in up to 30 seconds Results returned within 1-3 seconds New results are available as soon as on publisher’s site New results are available only after re-indexing Searches full text where possible Mostly indexes just metadata Search any collection regardless of publisher Search only collections the service subscribes to

© 2012 Deep Web Technologies, Inc. 17 Drawbacks of Discovery Services Lack of transparency of what’s in Service Incomplete coverage of publisher content Lag between when content appears on publisher site and when available on Discovery Service Normalized metadata loses content source-specific metadata Content in Service limited by relationships, content of general interest

© 2012 Deep Web Technologies, Inc. 18 Landscape is Not So Clear Summon (ProQuest) – Discovery Service EDS (EBSCO) –Discovery Service + Federated Search WorldCat Local (OCLC) –Discovery Service + Federated Search Primo (Ex Libris) –Discovery Service + Federated Search Encore Synergy (Innovative Interfaces) –Limited Discovery Service + Federated Search Explorit (Deep Web Technologies) –Federated Search

© 2012 Deep Web Technologies, Inc. 19 When Should You Choose Federated Search? Access to up-to-date information is important. You want control of your sources. You want to search internal/non- mainstream sources Your research is specialized (ex. Medical and legal) You have a wide range of subscribed content (ex. EBSCO and ProQuest)

© 2012 Deep Web Technologies, Inc. 20 Partners since January 2010

© 2012 Deep Web Technologies, Inc. 21 Major Advantages of SwetsWise Searcher Rich, easy-to-use interface Incremental display of results Sophisticated connector technology Retrieve results or more per source Relevance ranking Smart clustering Alerts and Search Builder Metrics

© 2012 Deep Web Technologies, Inc. 22 Easy-to-use Interface Simple Search Box – One-Search, “Google-like” box – Can be embedded in your home page, blog or intranet.

© 2012 Deep Web Technologies, Inc. 23 Advanced Search Page – Unlimited categories (sources can be in multiple categories) – Select sources to search – One or Two columns – Fielded Searching – Boolean Searching AND, OR, NOT

© 2012 Deep Web Technologies, Inc. 24 Incremental Results

© 2012 Deep Web Technologies, Inc. 25 Connectors: Think “Connections” Connectors make it possible to talk to other data sources –Each source is unique so connectors “normalize” a query –Submit proper authentication to sources –Extract the right results –Parse results to display the data

© 2012 Deep Web Technologies, Inc. 26 Connector Monitoring Proactively monitor connectors Monitor: source health, speed, responsiveness and errors Evaluated by dedicated software maintenance engineers Generally errors are discovered by our team before users ever notice a problem

© 2012 Deep Web Technologies, Inc. 27 Relevance Ranking Occurance of search terms within titles & snippets Assigning weight to sources More current reults are assigned greater weight Read: “Ranking: The Secret Sauce for Searching the Deep Web”“Ranking: The Secret Sauce for Searching the Deep Web”

© 2012 Deep Web Technologies, Inc. 28 Clustering Real-time semantic analysis of results creates clusters on-the-fly. Discover relationships behind the results, not just “keywords.” Read: “Clusters That Think”“Clusters That Think”

© 2012 Deep Web Technologies, Inc. 29 Alerts – Delivery online or via – Daily, Weekly, Monthly – Pick and choose your sources – Export to RSS reader – Maintain database of past results Alerts – Delivery online or via – Daily, Weekly, Monthly – Pick and choose your sources – Export to RSS reader – Maintain database of past results

© 2012 Deep Web Technologies, Inc. 30 Search Builder – Create search pages easily – Choose collections and search fields – Integrates with Course Management Software – Embed search box using built-in widget

© 2012 Deep Web Technologies, Inc. 31 SwetsWise Searcher Metrics Graphics-based or tabular Single day (hourly breakdown) or entire month Downloadable to spreadsheet Reports include: – Number of queries run – Number of results retrieved per source – Average time to retrieve results from a source – Average rank of results retrieved per source – Timeouts/errors by source – Searches run (query strings) – Clickthrough stats

© 2012 Deep Web Technologies, Inc. 32

© 2012 Deep Web Technologies, Inc. Deep Web Technologies hosts the application Client hosts the application Technical support through Deep Web Technologies Client IT staff must support application Deep Web Technologies can access application at any time Deep Web Technologies has limited or no access to the application Deep Web Technologies monitors and maintains connectors Deep Web Technologies monitors and maintains accessible connectors Limited or no ability to access internal sources Can access internal sources Hosted vs. Installed Solutions Hosted Installed

© 2012 Deep Web Technologies, Inc. 34 Multilingual WorldWideScience.org

© 2012 Deep Web Technologies, Inc. 35 WorldWideScience.org is an Excellent Candidate for Multilingual Search A global gateway to international science databases and portals All content is from national governments or vetted by national governments Developed in partnership with the DOE Office of Scientific and Technical Information (OSTI), WWS Alliance and Microsoft Research One-stop searching Includes databases from China, Japan, Korea, Germany, and other non-English countries

© 2012 Deep Web Technologies, Inc. 36 How Multilingual Federated Search Works Ranked results translated by Microsoft to user’s language Results returned to user EXPLORIT Microsoft Translator German Chinese Russian Query in user’s language Ranked results in user’s language Query to be translated for each source Query in source’s language Foreign language search engines Results in source’s language Ranking

© 2012 Deep Web Technologies, Inc. 37

© 2012 Deep Web Technologies, Inc. 38 Coming in the Fall Visualization Full-Faceted Navigation Mendeley Integration Document Type and Document Format Clusters Full Text Filter

© 2012 Deep Web Technologies, Inc. 39 Visualization Using our clustering technology, results visualization allows users to see relationships between topics easily.

© 2012 Deep Web Technologies, Inc. 40 Mendeley

© 2012 Deep Web Technologies, Inc. 41 Document Type and Document Format Clusters

© 2012 Deep Web Technologies, Inc. 42 Full Text Filter Access Full Text!

© 2012 Deep Web Technologies, Inc. 43 Future - Mobile Searching

© 2012 Deep Web Technologies, Inc. 44 Thank you! Abe Lederman