Semantic (web) activity at Elsevier Marc Krellenstein VP, Search and Discovery Elsevier October 27, 2004

Presentation transcript:

Thesaurus use at Elsevier
Elsevier traditionally uses proprietary and standard thesauri for:
– Indexing (tagging) articles, books and other materials
– Browsing thesaurus-indexed content
– Expanding searches against specialized content
  Overall, a net benefit, but not huge
– Limiting a search by category
– Clustering documents by category
  Better than limiting search up front…data-driven
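As a rough illustration of "expanding searches" with a thesaurus, the sketch below adds synonyms and narrower terms to a query before matching. The thesaurus entries, the function names and the naive substring matching are all assumptions made for this example; they do not describe Elsevier's systems.

```python
# Hypothetical toy thesaurus: preferred term -> synonyms and narrower terms.
THESAURUS = {
    "neoplasm": {"synonyms": ["tumor", "tumour"], "narrower": ["leukemia"]},
    "leukemia": {"synonyms": ["leukaemia"], "narrower": []},
}


def expand_query(term, thesaurus, include_narrower=True):
    """Return the term plus its thesaurus synonyms (and optionally narrower terms)."""
    key = term.lower()
    entry = thesaurus.get(key)
    if entry is None:
        # Unknown term: search for it unchanged.
        return [key]
    expanded = [key] + list(entry["synonyms"])
    if include_narrower:
        expanded += entry["narrower"]
    return expanded


def search(documents, terms):
    """Return ids of documents containing any term (naive substring match)."""
    return [doc_id for doc_id, text in documents.items()
            if any(t in text.lower() for t in terms)]
```

Expansion of this kind mainly raises recall, which fits the slide's assessment of a net but modest benefit.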

Thesaurus use at Elsevier
Elsevier does not currently use thesauri for concept searching
– Lack of demonstrated superiority to date over current best-practice full-text search

Thesaurus use at Elsevier
New thesaurus requirements and uses:
– Integrated search of proprietary, public and/or local user content using multiple thesauri
– Integrating chemical structure info with text documents
– Integrating databases with diverse schemas
– Supporting text mining
– Other uses requested by our customers (e.g., extensibility for local content)
– Improved thesaurus navigation
– Improved search results

Approaches for new thesaurus uses
Creating RDF-based intermediary ontology to map diverse thesauri
– Support multiple relationships
– Extensible by customers
– Improved performance, scalability
Experimenting with search options
– Improving precision as well as recall
Experimenting with visualization techniques (e.g., DOPE browser)
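One way to picture an intermediary ontology is as a hub of shared concepts to which each thesaurus term is linked by a typed relationship. The sketch below models those links as plain Python triples rather than real RDF; the prefixes (`thesA:`, `thesB:`, `hub:`, `local:`), the relation names and the terms are all invented for illustration.

```python
from collections import defaultdict

# (thesaurus term, relation, hub concept) triples; every identifier is made up.
MAPPINGS = [
    ("thesA:Leukemia", "equivalent", "hub:Leukemia"),
    ("thesB:Leukaemia", "equivalent", "hub:Leukemia"),
    ("thesB:AcuteLeukaemia", "narrowerThan", "hub:Leukemia"),
]


def build_index(triples):
    """Index the mapping triples in both directions (term -> hub, hub -> terms)."""
    to_hub = {}
    from_hub = defaultdict(list)
    for term, relation, hub in triples:
        to_hub[term] = (relation, hub)
        from_hub[hub].append((relation, term))
    return to_hub, from_hub


def translate(term, target_prefix, to_hub, from_hub):
    """Map a term to another thesaurus's terms by going through its hub concept."""
    if term not in to_hub:
        return []
    _, hub = to_hub[term]
    return [(relation, other) for relation, other in from_hub[hub]
            if other.startswith(target_prefix) and other != term]


# A customer extension is just one more triple pointing at an existing hub concept.
EXTENDED = MAPPINGS + [("local:LeukemiaCohort", "narrowerThan", "hub:Leukemia")]
```

With a hub, each thesaurus needs only one set of links to the shared concepts instead of a separate mapping to every other thesaurus, which is one plausible reading of the scalability point.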

Text mining at Elsevier
Consider text mining a now-capable technology that will be essential for managing information overload and providing new insights
Actively investigating uses and developing applications
Can provide both substantive and ‘meta-research’ insights
– Trends over time, distribution by author or institution, etc.
View RDF as the eventual storage medium for extracted facts
– Performance, maintainability, inferencing
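To make the idea of storing extracted facts as subject-predicate-object statements concrete, here is a minimal sketch that keeps facts as plain Python tuples and derives a "trend over time" from them. It is not a real RDF store; the predicates (`publishedIn`, `mentions`), the article ids and the data are hypothetical.

```python
from collections import Counter

# Hypothetical extracted facts as (subject, predicate, object) triples.
FACTS = [
    ("article:1", "publishedIn", "2002"),
    ("article:1", "mentions", "protease"),
    ("article:2", "publishedIn", "2003"),
    ("article:2", "mentions", "protease"),
    ("article:3", "publishedIn", "2003"),
    ("article:3", "mentions", "protease"),
    ("article:4", "publishedIn", "2003"),
    ("article:4", "mentions", "kinase"),
]


def match(facts, subject=None, predicate=None, obj=None):
    """Return triples matching the given pattern; None acts as a wildcard."""
    return [f for f in facts
            if (subject is None or f[0] == subject)
            and (predicate is None or f[1] == predicate)
            and (obj is None or f[2] == obj)]


def trend(facts, topic):
    """Count articles mentioning a topic, grouped by publication year."""
    articles = {s for s, _, _ in match(facts, predicate="mentions", obj=topic)}
    years = Counter(o for s, _, o in match(facts, predicate="publishedIn")
                    if s in articles)
    return dict(sorted(years.items()))
```

The same pattern-matching step could group by author or institution instead of year, given triples that record those facts.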

To organisms?

Author teams in HIV research?

Indirect links from leukemia to Alzheimer’s via enzymes
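An indirect link of this kind can be thought of as a path in a graph of co-mentioned entities. The sketch below finds a shortest such path with breadth-first search; the graph and the placeholder node names `enzyme_A` and `enzyme_B` are invented for illustration and are not taken from the slide.

```python
from collections import deque

# Hypothetical co-occurrence links between entities found by text mining.
LINKS = {
    "leukemia": ["enzyme_A", "enzyme_B"],
    "enzyme_A": ["leukemia", "Alzheimer's"],
    "enzyme_B": ["leukemia"],
    "Alzheimer's": ["enzyme_A"],
}


def shortest_path(graph, start, goal):
    """Breadth-first search; returns the shortest list of nodes from start to goal."""
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        node = path[-1]
        if node == goal:
            return path
        for neighbor in graph.get(node, []):
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append(path + [neighbor])
    return None  # no connection found
```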

Red – Product
Pink – Reactant
Green – Reagent
Brown – Solvent
…