Presentation Title Presentation Subtitle and/or Conference Name Place Day Month Year First Name Last Name Job Title.

Slides:



Advertisements
Similar presentations
Using EBSCOs Search Box Builder Tool Tutorial. Would you like to promote your EBSCOhost resources by adding an easy-to-use search box to your website?
Advertisements

PATENTSCOPE What’s new?
The PATENTSCOPE search system 2013 Retrospective Cyberworld December 2013 Sandrine Ammann Marketing & Communications Officer.
Complex queries in the PATENTSCOPE search system Cyberspace September 2013 Sandrine Ammann Marketing & Communications Officer.
Translation tools Cyberworld June 2014 Sandrine Ammann Marketing & Communications Officer.
Complex queries Cyberworld November 2014 Sandrine Ammann Marketing & Communications Officer.
PATENTSCOPE Result list & translation tools March 2015 Sandrine Ammann Marketing & Communications Officer.
Module 1 Dictionary skills Part 1
The PATENTSCOPE search system Cyberspace October 2013 Sandrine Ammann Marketing & Communications Officer.
Advanced search PATENTSCOPE search system Cyberworld February 2015 Sandrine Ammann Marketing & Communications Officer.
Search Strategies Online Search Techniques. Universal Search Techniques Precision- getting results that are relevant, “on topic.” Recall- getting all.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
IPC & the PATENTSCOPE search system Cyberworld May 2015 Sandrine Ammann Marketing & Communications Officer.
Important Task in Patents Retrieval Recall is an Important Factor Given Query Patent -> the Task is to Search all Related Patents Patents have Complex.
Funded under the EU ICT Policy Support Programme Automated Solutions for Patent Translation John Tinsley Project PLuTO WIPO Symposium of.
PATENTSCOPE Overview Cyber world June 2015 Sandrine Ammann Marketing & Communications Officer.
Digital/physical content store. Summary Create a digital content/physical product web store based on osCommerce. Following items can be sold in the store:
Overview of the PATENTSCOPE® search service Jerusalem 21 July 2010 Alex Riechel Associate Officer, Innovation and Technology Support Section.
August 21, 2002Szechenyi National Library Support for Multilingual Information Access Douglas W. Oard College of Information Studies and Institute for.
PATENTSCOPE Patent Search Strategies and Techniques Andrew Czajkowski Head, Innovation and Technology Support Section Centurion September 11, 2014.
Web of Knowledge Service for UK Education April 2007 An Overview Web of Knowledge Support Officer
Access to patent information and the role of classification Mikhail Makarov World Intellectual Property Organization IPC Forum 2006 Geneva.
A Study on Query Expansion Methods for Patent Retrieval Walid MagdyGareth Jones Centre for Next Generation Localisation School of Computing Dublin City.
The PATENTSCOPE search system: CLIR February 2013 Sandrine Ammann Marketing & Communications Officer.
The CLEF 2003 cross language image retrieval task Paul Clough and Mark Sanderson University of Sheffield
SUMMON ® 2.0 DISCOVERY REINVENTED. What is Summon 2.0? A new, streamlined, modern interface New and enhanced features providing layers of contextual guidance.
MIRACLE Multilingual Information RetrievAl for the CLEF campaign DAEDALUS – Data, Decisions and Language, S.A. Universidad Carlos III de.
PATENTSCOPE Result list and analysis tools Web September 2015 Sandrine Ammann Marketing & Communications Officer.
New RCLayout. Do product layout 3 improvements All products Local databases New functionalities.
UA in ImageCLEF 2005 Maximiliano Saiz Noeda. Index System  Indexing  Retrieval Image category classification  Building  Use Experiments and results.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved. 1.
Complex queries in PATENTSCOPE Web November 2015 Sandrine Ammann Marketing & Communications Officer.
How to search using PATENTSCOPE Online October 2015 Sandrine Ammann Marketing & Communications Officer.
Customization in the PATENTSCOPE search system Cyberworld November 2013 Sandrine Ammann Marketing & communications officer.
The PATENTSCOPE search system 2015 Retrospective Cyberworld December 2015 Sandrine Ammann Marketing & Communications Officer.
The PATENTSCOPE search system 2014 Retrospective Cyberworld December 2013 Sandrine Ammann Marketing & Communications Officer.
The Cross Language Image Retrieval Track: ImageCLEF Breakout session discussion.
Cross Language Information Exploitation of Arabic Dr. Elizabeth D. Liddy Center for Natural Language Processing School of Information Studies Syracuse.
Overview of PATENTSCOPE Internet January 2016 Sandrine Ammann Marketing & Communications Officer.
CLIR PATENTSCOPE search system Cyberworld February 2016 Sandrine Ammann Marketing & Communications Officer.
Cross Lingual Patent Retrieval Issues in Korean Language Minah Kim Korea Institute of Patent Information.
CLIR Cyberworld April 2014 Sandrine Ammann Marketing & Communications Officer.
PATENTSCOPE Overview Cyber world January 2014 Sandrine Ammann Marketing & Communications Officer.
PATENTSCOPE search system: Advanced search Cyberspace February 2014 Sandrine Ammann Marketing & Communications Officer.
Factiva.com. What is Factiva? Joint venture between two of the world’s leading sources of company and business news + Knight Ridder Media General Hoover’s.
PATENTSCOPE Result list and analysis tools Web May 2016 Sandrine Ammann Marketing & Communications Officer.
Using the Automatic Captions Feature. Objectives Learn how to use the Automatic Captions feature in YouTube  Edit the generated captions  Extract the.
PATENTSCOPE Patent Search Strategies and Techniques Andrew Czajkowski Head, Innovation and Technology Support Section.
Search Tools and Strategies Andrew Czajkowski Head, Innovation & Technology Support Section.
PATENTSCOPE Browse menu July 2016 Sandrine Ammann Marketing & Communications Officer.
PATENTSCOPE Translation tools
PATENTSCOPE Result list and analysis tools
PATENTSCOPE Translation tools
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
CLIR PATENTSCOPE search system
Retrospective of 2016 & plans for 2017
Options & Help menus Sandrine Ammann
Retrospective 2017 & Future Plans
Multimedia Information Retrieval
IPC & PATENTSCOPE Sandrine Ammann Marketing & Communications Officer
CLIR PATENTSCOPE search system
PATENTSCOPE Browse menu
Overview of PATENTSCOPE® search service Webinar September 2010
PATENTSCOPE: For beginners
Overview of PATENTSCOPE
PATENTSCOPE: For Beginners
Global Design Database: An introduction
PATENTSCOPE Translation tools
How to search with the PATENTSCOPE search system
Active AI Projects at WIPO
Presentation transcript:

Presentation Title Presentation Subtitle and/or Conference Name Place Day Month Year First Name Last Name Job Title

CLIR PATENTSCOPE search system Cyberworld April 2015 Sandrine Ammann Marketing & Communications Officer

To the PATENTSCOPE search system webinar CLIR

Agenda Latest developments CLIR What is CLIR? How to use it? Why is it useful? How was it developed? What is next? Quiz Q & A session

Latest developements

New: https

National patent collections be added in the future UK DK AU NZ

CLIR Cross-Lingual Information Retrieval

What is it? 1. Finds synonyms: container receptacles/ reservoir/tank 2. Translates into 11 languages container 集装箱 容器 盒 envase contenedor tanque emballage conteneurs contenants recipienti serbatoio riserva コンテナ タンク 貯槽 toevoertank watervat opslagtank Verpackung Transportbehälter Behältnisses contentor receptáculo embalagem Контейнера Емкости резервуара behaallare viravattenbehållare pappersmaskins 용기 기 탱크

CLIR – 12 languages available NON-ASIAN Dutch English French German Italian Portuguese Russian Spanish Swedish ASIAN Chinese Japanese Korean

How to use it?

Interface

Query language Define the language of the query:

Expansion mode 2 modes: Automatic = 1 step Supervised = 4 steps

CLIR: precision vs recall

Precision = the ability to retrieve the most precise results. Trying to find only precisely relevant items (high precision) = miss important items because they don't use quite the same vocabulary. Recall = the ability to retrieve as many documents as possible that match or are related to a query. Trying to find all the relevant items (high recall) = often get a lot of junk.

Example: precision

Results for «precision»

Example: recall

Results for «recall»

Examples Source:

Automatic mode

Result list

Supervised mode

Step 1: technical field selection

Step 2: synonym selection

Step 3: translated term selection

Relevance checking

Fields

Acceptable distance

Stemming

Use of the root form of a word displayed Displaydisplaying displays

IPC checking

Why is CLIR useful? A)Search full text collections simultaneously in many foreign languages B)Improve significantly the number of relevant results without increasing significantly the number of irrelevant results C)Have confidence in your searches: No black box: users have access to the CLIR generated Boolean queries (albeit complex) and have the full control on them D)Have a responsive system even for complex queries

How to make the most of out CLIR? Expansion modes Keyword very specific with only 1 meaning AUTOMATIC For any other queries, SUPERVISED is recommended Variants/synonyms Select words that you would like to appear in your search results If you have too much noise in the result list, remove generic variant

How to make the most of out CLIR? Parameters 1. Title and abstract: unconstrained distance 2. Claims: sentence/paragraph distance 3. Description: sentence/paragraph distance Stemming recommended

How was it developed? Compilation of a long list of titles in language pairs Creation of in-house extraction methodology Tool learns statistical bilingual dictionaries of titles

Quality of dictionaries Quality of dictionaries: no human intervention The more title available, the better the coverage ChineseKoreanDutch EnglishPortugueseItalian FrenchRussianSwedish GermanSpanish Japanese

Disambiguation Disambiguation: process of identifying the sense of a word in a sentence. Disambiguation is applied to keywords: 1.Technical domains based on the IPC 2.Synonyms selection

What is next? Improve terminology coverage of Korean, Chinese and Japanese Add Polish and Danish

Q:1: About latest developments … A B Some fee-based search features Secure https protocol

Q: 1: About latest developments … Some fee-based search features A B The secure https protocol

Q:2: which languages are supported by CLIR? Chinese Korean Swedish French A B C D

Q:2: which languages are supported by CLIR? Chinese Spain Swedish Korean A B C D French

Q:3 which expansion mode was used to obtain this result list? Automatic A B Supervised

Q:3: which expansion mode was used to obtain this result list? Automatic Supervised A C

mulumesc