CLIR Cyberworld April 2014 Sandrine Ammann Marketing & Communications Officer.

Welcome to the PATENTSCOPE search system webinar: CLIR

Agenda: CLIR
What is CLIR?
Why was it developed?
How to search using CLIR?
Why is it useful?
How to make the best of CLIR?
How was it developed?
What is next?
Q & A session

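What is CLIR? CLIR (cross-lingual information retrieval) is a tool that lets you search in one of the supported languages and expands your query to the original terms plus their synonyms in the 11 other supported languages. Example: entering “jackhammer” generates the following query: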

FP:((EN_TI:("jackhammer" OR "hammer drill") OR EN_AB:("jackhammer" OR "hammer drill")) OR (DE_TI:("Bohrhammer" OR "Schlaghammer") OR DE_AB:("Bohrhammer" OR "Schlaghammer")) OR (ES_TI:("martillo de perforación" OR "taladro de percusión" OR "martillo perforador electoneumatico") OR ES_AB:("martillo de perforación" OR "taladro de percusión" OR "martillo perforador electoneumatico")) OR (FR_TI:("perceuse à percussion" OR "foreuse à percussion" OR "marteau piquer" OR "brisé béton" OR "aspiration de perçage" OR "perforatrice à percussion" OR "marteau foreur" OR "perçage à percussion" OR "marteau piqueur") OR FR_AB:("perceuse à percussion" OR "foreuse à percussion" OR "marteau piquer" OR "brisé béton" OR "aspiration de perçage" OR "perforatrice à percussion" OR "marteau foreur" OR "perçage à percussion" OR "marteau piqueur")) OR (IT_TI:("trapano battente" OR "trapano a percussione" OR "martello perforatore") OR IT_AB:("trapano battente" OR "trapano a percussione" OR "martello perforatore")) OR (JA_TI:("ハンマドリル" OR "ハンマードリル") OR JA_AB:("ハンマドリル" OR "ハンマードリル")) OR (KO_TI:("를 구비한 해머 드릴" OR "햄머드릴") OR KO_AB:("를 구비한 해머 드릴" OR "햄머드릴")) OR (NL_TI:("boorhamer") OR NL_AB:("boorhamer")) OR (PT_TI:("furadeira de percussão") OR PT_AB:("furadeira de percussão")) OR (RU_TI:("отбойный молоток" OR "помощи бурильногомолотка") OR RU_AB:("отбойный молоток" OR "помощи бурильногомолотка")) OR (SV_TI:("borrhammare" OR "slagborrmaskin") OR SV_AB:("borrhammare" OR "slagborrmaskin")) OR (ZH_TI:("拆除" OR "锤钻" OR "冲击钻机") OR ZH_AB:("拆除" OR "锤钻" OR "冲击钻机")))
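
The structure of the generated query above (one clause per language, each searching title and abstract fields for a list of quoted variants) can be sketched in a few lines of Python. This is only an illustration of the query shape: the function name `build_clir_query` and its parameters are hypothetical, and it is not how PATENTSCOPE itself builds the query.

```python
def build_clir_query(variants, fields=("TI", "AB")):
    """Assemble a Boolean query shaped like the CLIR output shown above.

    variants: dict mapping a language code (e.g. "EN") to a list of terms.
    fields:   field suffixes to search (TI = title, AB = abstract).
    Hypothetical helper for illustration only.
    """
    language_clauses = []
    for lang, terms in variants.items():
        # Quote each variant and join the variants with OR.
        quoted = " OR ".join(f'"{term}"' for term in terms)
        # Search the same variant list in every requested field.
        field_clauses = [f"{lang}_{field}:({quoted})" for field in fields]
        language_clauses.append("(" + " OR ".join(field_clauses) + ")")
    # All language clauses are OR-ed together inside the FP: prefix.
    return "FP:(" + " OR ".join(language_clauses) + ")"


variants = {
    "EN": ["jackhammer", "hammer drill"],
    "DE": ["Bohrhammer", "Schlaghammer"],
    "NL": ["boorhamer"],
}
query = build_clir_query(variants)
print(query)
```

Because the output is a plain Boolean string, it can be read and edited by hand before it is submitted.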

Why CLIR?

Languages

CLIR interface

Query language: define the language of the query

Expansion mode: 2 modes (automatic and supervised)

Precision vs recall
Precision = most precise results (quality)
Recall = higher number of documents (quantity)
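
To make the trade-off concrete, here is a minimal Python sketch using the standard information-retrieval definitions: precision is the share of retrieved documents that are relevant, and recall is the share of relevant documents that were retrieved. The document IDs and the two result lists are invented for illustration.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a result list against a relevant set."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Invented example: four documents are truly relevant to the search.
relevant = {"D1", "D2", "D3", "D4"}

narrow_results = ["D1", "D2"]                          # few results, all on topic
broad_results = ["D1", "D2", "D3", "D5", "D6", "D7"]   # more results, more noise

print(precision_recall(narrow_results, relevant))  # (1.0, 0.5)
print(precision_recall(broad_results, relevant))   # (0.5, 0.75)
```

The narrow query favours precision (quality), while the broad query favours recall (quantity).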

Example: precision

Example: recall

How to search using CLIR - example

Example – automatic mode

Message

Result

Example - supervised

Message

Technical fields

Variants term 1

More variants term 1

Variants term 2

Variants term 3

Translations

Translation - Korean

Check and edit in Google Translate

Search fields

Acceptable distance

Stemming

Process that removes common endings of words
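
As a rough illustration of what removing common endings means, the toy Python function below strips a handful of English suffixes. The suffix list and the name `naive_stem` are assumptions made for this sketch; real stemmers, including whatever PATENTSCOPE uses, apply far more careful rules.

```python
# Toy suffix list, longest endings first (an assumption for this sketch).
SUFFIXES = ("ing", "ers", "er", "ed", "es", "s")


def naive_stem(word, min_stem=3):
    """Strip one common English ending, keeping at least min_stem letters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= min_stem:
            return word[: -len(suffix)]
    return word


for form in ("drill", "drills", "drilled", "drilling"):
    print(form, "->", naive_stem(form))
```

With stemming switched on, a search for one form of a word can match documents that contain the other forms as well.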

Checking: IPC

Why is CLIR useful?
A) Search full-text collections simultaneously in many foreign languages
B) Significantly increase the number of relevant results without significantly increasing the number of irrelevant results
C) Have confidence in your searches. No black box: users have access to the CLIR-generated Boolean queries (albeit complex) and have full control over them
D) Have a responsive system even for complex queries

How to make the most out of CLIR?
Expansion modes
Very specific keyword with only 1 meaning: AUTOMATIC
For any other query, SUPERVISED is recommended
Variants/synonyms
Select the words that you would like to appear in your search results
If you have too much noise in the result list, remove generic variants

How to make the most out of CLIR?
Parameters
1. Title and abstract: unconstrained distance
2. Claims: sentence/paragraph distance
3. Description: sentence/paragraph distance
Stemming recommended
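
One way to picture the distance settings is as a constraint on where the query terms must occur together: anywhere in the field (unconstrained), within one paragraph, or within one sentence. The Python sketch below assumes that interpretation; the function `terms_cooccur`, its `scope` values, and the sample text are all hypothetical, and it handles single-word terms only.

```python
import re


def terms_cooccur(text, terms, scope=None):
    """Check whether all terms occur together within the given scope.

    scope: None (anywhere in the text), "paragraph", or "sentence".
    Hypothetical helper; single-word terms only.
    """
    if scope is None:
        units = [text]
    elif scope == "paragraph":
        units = re.split(r"\n\s*\n", text)       # blank line = paragraph break
    elif scope == "sentence":
        units = re.split(r"(?<=[.!?])\s+", text)  # split after ., ! or ?
    else:
        raise ValueError(f"unknown scope: {scope!r}")

    wanted = {term.lower() for term in terms}
    for unit in units:
        words = set(re.findall(r"\w+", unit.lower()))
        if wanted <= words:  # every term appears in this unit
            return True
    return False


text = (
    "The hammer has a steel housing. A drill bit is attached.\n\n"
    "The motor is electric."
)
print(terms_cooccur(text, ["hammer", "drill"]))              # True
print(terms_cooccur(text, ["hammer", "drill"], "paragraph")) # True
print(terms_cooccur(text, ["hammer", "drill"], "sentence"))  # False
```

Short fields such as titles and abstracts can tolerate the unconstrained setting, whereas long claims and descriptions benefit from the tighter scopes because unrelated mentions are otherwise more likely to match.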

How was it developed?
Compilation of a long list of titles in language pairs
Creation of an in-house extraction methodology
The tool learns statistical bilingual dictionaries from the titles
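
The in-house extraction methodology is not described in the slides. Purely to illustrate the general idea of learning a statistical bilingual dictionary from paired titles, the Python sketch below counts which words co-occur across aligned titles and scores each word pair with the Dice coefficient. The technique, the function name `learn_dictionary`, and the three title pairs are assumptions chosen for this example, not the method actually used.

```python
from collections import Counter
from itertools import product


def learn_dictionary(title_pairs):
    """Map each source word to its best-scoring target word.

    Scores word pairs with the Dice coefficient over aligned titles.
    Illustrative stand-in, not the in-house method.
    """
    src_count, tgt_count, pair_count = Counter(), Counter(), Counter()
    for src_title, tgt_title in title_pairs:
        src_words = set(src_title.lower().split())
        tgt_words = set(tgt_title.lower().split())
        src_count.update(src_words)
        tgt_count.update(tgt_words)
        # Every source word co-occurs with every target word of the pair.
        pair_count.update(product(src_words, tgt_words))

    best = {}
    for (src, tgt), joint in pair_count.items():
        score = 2 * joint / (src_count[src] + tgt_count[tgt])
        if src not in best or score > best[src][1]:
            best[src] = (tgt, score)
    return best


# Invented English/French title pairs.
title_pairs = [
    ("electric drill", "perceuse électrique"),
    ("electric motor", "moteur électrique"),
    ("hydraulic drill", "perceuse hydraulique"),
]
dictionary = learn_dictionary(title_pairs)
for word, (translation, score) in sorted(dictionary.items()):
    print(word, "->", translation, round(score, 2))
```

Even in this toy form, the more title pairs are available, the easier it becomes to separate true translations from accidental co-occurrences.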

Quality of dictionaries
No human intervention
The more titles available, the better the coverage
Languages: Chinese, Korean, Dutch, English, Portuguese, Italian, French, Russian, Swedish, German, Spanish, Japanese

Disambiguation: the process of identifying the sense of a word in a sentence.
Disambiguation is applied to keywords through:
1. Technical domains based on the IPC
2. Synonym selection

What is next?
Improve terminology coverage of already supported languages
Add other languages: over 200’000 titles and abstracts with associated high-quality translations in English

Slides and recording

Thank you