Complex queries in PATENTSCOPE Web November 2015 Sandrine Ammann Marketing & Communications Officer
Agenda Latest development Complex queries: Interfaces Search syntax Field definition Quiz Q&A
Latest developments 50 Million
Figures 3,400,000 records: bibliographic data, searchable full- text and drawings in Korean Front file in WIPO standard ST96ST96 New data available within 1 month after pub in KIPRIS over 50 million the total number of patent documents 5 the number of IP5 offices whose full-text is searchable
Complex queries
Interface Field Combination
Pros and cons The fields are predefined but space is limited Combination of AND and OR not possible, limited to either all AND or all OR
Interface: Advanced search
To help you fine-tune your search 1. Boolean operators 2. Proximity operators 3. Field codes 4.Stemming/wildcard/fuzzy searches 5. Grouping/nesting
Help menu
Boolean operators AND OR NOT ANDNOT
ANDNOT - NOT Use ANDNOT when searching A excluding B bicycle ANDNOT boat Use NOT when searching all documents except A NOT(car AND bicycle AND boat)
Boolean operators Narrow, broad and related terms grouped together in proximity and related to other groups of terms expressing functionality or application Are some words required to be in your results? Are any words to be excluded? Should words be near? How near?
Proximity operator NEAR Proximity searching Find words that are next to each other NEAR2 = 2 defines the maximum number of word gaps between 2 search terms
NEAR: an example carbon AND wheel 116,383 results "wheels made of carbon", "carbon may be used to make said wheel“ carbon NEAR10 wheel 2,971 results
Proximity search: BEFORE the order of terms is significant. trunk BEFORE cutting
BEFORE a particular date Anything published before October 25th, 1999 DP:[1900 TO ] DP:[ Years TO ]
Ranges – include/exclude numbers DP:[ TO ] = inclusive DP:{ TO } = exclusive
Wildcards – truncation: ? * plural form of a word, or alternative spellings * stands for 0 or more characters ? stands single character te?t = test or text electric* = electrical; electricity behavi*r = behaviour or behavior micro?p* = microspeaker, microsporidial
Wildcards - Stemming elect* would search for words like elect, elects, electron, electrons, electric, electrical, electricity, electoral electric stemmed will find electricity electrical
Fuzzy searches Use of the tilde: ~ Examples: roam~ foam / roams Roam~0.8
^ caret = weighting factor Same result but ranking will be different touch^3 AND polarize
Grouping
Grouping/nesting Solar OR (wind AND turbine) (solar OR wind) AND turbine EN_TI: electric car electric will be searched in English title but car in all fields EN_TI: (electric car) Both electric and car will be searched in the English title
Fields:
Examples FP = front page ALL = all fields ALL_NAMES = all names IC = IPC DP = publication date CTR = country either WO or country from national collection NPCC= national phase entry AN = origin of PCT NPCC:US and NPEDD:Last1Year
Intuitive queries PCTHavingThirdPartyObservations PCTPublishedLastWeek PCTHavingApplicantFromUS DP:Last1Year DP:[Today-1Week TO Today]
Fields rules Basic fields: elements of a patent document Derived fields 2 letter code = individual field EN_TI FR_AB ES_DE_S Convention: language specified by 2 letters if not specified all languages S = stemmed : to separate term without any space
Fields: golden rules EN_ALL = default field field indicator not required Field name followed by : ":" or "/" The field is only valid for the term that it directly precedes, so the query: EN_TI:("wind turbine" AND electric) solar Results: "wind turbine" AND electric in the title field. "solar" in the default field (EN_ALL ).
Most common errors (….) "…" " not “ Field name No space No wildcard at the beginning of a word
Q1: Which operator/s is/are supported in PATENTSCOPE? AND OR NEAR BEFORE A B C D
Q1: Which operator/s is/are supported in PATENTSCOPE? AND Spain NEAR OR A B C D BEFORE
Q2:How many patent documents are available in PATENTSCOPE? A B About 30 million Over 50 million
Q2: How many patent documents are available in PATENTSCOPE? About 30 million A B Over 50 million
Q3: Which the correct way of using NEAR? (electric AND car) NEAR power A B C D EN_AB: (hearing NEAR aid) EN_AB: hearing NEAR aid power NEAR (electric AND car)
Q3: Which the correct way of using NEAR? (electric AND car) NEAR power EN_AB: hearing NEAR aid EN_AB: (hearing NEAR aid) A B C D power NEAR (electric AND car)
December webinar: Retrospective of 2015 & plans for 2016 Tue, December 15 5:30 PM - 6:30 PM CEST Thu, December 17 8:30 AM - 9:30 AM CEST To sign up:
mulumesc