Reengineering Classification at the USPTO Marti Hearst, Chief IT Strategist, USPTO PIUG Conference May 4, 2010.

Slides:



Advertisements
Similar presentations
Design of Experiments Questions Network Inference Working Group October 8, 2008.
Advertisements

Presented by Erin Palmer. Speech processing is widely used today Can you think of some examples? Phone dialog systems (bank, Amtrak) Computers dictation.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Classification & Your Intranet: From Chaos to Control Susan Stearns Inmagic, Inc. E-Libraries E204 May, 2003.
Neural networks Introduction Fitting neural networks
Toward Automatic Music Audio Summary Generation from Signal Analysis Seminar „Communications Engineering“ 11. December 2007 Patricia Signé.
Speech in Multimedia Hao Jiang Computer Science Department Boston College Oct. 9, 2007.
 Course  Name  Student ID  Discipline  Introduction to computing  xxxxxx  xxxxx.
December 2006 Cairo University Faculty of Computers and Information HMM Based Speech Synthesis Presented by Ossama Abdel-Hamid Mohamed.
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Building an Intelligent Web: Theory and Practice Pawan Lingras Saint Mary’s University Rajendra Akerkar American University of Armenia and SIBER, India.
1 User-Centered Design at the USPTO: Application to Patent IT Modernization Marti Hearst Chief IT Strategist, USPTO May 23, 2011.
FROM INFORMATION, KNOWLEDGE Prof. Marti Hearst MIMS Visit Day, 2006 Some Research Projects.
A metadata-based approach Marti Hearst Associate Professor BT Visit August 18, 2005.
Equine Gait Analysis and Visualization Methods Dr. Marjorie Skubic Samer Arafat Justin Satterley Computer Engineering & Computer Science Dr. Kevin Keegan.
Optimal Adaptation for Statistical Classifiers Xiao Li.
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
Module 1: Overview of Information System in Organizations Chapter 2: How Organizations use IS.
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
1 WARINTEK ? WARINTEK (Technology Information Kiosk) is Indonesian Multipurpose Community Telecentres that could encourage and support.
Module 3: Business Information Systems Chapter 11: Knowledge Management.
COMPUTER-ASSISTED PLAGIARISM DETECTION PRESENTER: CSCI 6530 STUDENT.
Machine Learning in Spoken Language Processing Lecture 21 Spoken Language Processing Prof. Andrew Rosenberg.
Competitive Sourcing of Classification at the United States Patent and Trademark Office Terrence Mackey International Liaison Staff U.S. Patent and Trademark.
As We May Think By Vannevar Bush July 1945 Presented by Steve Hookway 9/6/05.
Minimum Mean Squared Error Time Series Classification Using an Echo State Network Prediction Model Mark Skowronski and John Harris Computational Neuro-Engineering.
Csc Lecture 7 Recognizing speech. Geoffrey Hinton.
Overview of Part I, CMSC5707 Advanced Topics in Artificial Intelligence KH Wong (6 weeks) Audio signal processing – Signals in time & frequency domains.
Towards Coastal Threat Evaluation Decision Support Presentation by Jacques du Toit Operational Research University of Stellenbosch 3 December 2010.
14 October 2010 Leveraging Technical Expertise via Boeing Library Services* Diane Brenes, Librarian, Boeing Library & Learning Center Services.
22CS 338: Graphical User Interfaces. Dario Salvucci, Drexel University. Lecture 10: Advanced Input.
Advanced Topics in Speech Processing (IT60116) K Sreenivasa Rao School of Information Technology IIT Kharagpur.
Today Ensemble Methods. Recap of the course. Classifier Fusion
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise T. Scott Brandes IEEE Transactions.
Audio processing methods on marine mammal vocalizations Xanadu Halkias Laboratory for the Recognition and Organization of Speech and Audio
Cognitive Systems Foresight Language and Speech. Cognitive Systems Foresight Language and Speech How does the human system organise itself, as a neuro-biological.
Introduction to the Semantic Web and Linked Data
CHAPTER 8 DISCRIMINATIVE CLASSIFIERS HIDDEN MARKOV MODELS.
Conditional Random Fields for ASR Jeremy Morris July 25, 2006.
1 CRANDEM: Conditional Random Fields for ASR Jeremy Morris 11/21/2008.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Automatic Speech Recognition A summary of contributions from multiple disciplines Mark D. Skowronski Computational Neuro-Engineering Lab Electrical and.
Introduction Part I Speech Representation, Models and Analysis Part II Speech Recognition Part III Speech Synthesis Part IV Speech Coding Part V Frontier.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Dr.Abeer Mahmoud ARTIFICIAL INTELLIGENCE (CS 461D) Dr. Abeer Mahmoud Computer science Department Princess Nora University Faculty of Computer & Information.
Web 2.0: Making the Web Work for You, Illustrated Unit A: Research 2.0.
Chapter 7 Speech Recognition Framework  7.1 The main form and application of speech recognition  7.2 The main factors of speech recognition  7.3 The.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
Pattern Recognition NTUEE 高奕豪 2005/4/14. Outline Introduction Definition, Examples, Related Fields, System, and Design Approaches Bayesian, Hidden Markov.
WHAT IS DATA MINING?  The process of automatically extracting useful information from large amounts of data.  Uses traditional data analysis techniques.
1 7-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches Recognition Theories Bayse Rule Simple Language Model P(A|W) Network Types.
語音訊號處理之初步實驗 NTU Speech Lab 指導教授: 李琳山 助教: 熊信寬
Optical Character Recognition
Speaker Recognition UNIT -6. Introduction  Speaker recognition is the process of automatically recognizing who is speaking on the basis of information.
Speech Recognition through Neural Networks By Mohammad Usman Afzal Mohammad Waseem.
Building Smart Java Applications with Neural Networks, Using the Neuroph Framework Zoran Sevarac, Faculty of Organisational Sciences, University of Belgrade.
Module 1: Overview of Information System in Organizations
Today’s Lecture Neural networks Training
Deep Learning Amin Sobhani.
Speech recognition in mobile environment Robust ASR with dual Mic
Intro to Machine Learning
Artificial Intelligence for Speech Recognition
Data Mining Lecture 11.
Mining the Data Charu C. Aggarwal, ChengXiang Zhai
Kocaeli University Introduction to Engineering Applications
Music Signal Processing
The Application of Hidden Markov Models in Speech Recognition
Huawei CBG AI Challenges
Presentation transcript:

Reengineering Classification at the USPTO Marti Hearst, Chief IT Strategist, USPTO PIUG Conference May 4, 2010

Questions to Consider What is classification used for? What currently works well? What is currently problematic about the classification system? How can we fix these problems?

How is classification used? As a search aid –Grouping related art together –Narrowing down what needs to be looked at To assign work –Associated with art units –Assigning applications to examiners

What are its strengths? Familiar For some art areas, is nicely detailed.

What are the problems? Time-consuming to update Backwards looking (does not anticipate new directions in technology) The non-patent literature is not classified Is difficult to understand: –For new examiners –For many in the external community Is not harmonized internationally

What are some goals? Note: these are just ideas to discuss. –Make it easier for examiners, managers, or others, to suggest new classes as they arise. –Engage the external community in suggesting up-and-coming classes. –Engage the community in classifying NPL.

What are some goals? Note: these are just ideas to discuss. –Flexible, adaptive to rapidly-developing technology –Easier to browse classified documents dynamically –Easier to classify complex topics that span many fields –Easier to classify across different points of view (structure vs function, for instance) –More aligned with modern classification practices and technology.

Modern Classification Today, most online systems that use classes use faceted classification. –Not just e-commerce, but also digital libraries Bioscience (gopubmed.org, nextbio.com) Computer Science (dblp.l3s.de) Worldcat library catalog U Chicago DL ( Image collections

Lens.lib.uchicagoe.edu

Lens.lib.uchicago.edu

Worldcat.org

worldcat.org

How to apply to the USPC? Speech Signal Processing. Psychoacoustic. For storage or transmission.. Neural Network.. Transformation … Orthogonal functions.. Frequency … Specialized Information …. Pitch ….. Voiced or Unvoiced …. Formant …. Silence decision.. Voice recognition … Preliminary matching …Endpoint detection.. Word recognition … Preliminary matching … Endpoint detection … Specialized models …. Markov ….. Hidden Markov Models (HMM) …… Training of HMM ……. With insufficient amount of training data …… HMM network. Synthesis.. Neural network.. Transformation

How to apply to the USPC? Speech Signal Processing. Psychoacoustic. For storage or transmission.. Neural Network.. Transformation … Orthogonal functions.. Frequency … Specialized Information …. Pitch ….. Voiced or Unvoiced …. Formant …. Silence decision.. Voice recognition … Preliminary matching …Endpoint detection.. Word recognition … Preliminary matching … Endpoint detection … Specialized models …. Markov ….. Hidden Markov Models (HMM) …… Training of HMM ……. With insufficient amount of training data …… HMM network. Synthesis.. Neural network.. Transformation … Specialized models …. Markov ….. Hidden Markov Models (HMM) …… Training of HMM ……. With insufficient amount of training data …… HMM network.. Neural Network.. Transformation … Orthogonal functions

How to Facet the USPC? Speech Signal Properties. Psychoacoustic. Pitch.. Voiced or unvoiced. Formant Speech Signal Problems. Voice recognition. Word recognition. Phrase recognition. Noise reduction. Generation Speech Signal Applications. Text to speech. Speech to text. Meeting recording Machine Learning Methods. HMMs.. HMM Network. Neural Nets.. Linear.. Sigmoidal. Maximum Entropy Machine Learning Techniques issues. Limited training data. Training techniques Data Transformations. Orthogonal functions. Quantization

Potential Advantages More familiar to outsiders More flexible for navigation Easy to include multiple points of view Potentially easier to capture cross-domain similarities (more systematic than x-class) May integrate well with IPC and F-Terms May be easier to automate assignment

Is this a good idea? We’re doing a pilot test on classes in There are of course many issues to consider. Let’s discuss!