The Institute of Scientific and Technical Information of China

Slides:



Advertisements
Similar presentations
© 2014 Fair Isaac Corporation. Confidential. This presentation is provided for the recipient only and cannot be reproduced or shared without Fair Isaac.
Advertisements

Principal Patent Analyst
© 2014 Fair Isaac Corporation. Confidential. This presentation is provided for the recipient only and cannot be reproduced or shared without Fair Isaac.
A Quick Overview By Munir Winkel. What do you know about: 1) decision trees 2) random forests? How could they be used?
Machine Learning and Data Mining Course Summary. 2 Outline  Data Mining and Society  Discrimination, Privacy, and Security  Hype Curve  Future Directions.
Lecture 14 – Neural Networks
Introduction to WEKA Aaron 2/13/2009. Contents Introduction to weka Download and install weka Basic use of weka Weka API Survey.
Mr. Perminous KAHOME, University of Nairobi, Nairobi, Kenya. Dr. Elisha T.O. OPIYO, SCI, University of Nairobi, Nairobi, Kenya. Prof. William OKELLO-ODONGO,
An Enterprising University Roger Ford Chair of Innovation and Technology Strategy.
Gavin Russell-Rockliff BI Technical Specialist Microsoft BIN305.
Peter Myers Bitwise Solutions Pty Ltd. Predictive Analytics PresentationExplorationDiscovery Passive Interactive Proactive Business Insight Canned.
Classifiers, Part 3 Week 1, Video 5 Classification  There is something you want to predict (“the label”)  The thing you want to predict is categorical.
The New USPTO Rules and their Impact on Biomedical Patent Prosecution Mojdeh Bahar, J.D.,M.A. Technology Licensing Specialist Office of Technology Transfer.
Deductive Data Analytics Creating relevance through data mining Kris Marshall – ICE 2015.
“Put the Power of Predictive Analytics in the Hands of Clinical Researchers” Filippos Katsampouris Marketing Manager Healthcare & Pharmaceutical Accounts.
B. RAMAMURTHY EAP#2: Data Mining, Statistical Analysis and Predictive Analytics for Automotive Domain CSE651C, B. Ramamurthy 1 6/28/2014.
ROOT: A Data Mining Tool from CERN Arun Tripathi and Ravi Kumar 2008 CAS Ratemaking Seminar on Ratemaking 17 March 2008 Cambridge, Massachusetts.
Synapse and Catapult– New NIH OTT Technology Transfer Tools Ajoy K. Prabhu, M.S., M.B.A. Group Leader, Marketing Office of Technology Transfer National.
Polimi Case study: Procedures, tools, facts & Figures
Department of Computer Science, University of Waikato, New Zealand Geoffrey Holmes, Bernhard Pfahringer and Richard Kirkby Traditional machine learning.
Consul- ting Services Outsour- cing Services Techno- logy Services Local Profes- sional Services Competence Centers Business Intelligence WebTech SAP.
THE IMPORTANCE OF IPR ACROSS THE LIFECYCLE OF INNOVATION Bob Stembridge Principal Patent Analyst, IP & Science.
Xintao Wu Jan 18, 2013 Retweeting Behavior and Spectral Graph Analysis in Social Media.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
1 Some more examples Client satisfaction Products sold Trusted advisor score Net growth TOP PERFORMERS Age diversity HIGH Credibility HIGH Absenteeism.
PatentsView and Inventor Disambiguation Alan Marco 9/23/2015.
Intellectual Property as Integral Part of S&T Strategy Dr. Nadya Reingand Patent Hatchery LLC Washington DC, USA © Nadya Reingand.
Machine Learning in Practice Lecture 8 Carolyn Penstein Rosé Language Technologies Institute/ Human-Computer Interaction Institute.
Data Mining With SQL Server Data Tools Mining Data Using Tools You Already Have.
Social Media Analytics Market to Global Analysis and Forecast by Type, Application and Vertical No of Pages: 150 Publishing Date: Jan 2017 Single.
Social Media Analytics Market to Global Analysis and Forecast by Type, Application and Vertical No of Pages: 150 Publishing Date: Feb 2017 Single.
How Text Mining Helps To Find Fradulent Buyers. Text analytics can help companies discover true market perceptions, but only if the analysis is done in.
Introduction to Machine Learning, its potential usage in network area,
What we mean by Big Data and Advanced Analytics
Saskatoon SAS user group
Design of foresight-based evaluation in Tekes Activities
Combining Models Foundations of Algorithms and Machine Learning (CS60020), IIT KGP, 2017: Indrajit Bhattacharya.
Classify A to Z Problem Statement Technical Approach Results Dataset
Technology Transfer Office
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Makes Insurance Smarter.
Portfolio Analysis in OPASI at NIH
Data Mining in Higher Education
Decision Trees in Analytical Model Development
Carolinas HealthCare System: Consumer Analytics
PLM, Document and Workflow Management
Measuring social added value Italian experiences
Design and Manufacturing in a Distributed Computer Environment
Delivering Business Insight with SQL Server 2005
Larry Lian Attorney Advisor Office of Policy and International Affairs
Machine Learning & Data Science
Classifying enterprises by economic activity
11/22/2018 1:43 PM THR3005 How to provide business insight from your data using Azure Analysis Services Peter Myers Bitwise Solutions © Microsoft Corporation.
Dr. Morgan C. Wang Department of Statistics
Machine Learning with Weka
Analytics: Its More than Just Modeling
Rohit Singh Gole Clarivate Analytics Patent Evaluation.
INNOvation in TRAINING BUSINESS ANALYSTS HAO HElEN Zhang UniVERSITY of ARIZONA
Technical Capabilities
Machine learning Empirical Performance Analysis
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
orange.biolab.si A general-purpose open source component-based
6/17/ :03 AM © 2004 Microsoft Corporation. All rights reserved.
T-FLEX DOCs PLM, Document and Workflow Management.
A Study of Company — Sense Time
An enterprise solution to managing and transferring technology and knowledge A Springboard Project.
March Madness Data Crunch Overview
Prof. Kiran Kalia, Director NIPER Ahmedabad
Is Statistics=Data Science
Presentation transcript:

Using Open Data and Tools to Predict Which cancer Patents Would Be FDA Approved? The Institute of Scientific and Technical Information of China Yang Guan-Can Oct 26, 2016 Email: yanggc@istic.ac.cn

1. Predicting Successful FDA Approval Patent grant and FDA approval are both important to a pharmaceutical production enterprises. approval processes expensive and time-consuming, uncertainty. Inability to predict which patents could be FDA approved. So triaging potential patents based on their likelihood of accepted FDA approval would be important. Patent grant and FDA approval are both important to a pharmaceutical production enterprises. Patents are crucial to protect a company’s ideas while FDA approval is necessary to legally market their products. However the PTO and FDA approval processes are expensive and time-consuming, with full of uncertainty. Till now, we cannot clearly identify which granted patents would continue to apply a FDA approval. Which slow down the speed of patent prosecution and increase the risk and cost of enterprises. So triaging those potential product-ready patents based on their likelihood of FDA approval would be very important.

2. Opportunities of Open Data & Tools It is really not easy to solve this problem, because the path from patent grant to FDA approval is complicated and uncertain, and accompanied with enterprises’ multiple strategies, such as “Metabolite Defense”. Fortunately, more and more institutions and individuals joined the open data and tools list recently, providing us with excellent resource, which is new opportunity for us exploring to resolve the tricky questions. PatentsView just is an example, and it provided API interface and bulk downloads based on further data processing and inventor disambiguation, Recently, USPTO has provided a cancer moonshot patent data, which linked NIH funding data and FDA approval data, which externally expands the dimensions of our predictive analytics. At the same time, some commercial vendors contributes some free tools, just like KNIME, RSTUDIO, with rich machine learning algorithms and interactive visualization functions. So it is time for us to integrate all these tools helping innovators have more in-depth insights on the messy world.

3. Predictive Workflow on Patent2FDA KNIME analytic platform is a free tool; Supporting large data; integrating various projects, e.g., text mining, deep learning. We build the whole predictive project on KNIME CRISP-DM rule

4. Results and Model Evaluation Comparing several models and the performance of Random Forest is best; However, the result is only “forward looking statements”, need more validation. Model Name Accuracy Cohen's Kappa Area Under Curve Random Forest 0.914285714 0.828571429 0.9624 Multi_layer Perceptron 0.9 0.8 0.8041 Decision Tree 0.885714286 0.771428571 0.933 Naive Bayes 0.814285714 0.628571429 0.9363 Logistical Regression 0.857142857 0.714285714 0.9510 Gradient Boosted Trees 0.9408

5. Model Interpretation The applicant’s historical experience on FDA approval Similarly, the assignee’s experience is also important. Lawyer and Agency experiences; CPC group: A61K and C07K concurrence; Patent average process ratio The state of assignee’s

6. Thoughts The path from patent grant to FDA approval is complicated and uncertain. Predictive analytics may provide us some insights. In the future, we wish more and more open data could be added into model, which would facilitate the performance of predictive model, such as text mining, network analysis.

Thanks