Impromptu Data Extraction and Analysis Data Mining and Analytics Framework for VLSI Designs Sandeep P +91 80 2507

Slides:



Advertisements
Similar presentations
DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
Advertisements

Kensington Oracle Edition: Open Discovery Workflow Meets Oracle 10g Professor Yike Guo.
© 2012 IBM Corporation 1 IBM Cognos 10 family Analytics in the hands of everyone Address all your analytic needs Report, Analyze, Model, Plan and Collaborate.
Why ROOT?. ROOT ROOT: is an object_oriented frame work aimed at solving the data analysis challenges of high energy physics Object _oriented: by encapsulation,
Components and Architecture CS 543 – Data Warehousing.
DATA WAREHOUSING.
Progress Report 11/1/01 Matt Bridges. Overview Data collection and analysis tool for web site traffic Lets website administrators know who is on their.
Interpret Application Specifications
Performed by:Gidi Getter Svetlana Klinovsky Supervised by:Viktor Kulikov 08/03/2009.
Web-Enabling the Warehouse Chapter 16. Benefits of Web-Enabling a Data Warehouse Better-informed decision making Lower costs of deployment and management.
DASHBOARDS Dashboard provides the managers with exactly the information they need in the correct format at the correct time. BI systems are the foundation.
November 2011 At A Glance GREAT is a flexible & highly portable set of mission operations analysis tools that increases the operational value of ground.
Passage Three Introduction to Microsoft SQL Server 2000.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
State of Connecticut Core-CT Project Query 4 hrs Updated 1/21/2011.
 ETL: Extract Transformation and Load  Term is used to describe data migration or data conversion process  ETL may be part of the business process repeated.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Copyright © 2006, SAS Institute Inc. All rights reserved. Enterprise Guide 4.2 : A Primer SHRUG : Spring 2010 Presented by: Josée Ranger-Lacroix SAS Institute.
Overview of SQL Server Alka Arora.
CIS 375—Web App Dev II ASP.NET 2 Introducing Web Forms.
MySQL GUI Administration Tools Rob Donahue Manager, Distributed Systems Development May 7th, 2001 Rob Donahue Manager, Distributed Systems Development.
SharePoint 2010 Business Intelligence Module 2: Business Intelligence.
ROOT: A Data Mining Tool from CERN Arun Tripathi and Ravi Kumar 2008 CAS Ratemaking Seminar on Ratemaking 17 March 2008 Cambridge, Massachusetts.
OPC Database.NET. OPC Systems.NET What is OPC Systems.NET? OPC Systems.NET is a suite of.NET and HTML5 products for SCADA, HMI, Data Historian, and live.
Tutorial 10 Adding Spry Elements and Database Functionality Dreamweaver CS3 Tutorial 101.
DEVSView: A DEVS Visualization Tool Wilson Venhola.
Oracle Application Express (Oracle APEX), formerly called HTML DB, is a Free rapid web application development tool for the Oracle database.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Design Patterns Phil Smith 28 th November Design Patterns There are many ways to produce content via Servlets and JSPs Understanding the good, the.
OracleAS Reports Services. Problem Statement To simplify the process of managing, creating and execution of Oracle Reports.
CAD for Physical Design of VLSI Circuits
Chapter 1 Introduction to SAS ® Enterprise Guide ®
A New Method For Developing IBIS-AMI Models
Fundamentals of Information Systems, Seventh Edition 1 Chapter 3 Data Centers, and Business Intelligence.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
The Client/Server Database Environment Ployphan Sornsuwit KPRU Ref.
C6 Databases. 2 Traditional file environment Data Redundancy and Inconsistency: –Data redundancy: The presence of duplicate data in multiple data files.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
ABSTRACT The JDBC (Java Database Connectivity) API is the industry standard for database- independent connectivity between the Java programming language.
By N.Gopinath AP/CSE. There are 5 categories of Decision support tools, They are; 1. Reporting 2. Managed Query 3. Executive Information Systems 4. OLAP.
1 Yield Analysis and Increasing Engineering Efficiency Spotfire Users Conference 10/15/2003 William Pressnall, Scott Lacey.
XML and Its Applications Ben Y. Zhao, CS294-7 Spring 1999.
- 1 - ©2009 Jasper Design Automation ©2009 Jasper Design Automation JasperGold for Targeted ROI JasperGold solutions portfolio delivers competitive.
Building Dashboards SharePoint and Business Intelligence.
© FPT SOFTWARE – TRAINING MATERIAL – Internal use 04e-BM/NS/HDCV/FSOFT v2/3 JSP Application Models.
Library Online Resource Analysis (LORA) System Introduction Electronic information resources and databases have become an essential part of library collections.
SSIS – Deep Dive Praveen Srivatsa Director, Asthrasoft Consulting Microsoft Regional Director | MVP.
A computer contains two major sets of tools, software and hardware. Software is generally divided into Systems software and Applications software. Systems.
Automated Testing April 2001WISQA Meeting Ronald Utz, Automated Software Testing Analyst April 11, 2001.
SAP BI – The Solution at a Glance : SAP Business Intelligence is an enterprise-class, complete, open and integrated solution.
Introduction The concept of a web framework originates from the basic idea that every web application obtains its foundations from a similar set of guidelines.
Building Enterprise Applications Using Visual Studio®
AuraPortal Cloud Helps Empower Organizations to Organize and Control Their Business Processes via Applications on the Microsoft Azure Cloud Platform MICROSOFT.
Meemim's Microsoft Azure-Hosted Knowledge Management Platform Simplifies the Sharing of Information with Colleagues, Clients or the Public MICROSOFT AZURE.
The Client/Server Database Environment
De-mystifying Big Data Testing using new generation tools / technology
CSCI/CMPE 3334 Systems Programming
Yellowfin: An Azure-Compatible Business Intelligence Platform That Connects People with Their Data for Better Decision Making MICROSOFT AZURE APP BUILDER.
On-Premises, or Deployed in a Hybrid Environment
Oracle Architecture Overview
Silverlight Technology
Accelerate Your Self-Service Data Analytics
Unleashing the power of customized reports testing framework
Tiers vs. Layers.
Media365 Portal by Ctrl365 is Powered by Azure and Enables Easy and Seamless Dissemination of Video for Enhanced B2C and B2B Communication MICROSOFT AZURE.
Overview of big data tools
Last.Backend is a Continuous Delivery Platform for Developers and Dev Teams, Allowing Them to Manage and Deploy Applications Easier and Faster MICROSOFT.
Presentation transcript:

Impromptu Data Extraction and Analysis Data Mining and Analytics Framework for VLSI Designs Sandeep P Anand Ananthanarayanan Intel Corporation

2 Intel Information Technology, FOR INTERNAL USE ONLY Author Affiliations AuthorAffiliationPhone Number Address Sandeep PIntel Corporation Anand AnanthanarayananIntel Corporation

3 Intel Information Technology, FOR INTERNAL USE ONLY Abstract Design processes from Logic design, validation, backend implementation and verification require a plethora of CAD tools. These tools generate reports, debug information in its own form and content. Designers need to parse and review data from multiple sources and tools to make design calls. When implementing backend design, a designer or a methodology owner needs to understand the patterns seen in the design. Data like number of paths dominated by low leakage, Slope profile for cells with margin > x ps, Drive strength profile of cells in timing path, etc., are critical to make design decisions, optimize design collaterals and ensure design with robust electrical functionality. Many of the data can be obtained only through data mining of results and logs of multiple tools. Data mining is also a constant activity from technology readiness to execution and post silicon debug phases. Data mining problem gets compounded when data is needed from different PV domains. For example, a designer looking to optimize power would need dynamic power information, path margin and max cap information all generated by different tools in different formats in different locations. Data mining has been historically done by adhoc scripts to parse through different reports, and log files. Data generated is post processed and then visualized. Any requirement change in data mining would need changes in the scripts. There is no data mining model which supports multiple tools with different output formats. There is no methodology which supports cross domain analysis. We present IDEA (Impromptu Data Extraction and Analysis). IDEA is data mining and data analysis framework in a highly interactive web application platform. It supports assimilating data from different tools and formats into one data organization in the form of SQL tables. SQL enables compact organization and faster queries. IDEA framework is built using the Linux-Apache™-Mysql™-Perl (LAMP) packages and uses the R language for performing statistical analysis on the data. R language enables handling huge amount of data with support for different statistical plots like pie-charts, histograms, box plots, scatter plots, Linear regression etc. IDEA data mining completes in minutes compared to hours/days with conventional approaches like scripts. IDEA is highly interactive web application with all the data extraction and plotting functionalities abstracted using highly interactive widgets. IDEA has been used to data mine power savings post Optimization, Analysis of power distribution, Profile the speed paths, Review standard cells usage, Utilization of cell sizes across the design space, RC delays per path stage and has multiple other usages. Large precious unorganized data lies unexploited. Structured Data Mining essential for competitive VLSI design. Increasing complexity makes data analytics a must-have for quality design. No EDA tool exists today to do this critical data mining. IDEA fills this gap and provides valuable data mining capability. It is time to think of Data Mining as a EDA product

4 Intel Information Technology, FOR INTERNAL USE ONLY Design Process Functional Circuit Design Verify Implement Logic Timing Reports Extraction reports Route utilization Cell utilization DRC reportsLayout Checks Multiple Tools Multiple Reports Multiple Formats Multiple Tools Multiple Reports Multiple Formats Design Reports Large Data gets generated requiring interpretation and Analysis

5 Intel Information Technology, FOR INTERNAL USE ONLY Data Conundrum Count of min Z cells with < y ps margin Count of back2back flops in design Is clock power in wires or in devices? Increase design guardband? Slope profile for cells with margin > x ps Numbers of paths dominated by low leakage Drive strength profile of cells in timing path Design Quality Increasingly Dependent on Multiple Parameters

6 Intel Information Technology, FOR INTERNAL USE ONLY Data Mining - A Constant Activity Data mining is done to generate design heuristics Tech Readiness Data mining is done to determine delta changes on design limits Data mining needed for optimization Design execution Data mining is done to root cause and understand the PV-Silicon miscorrelation Post silicon Formal Data Mining Tool or Model Not Currently Available In Industry

7 Intel Information Technology, FOR INTERNAL USE ONLY To solve this Data Mining problem, we present

8 Intel Information Technology, FOR INTERNAL USE ONLY IDEA Impromptu Data Extraction and Analysis (IDEA) is  Web application for Data mining on an open architecture  Linked Data Caching SQL databases  Common Xml interface for data manipulation  Statistical analysis capability with ‘R’ Language  Practically unlimited capacity with ‘R’ Language  Data visualization capability  Histograms, pie charts, density/scatter plots, dot charts  Faster turn around time (no text parsing scripts)  Intuitive, web based user interface Highly Interactive Application for Data Mining

9 Intel Information Technology, FOR INTERNAL USE ONLY IDEA Web Based Data Mining Platform

10 Intel Information Technology, FOR INTERNAL USE ONLY IDEA Architecture DataBase Presentation Tier Application Tier Storage Tier Three Tiered Web Application

11 Intel Information Technology, FOR INTERNAL USE ONLY Architecture – Idea Client Simple Client with Powerful Capabilities Idea Server AJAX Calls JSON for data transfer Control Center Data Viewer Report Viewer Experiments Apps Statistical Analysis Spreadsheet Generation PDF Converter Apps Statistical Analysis Spreadsheet Generation PDF Converter Data Extraction Data Manipulation

12 Intel Information Technology, FOR INTERNAL USE ONLY Basic Usage Flow Open IDEA AppSelect ProjectSelect BlocksManipulate Data Run pre-selected Queries OR Query interactively Run Statistical Analysis and Reports Generate PDF or export to spreadsheets

13 Intel Information Technology, FOR INTERNAL USE ONLY IDEA Usage and Benefits IDEA Usage Datamine power savings post Optim Analysis of power distribution Profile the speed paths Review standard cells usage Utilization of cell sizes across the design space RC delays per path stage - Data Mining Simplified -

14 Intel Information Technology, FOR INTERNAL USE ONLY Summary Large precious unorganized data lies unexploited Structured Data Mining essential for competitive VLSI design Increasing complexity makes data analytics a must-have for quality design No EDA tool exists today to do this critical data mining IDEA fills this gap and provides valuable data mining capability - It is time to think of Data Mining as a EDA product -

15 Intel Information Technology, FOR INTERNAL USE ONLY Acknowledgements  Everyone at Intel who contributed to this work