Data Mining

advertisement

Impromptu Data

Extraction and Analysis

Data Mining and Analytics

Framework for VLSI Designs

Sandeep P ( sandeep.p@intel.com

); +91 80 2507 5492

Anand Ananthanarayanan ( anand.ananthanarayanan@intel.com

); +91 80 2507 5774

Intel Corporation

Author Affiliations

Author

Sandeep P

Anand Ananthanarayanan

Affiliation

Intel Corporation

Intel Corporation

Phone Number

+91 80 2507 5492

+91 80 2507 5774

Email Address sandeep.p@intel.com

anand.ananthanarayanan@intel.com

Intel Information Technology

2 , FOR INTERNAL USE ONLY

3

Abstract

Design processes from Logic design, validation, backend implementation and verification require a plethora of CAD tools. These tools generate reports, debug information in its own form and content. Designers need to parse and review data from multiple sources and tools to make design calls. When implementing backend design, a designer or a methodology owner needs to understand the patterns seen in the design. Data like number of paths dominated by low leakage, Slope profile for cells with margin > x ps, Drive strength profile of cells in timing path, etc., are critical to make design decisions, optimize design collaterals and ensure design with robust electrical functionality. Many of the data can be obtained only through data mining of results and logs of multiple tools. Data mining is also a constant activity from technology readiness to execution and post silicon debug phases. Data mining problem gets compounded when data is needed from different PV domains. For example, a designer looking to optimize power would need dynamic power information, path margin and max cap information all generated by different tools in different formats in different locations. Data mining has been historically done by adhoc scripts to parse through different reports, and log files. Data generated is post processed and then visualized. Any requirement change in data mining would need changes in the scripts. There is no data mining model which supports multiple tools with different output formats. There is no methodology which supports cross domain analysis.

We present IDEA (Impromptu Data Extraction and Analysis). IDEA is data mining and data analysis framework in a highly interactive web application platform. It supports assimilating data from different tools and formats into one data organization in the form of SQL tables. SQL enables compact organization and faster queries. IDEA framework is built using the Linux-Apache™-Mysql™-Perl (LAMP) packages and uses the R language for performing statistical analysis on the data. R language enables handling huge amount of data with support for different statistical plots like pie-charts, histograms, box plots, scatter plots, Linear regression etc. IDEA data mining completes in minutes compared to hours/days with conventional approaches like scripts. IDEA is highly interactive web application with all the data extraction and plotting functionalities abstracted using highly interactive widgets. IDEA has been used to data mine power savings post Optimization, Analysis of power distribution, Profile the speed paths, Review standard cells usage, Utilization of cell sizes across the design space, RC delays per path stage and has multiple other usages.

Large precious unorganized data lies unexploited. Structured Data Mining essential for competitive VLSI design. Increasing complexity makes data analytics a must-have for quality design. No EDA tool exists today to do this critical data mining. IDEA fills this gap and provides valuable data mining capability. It is time to think of Data Mining as a EDA product

, FOR INTERNAL USE ONLY

Intel Information Technology

Design Process Design Reports

Extraction reports

Timing Reports

Route utilization Cell utilization

Implement

Logic

Verify

DRC reports Layout Checks

4

Multiple Tools

Multiple Reports

Multiple Formats Functional Circuit Design

Large Data gets generated requiring interpretation and Analysis

Intel Information Technology

, FOR INTERNAL USE ONLY

Data Conundrum

5

Design Quality Increasingly Dependent on Multiple Parameters

Intel Information Technology

, FOR INTERNAL USE ONLY

6

Data Mining - A Constant Activity

Tech Readiness

• Data mining is done to generate design heuristics

Design execution

• Data mining is done to determine delta changes on design limits

• Data mining needed for optimization

Post silicon • Data mining is done to root cause and understand the

PV-Silicon miscorrelation

Formal Data Mining Tool or Model Not Currently Available In Industry

Intel Information Technology

, FOR INTERNAL USE ONLY

To solve this Data Mining problem, we present

7 , FOR INTERNAL USE ONLY

Intel Information Technology

8

IDEA

Impromptu Data Extraction and Analysis (IDEA) is

 Web application for Data mining on an open architecture

 Linked Data Caching SQL databases

 Common Xml interface for data manipulation

 Statistical analysis capability with ‘R’ Language

 Practically unlimited capacity with ‘R’ Language

 Data visualization capability

 Histograms, pie charts, density/scatter plots, dot charts

 Faster turn around time (no text parsing scripts)

 Intuitive, web based user interface

Highly Interactive Application for Data Mining

Intel Information Technology

, FOR INTERNAL USE ONLY

IDEA

9 , FOR INTERNAL USE ONLY

Web Based Data Mining Platform

Intel Information Technology

IDEA Architecture

Application Tier

10

Presentation Tier

DataBase

Storage Tier

, FOR INTERNAL USE ONLY

Three Tiered Web Application

Intel Information Technology

11

Architecture – Idea Client

Control

Center

Data

Extraction

Data

Manipulation

AJAX Calls

JSON for data transfer

Data

Viewer

Experiments

Idea Server Report

Viewer

Apps

Statistical Analysis

Spreadsheet Generation

PDF Converter

Simple Client with Powerful Capabilities

Intel Information Technology

, FOR INTERNAL USE ONLY

Basic Usage Flow

12

Open IDEA App Select Project Select Blocks Manipulate Data

Generate PDF or export to spreadsheets

Run Statistical

Analysis and

Reports

Run pre-selected

Queries OR Query interactively

, FOR INTERNAL USE ONLY

Intel Information Technology

13

IDEA Usage and Benefits

Datamine power savings post Optim

RC delays per path stage

IDEA

Usage

Utilization of cell sizes across the design space

Review standard cells usage

Analysis of power distribution

Profile the speed paths

, FOR INTERNAL USE ONLY

- Data Mining Simplified -

Intel Information Technology

Summary

• Large precious unorganized data lies unexploited

• Structured Data Mining essential for competitive VLSI design

• Increasing complexity makes data analytics a must-have for quality design

• No EDA tool exists today to do this critical data mining

• IDEA fills this gap and provides valuable data mining capability

14

- It is time to think of Data Mining as a EDA product -

Intel Information Technology

, FOR INTERNAL USE ONLY

Acknowledgements

 Everyone at Intel who contributed to this work

15 , FOR INTERNAL USE ONLY

Intel Information Technology

Download