1 The GSRC Bookshelf Andrew B. Kahng and Igor L. Markov September 24, 1999.

Slides:



Advertisements
Similar presentations
Usage statistics in context - panel discussion on understanding usage, measuring success Peter Shepherd Project Director COUNTER AAP/PSP 9 February 2005.
Advertisements

Creating Institutional Repositories Stephen Pinfield.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Configuration management
Configuration management
WHY CMS? WHY NOW? CONTENT MANAGEMENT SYSTEM. CMS OVERVIEW Why CMS? What is it? What are the benefits and how can it help me? Centralia College web content.
HP Quality Center Overview.
Submission Process. Overview Preparing for submission The submission process The review process.
Key-word Driven Automation Framework Shiva Kumar Soumya Dalvi May 25, 2007.
Requirements Engineering n Elicit requirements from customer  Information and control needs, product function and behavior, overall product performance,
July 11 th, 2005 Software Engineering with Reusable Components RiSE’s Seminars Sametinger’s book :: Chapters 16, 17 and 18 Fred Durão.
DARPA Bookshelf For VLSI CAD Algorithms: Progress and Future Directions Andrew E. Caldwell, Andrew B. Kahng and Igor L. Markov.
Advances research methods and proposal writing Ronan Fitzpatrick School of Computing, Dublin Institute of Technology. September 2008.
1 Introduction to Software Engineering Lecture 42 – Communication Skills.
Hypergraph Partitioning for VLSI CAD: Methodology for Heuristic Development, Experimentation and Reporting Andrew E. Caldwell, Andrew B. Kahng, Andrew.
UMass Lowell Computer Science Advanced Algorithms Computational Geometry Prof. Karen Daniels Spring, 2004 Project.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Software Requirements
Overview of Software Requirements
Introduction to Implementing an Institutional Repository Delivered to Technical Services Staff Dr. John Archer Library University of Regina September 21,
Business research methods: data sources
© Copyright Eliyahu Brutman Programming Techniques Course.
1 The GSRC Bookshelf Andrew B. Kahng and Igor L. Markov September 2, 1999.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 2 Introduction to Database Development.
Supplement 02CASE Tools1 Supplement 02 - Case Tools And Franchise Colleges By MANSHA NAWAZ.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation Mike Smorul, Joseph JaJa, Yang Wang, and Fritz McCall.
© 2006 Pearson Addison-Wesley. All rights reserved2-1 Chapter 2 Principles of Programming & Software Engineering.
DARPA Calibrating Achievable Design Jason Cong, Wayne Dai, Andrew B. Kahng, Kurt Keutzer and Wojciech Maly.
Pattern Myths1 Ten Design Pattern Myths Jim Fawcett condensed from Pattern Hatching, John Vlissides, Addison-Wesley, 1998.
University of Jyväskylä – Department of Mathematical Information Technology Computer Science Teacher Education ICNEE 2004 Topic Case Driven Approach for.
1 CMPT 275 Software Engineering Requirements Analysis Process Janice Regan,
Configuration Management
Computer Software.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse 2.
This chapter is extracted from Sommerville’s slides. Text book chapter
Management, marketing and population of repositories Morag Greig, University of Glasgow.
Starting Chapter 4 Starting. 1 Course Outline* Covered in first half until Dr. Li takes over. JAVA and OO: Review what is Object Oriented Programming.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 18 Slide 1 Software Reuse.
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Overview of the Database Development Process
Systems Analysis – Analyzing Requirements.  Analyzing requirement stage identifies user information needs and new systems requirements  IS dev team.
Requirements Analysis
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
Introduction to MDA (Model Driven Architecture) CYT.
Panel Discussion Part I Methodology Ideas from adult MR brain segmentation are used in neonatal MR brain segmentation. However, additional challenges.
Software Requirements Engineering CSE 305 Lecture-2.
[ §3 : 1 ] 2. Life-Cycle Perspective Overview 2.1 Motivation 2.2 Waterfall Model 2.3 Requirements in Context.
SAMANVITHA RAMAYANAM 18 TH FEBRUARY 2010 CPE 691 LAYERED APPLICATION.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
Configuration Management (CM)
ISO 9001:2008 to ISO 9001:2015 Summary of Changes
An Introduction to Software Engineering. Communication Systems.
1 Introduction to Software Engineering Lecture 1.
January 2005MERLOT Reusable Learning Design Guidelines OVERVIEW FOR MERLOT Copyright 2005 Reusable Learning This work is licensed under a Attribution-NoDerivs-NonCommercial.
Rational Unified Process Fundamentals Module 7: Process for e-Business Development Rational Unified Process Fundamentals Module 7: Process for e-Business.
© 2006 Pearson Addison-Wesley. All rights reserved 2-1 Chapter 2 Principles of Programming & Software Engineering.
Doc.: IEEE /0041r0 Submission November 2006 Mark Austin, OfcomSlide 1 SEAMCAT Introduction Presentation to IEEE Notice: This document has.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Design Reuse Earlier we have covered the re-usable Architectural Styles as design patterns for High-Level Design. At mid-level and low-level, design patterns.
Review of Parnas’ Criteria for Decomposing Systems into Modules Zheng Wang, Yuan Zhang Michigan State University 04/19/2002.
1 Chapter 12 Configuration management This chapter is extracted from Sommerville’s slides. Text book chapter 29 1.
44222: Information Systems Development
Accurate  Consistent  Compliant Contact: i4i the structured content company the structured content company.
C.A.D.: Bookshelf June 18, 8:00am-11:00am. Outline Review: [some of] bookshelf objectives Where we want to go vs what we have now Invited presentations.
Engaging Students in Technical Modules: The Quest to Promote Student Identification of Problematic Knowledge. Dr William Lyons, School of Engineering,
ICAD3218A Create User Documentation.  Before starting to create any user documentation ask ‘What is the documentation going to be used for?’.  When.
Human Computer Interaction Lecture 21 User Support
CAD-IP Reuse via the Bookshelf for Fundamental VLSI CAD Algorithms
Model-Driven Analysis Frameworks for Embedded Systems
Analysis and Understanding
Presentation transcript:

1 The GSRC Bookshelf Andrew B. Kahng and Igor L. Markov September 24, 1999

2 Outline u Observations –barriers to entry into research –wrong incentives for publishing research results u Bookshelf as a new medium –possesses features of an e-publication –what bookshelf is and what it is not –adds value through policies and availability u Implementation –structure, types of entries and acceptance standards u Use scenarios u Current progress and futures

3 Barriers to Entry/Access u Difficult combinatorial optimizations require complex metaheuristics to satisfy runtime/QOR requirements –theoretical analyses cannot distinguish good approaches from bad –30+ year history for some formulations; very sophisticated techniques –entry into field, testing ideas requires reuse (hopefully w/understanding) u Where is the leading edge? –implementations unavailable (“competitive advantage”) –methods not replicable from descriptions (“omitted for space reasons”) –reported results not verified (“trusted”) –comparisons to recent previous work not required (no excuse) u Barriers to entry/access limit the rate of advance –best new ideas cannot be identified –new work cannot build on previous work –informed adoption of existing techniques by industry is impossible

4 You Get What You Incent u Traditional publication in conferences/journals –biased toward theoretical results, descriptions of “novel techniques” u no reward for reusable practical contributions (e.g., source code for good implementation of old algorithm), nor for confirming work u easy to publish “novel” but poorly-performing heuristics u implicit assumption: value lies in description of algorithm (rather than understanding or implementation) –on the other hand: undocumented/implicit implementation decisions far outweigh most claimed “advances” (“FM”, “annealing”, …) u Key problem: no incentive to “do the Right Thing” –no respectable electronic publications in VLSI CAD (cf. physics, math) –no credit/compensation for distributing implementations, relevant details –no downside to ignoring previous work, available comparisons, and practical use models (runtime/QOR context) u GSRC Bookshelf == incentive-/infra- structure to fix this !

5 The Bookshelf Initiative u Standardizes data formats u Focuses on algorithm implementations –collects what’s available –encourages openness and competitiveness –solicits “missing” implementations u Adds value –24x7 availability and automation, encyclopedic coverage –impartial policies and scalable user interfaces –forward-looking acceptance standards and work with authors –all implementations available for free for any purpose (?) –ranking u Offers parallel publishing/ranking mechanisms: –“cathedral”: fully controlled and ranked by steering committee –“bazaar”: relies on immediate publishing and ranking by community

6 Bookshelf as a New Medium u Publication medium and review process for implementations –electronic infrastructure permits archival, “arbitrarily large” publications –credit for leading-edge work via respectable, referable nature u compensates “loss of competitive advantage” u Facilitate review  reviews/evaluations, returns for fixes  preserve conference/journal quality u establish easy comparisons to catch novel but useless algorithms u Backed by other incentives, openness u encourage simultaneous contributions from all researchers in a given area u flexible, non-draconian acceptance standards u targeted (possibly funded) areas for contributions u Backed by culture change u penalize non-use of available comparisons to previous work u penalize withholding of detailed descriptions, evaluatable implementations

7 Bookshelf and Electronic Publication u E-publication is common in physics, mathematics communities u low-volume refereed periodicals ( and high-volume archives (xxx.lanl.gov) u well-known editors, NSF funding, community acceptance… u example: xxx.lanl.gov started in 1991, NSF-funded, ~2500 submissions in January 1999 complex hand-made infrastructure, submission via Web upload, MIME- aware , ftp, vanilla u Bookshelf –aims for thorough coverage of a domain (“institutionalized community memory” for VLSI CAD) –clear focus areas u working algorithm implementations, implementation techniques u practical usage/use-model contexts, evaluation and comparison methodologies u eventually: implementation reuse –activist approach u incentivizing particular areas for implementation (cf. “special issue”) u finding, improving, integrating available implementations (“invited/edited”)

8 What the GSRC Bookshelf Is u Repository for released/published algorithm implementations –(with accompanying documentation, evaluation, etc. material) –most likely scenario for any given entry: one write, many reads u note: publications are generally static! u Open, maximally inclusive –Occam’s Razor applied to all formal “standards” and “rules” –hypersensitive to perceptions of arbitrariness, exclusionary behaviors –standards rely on informal peer review, policies of Steering Committee u Focused on underlying motivations, goals –improved effectiveness, impact of VLSI CAD heuristic algorithm research –more rapid, complete communication between research groups –more rapid adoption of research advances by industry –appropriate coverage, demonstrated utility, community acceptance –overall maturation and culture change in the VLSI CAD field –achievement of goals by providing infrastructure, examples, incentives

9 What the GSRC Bookshelf Is Not u Not concerned with development processes –doesn’t matter whether paper was written w/ typewriter, troff, Word –focus is on results, relevance, usability –e.g., authors may use confidential and unpublishable source code u Not concerned with teaching of development processes –separate from enforcement of standards –possibly useful to improve quality of implementations (saves us work) –may be pursued given explicit demand u Not concerned with infrastructure for frequent maintenance –emphasize initial quality of submissions –avoid “maintenance wars” –little need to support versioning u qualitative improvements should be credited as new submissions (e.g., DAC97 paper on HMetis1.0, DAC99 paper on HMetis1.5) u authors support versioning independently and make consistent releases

10 Bookshelf Structure (high-level) u Data hierarchy –a bookshelf covers a “domain” e.g., VLSI CAD –slots cover the domain by “areas” –each area is covered by submission in the respective slot –individual submissions represented by “entries” u Steering committee –interprets what belongs to the ‘domain’ and what does not –formally introduces new slots –solicits entries necessary to improve coverage –organises reviews of submissions –makes acceptance decisions –works with authors on revising submissions

11 Types of Entries by Function u Generic problems –standard file formats –standard in-memory representations (classes) –passed to optimization engines –integrated I/O including parsers of standard benchmark formats –standard (benchmark) instances –some information about solutions, best known solutions u Reference solver implementations –usable in successful applications and comparable to best reported –support modifications and performance analysis –accommodate alternative modules to determine best combinations u Independent evaluators u Heuristic evaluation and comparison methodology –descriptions of testing procedures and best known results –precepts for experimental evaluation of metaheuristics –references to relevant benchmarks and reference implementations

12 What Can Be Mapped Into Entries u Bibliographies, hyperlinks and other lists of resources u Expositions u Research papers, including experimental studies u Descriptions of standard data formats u Implementations –testcase generators, optimization algorithms –evaluators, consistency checkers u Standard testcases and known good solutions u Statistical data –characterisations of real-world instances –distributions of solution costs for known methods –best known solution costs etc

13 Dependency Model u Optimised for multiple independent submissions –main goal: availability of implementations –considerable reuse not expected at the beginning –unnecessary complications considered a burden –duplication allowed to simplify dependencies –refusing versioning support simplifies dependencies u Extensions for reuse –scalability is a requirement –but we do not want to frighten the illiterati –implement extensions for reuse only when necessary after establishing a reliable author/user base

14 Acceptance Standards u Enforced by individual reviewers (by decr. criticality) –availability –fitness –consistency –documentation and examples of use –availability of tests u Enforced by steering committee (by decr. criticality) –accurate labeling –focus and utility –compatibility with common data formats –ease of evaluation using published mechanisms –acknowledgement of contributions, prior publications and support –compliance with copyright laws

15 Technical Requirements u Single-file submissions: tar.Z or tar.gz (tar.bz2 ?) u Consistent choice of Unix, DOS or Mac-type line-end u Accurate labelling of file types –non-ASCII files graphics, e.g., postscript, PDF, GIF, JPEG executables data in publicly available formats (e.g., GDSII) data in submission-specific formats –executables interpreted language(s) used, e.g., sh/csh/per platform for compiled binaries type of libraries: static or shared executables linked: statically, dynamically, semistatically u At least 1of 3: Linux, Solaris, NT u Interpreted languages –must be supported by 1 of 3 platforms out-of-the-box

16 Reusing Existing Infrastructure u Leverage the WWW for display, access and docs –convenient and unavoidable –allows submissions with content-specific GUI –prospective entries may already have nice Web site (esp. for docs) e.g., (can link to it or copy) u Use CVS for storage/retrieval ? –easy since everything is likely to be stored in a tree –CVS features barely used (versioning, comments, ownerships etc) –entries = compressed releases or hyperlinks, not development trees –submissions may not be produced from development trees –we do not wish to impose use of CVS on contributors

17 Use Scenarios u Steering committee solicits submissions –bookshelf supports encyclopedic and unbiased coverage u Researchers volunteer to submit their codes as entries –bookshelf gives additional credit to past work u Industrial affiliates publish benchmarks –publicity to the company and a boost for academic research u Students using bookshelf working on dissertation –bookshelf offers reference and educational help u Reviewers use Bookshelf to evaluate a new paper –bookshelf helps easy evaluation u Researchers compare new algos to what’s in Bookshelf –bookshelf ensures competitiveness

18 Current Progress u Creation of several “charter” slots –hot areas: hypergraph partitioning, standard-cell placement, single-tree interconnect synthesis, block placement/packing –emphasis on high quality, exemplary behaviors (e.g., source code release) –will meet stated goal for December (3-5 slots instantiated) u Outreach to academic groups and industrial affiliates –advisory role for slot definition u problem statements u format specifications –contribution of reference data, entries –prototype content of file format slots has been distributed u Current infrastructure is Web-accessible tree

19 Open Issues u How to achieve visibility, critical mass ? –support by contributions –support by editorial policies, conference review policies –need publicity and consistent message (N.B.: embedded tutorial at ICCAD99 was dinged) u Integration of the bookshelf –with the development model supported by GSRC –common data models and file formats u Scalability and infrastructure for reuse –not frightening anyone away with excessive requirements u Policy for dealing with restrictions on reuse

20 Bookshelf Top-down Structure u Overview u Bookshelf as a New Electronic Medium u Bookshelf Slots u Submission (release) Standards u New Data Formats u Source Code Standards (to be expanded) u Copyright issues available at

21 Sketches of Two Slots u Introduction and overview u New Placement Formats u Publicly available instances, solutions and reference performance results u Executable Utilities (converters, generators, statistics browsers, evaluators, constraint verifiers) u Optimizers and other non-trivial executables u Common in-memory representations, parsers and other source codes

22 General Guidelines u Introduction u Motivation and Main Goals u Gotchas u Agreements u Open issues u Availability Status of New Data Formats u Resources u Appendix A. Note to Developers

23 New Common Data Formats u John Lillis at UIC –Single-tree Interconnect Synthesis (.pins,.topo,.target ) – u Patrick Madden at SUNNY Binghamton –Global Routing – u Wayne Dai at UCSC –Block Packing (.blks,.bconstr,.areapin ) – u ABKGroup at UCLA –Standard-cell placement (core formats) (.nodes,.nets,.wts,.scl,.pl ) – –Extensions for partitioning (.blk,.fix,.sol ) –