Copyright © 2004, SAS Institute Inc. All rights reserved. Paul Kent VP SAS Platform Research & Development Forthcoming Changes in SAS.


1 Forthcoming Changes in SAS
Paul Kent, VP, SAS Platform Research & Development

2 Where do I come from?
• New Hill, North Carolina: Y’all
• Johannesburg, South Africa: Julle
• Fareham, England: ???

3 R & D :: Loyal Employees

4 R & D groups, and where I come from
• Platform
• Clients
• Solutions With Analytics

5 R & D groups, and where I come from
• Platform
• Clients
• Solutions With Analytics

6 What do we programmers do?
• Gather Data
• Organise Data
• Arrange Data for consumption
• Facilitate said consumption
• Create understanding of Data
• Promote understanding of said Data
[Diagram label: Value]

7 Who do we programmers do it for?
[Diagram: Information Delivery Framework, plotting Value against an Audience Continuum. Audience labels: Information Consumers, Domain Experts, Power User, Business Analyst, Info Tech (Large % to Small %). Activity labels: Web Report Viewing, Web Reporting, Power Reporting, Analytic Reporting.]

8 Forthcoming Improvements in the SAS Foundation
• ODS (and the new ODS statistical graphics)
• SAS Database Storage capabilities
• The Data Step and Proc SQL
• Grid Computing Capabilities
• Bits and Pieces

9 ODS Statistical Graphics

10 Survival Plot Using PROC LIFETEST in SAS 8
• J. Zhou, NESUG 2002
• Three-page SAS program with macros
• Use GPLOT and GREPLAY for graphics
[Figure callouts: Statistical Metadata, Overlaid Curves]

11 Statistical Graphics
• Essential for modern data analysis
• Difficult to create in SAS prior to SAS 9
  - Context lost when statistical procedure terminates
  - Programmer must recreate context, metadata
• Statistical procedures should automatically create graphics
• Follow the 80-20 rule – 20% of these might need further tweaking, but for the most part…

12 Life Is Easier in SAS 9 …
    ods graphics on;
    ods html file="lifetest.htm";
    proc lifetest data=surv;
       time surv*censor(1);
       survival plots=(survival hwb);
       strata trt;
       id patient;
    run;
    ods html close;
    ods graphics off;

13 LIFETEST Procedure – Survival Plot

14 LIFETEST Procedure – HWB plot

15 Usage of ODS Statistical Graphics in SAS 9
• Experimental in 30 SAS/STAT and SAS/ETS procedures - SAS 9.1
• Automates creation of commonly used graphical displays for a particular analysis
• Production in SAS 9.2

16 Procedures Using ODS Graphics
• SAS/STAT Procedures: ANOVA, CORRESP, GAM, GENMOD, GLM, KDE, LIFETEST, LOESS, LOGISTIC, MI, MIXED, PHREG, PLS, PRINCOMP, PRINQUAL, REG, ROBUSTREG, TPSPLINE
• SAS/ETS Procedures: ARIMA, AUTOREG, ENTROPY, EXPAND, MODEL, SPECTRA, SYSLIN, TIMESERIES, UCM, VARMAX, X12
• SAS High Performance Forecasting: HPF

17 ODS Graphics Primer
• One statement “turns on graphics”: ODS GRAPHICS ON;
• Procedure options determine “which plot”
• Template determines “what plot looks like”
  - SAS provides default template for each plot
• Style determines “what all my plots look like”
• Destination determines “where my plots go”
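The primer's five ideas can be seen together in one short job. This is a minimal sketch, not taken from the slides: it assumes a release where ODS Graphics is production (SAS 9.2 or later), uses the shipped sample table SASHELP.CLASS, and the output file name reg.pdf is a placeholder. Option spellings in the experimental SAS 9.1 release may differ.

```sas
/* Statement: turn ODS Graphics on */
ods graphics on;

/* Destination (PDF) decides where the plots go;
   style (JOURNAL) decides what all the plots look like */
ods pdf file="reg.pdf" style=journal;

/* Procedure option decides which plot: here the fit diagnostics.
   The default SAS-supplied template decides what that plot looks like. */
proc reg data=sashelp.class plots=diagnostics;
   model weight = height;
run;
quit;

ods pdf close;
ods graphics off;
```

Changing only the STYLE= value, or swapping ODS PDF for ODS HTML, changes the look or the destination without touching the procedure code.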

18 PROC ROBUSTREG Templates
[Diagram: PROC ROBUSTREG produces ODS output objects, each pairing data with a template (table data with a table template, graph data with a graph template). Together with styles, these pass through the ODS output destination engine and the statistical graphic engine to the destinations HTML, Data Set, RTF, PDF, and Postscript.]
Graph objects and their templates:
    Diagnostics         Stat.Robustreg.Graphics.Diagnostics
    ResidualHistogram   Stat.Robustreg.Graphics.ResidualHistogram
    ResidualQQPlot      Stat.Robustreg.Graphics.ResidualQQPlot

19 Graphics Supported in ODS Destinations
    Destination   SAS Release
    HTML          9.1
    RTF           9.1
    PRINTER       9.1
    PDF           9.1
    LATEX         9.1
    LISTING       9.2

20 Histogram of Robust Residuals

21 Template for Histogram
    proc template;
       define statgraph Stat.Robustreg.Graphics.ResidualHistogram;
          dynamic _DEPLABEL;
          Layout Gridded;
             Layout Gridded / columns=2;
                EntryTitle "Distribution of Robust Residuals for";
                EntryTitle _DEPLABEL;
             EndLayout;
             Layout Overlay / xaxisopts=(label="Robust Residuals")
                              yaxisopts=(label="Percent");
                Histogram RResidual;
                Density RResidual / LegendLabel="Normal Density" name="Normal";
                Density RResidual / Kernel() LinePattern=dashlong
                                    LegendLabel="Kernel Density" name="Kernel";
             EndLayout;
             DiscreteLegend "Normal" "Kernel";
          EndLayout;
       end;
    run;
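Before editing a template like this one, it helps to find out which template a procedure actually uses in the installed release. The sketch below is one way to do that, assuming SAS 9.2 or later and the sample table SASHELP.CLASS. The template path in the SOURCE statement is copied from the slide and is an assumption: production releases may use a different name, so take the name reported by ODS TRACE or by the LIST statement instead.

```sas
/* Write the name and template of every output object to the log */
ods trace on;
ods graphics on;

proc robustreg data=sashelp.class plots=histogram;
   model weight = height;
run;

ods graphics off;
ods trace off;

proc template;
   /* List the templates stored under the procedure's path */
   list Stat.Robustreg;
   /* Print one template's definition to the log.
      Path taken from the slide; substitute the name shown by the trace. */
   source Stat.Robustreg.Graphics.ResidualHistogram;
run;
```

The definition printed by SOURCE can be copied into a PROC TEMPLATE step, edited, and resubmitted to change how the plot is drawn.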

22 PROC GLM

23 PROC GLM (ANCOVA)

24 GAM Procedure

25 HPF Procedure

26 KDE Procedure

27 KDE Procedure

28 LOESS Procedure

29 LOGISTIC Procedure

30 MIXED Procedure

31 MIXED Procedure

32 PHREG Procedure

33 PLS Procedure

34 PRINCOMP Procedure

35 REG Procedure

36 REG Procedure (Simple Regression)

37 TIMESERIES Procedure

38 UCM Procedure

39 UCM Procedure

40 Integration with ODS Styles
• Over 30 different styles
• New style elements for statistical graphics
  - Fitted line
  - Confidence lines and bands
  - Prediction Lines
  - Outliers
  - Classification groups

41 Style Demonstration
    ods html file="robustreg.htm" style=journal;
    ods graphics on;
    title "Journal Style";
    proc robustreg data=mydata plot=all;
       model y = x1 x2 x3;
    run;
    ods html close;
[Output panels for the Journal, Analysis, Default, and Statistical styles; only Summary Statistics and Residual Histogram output shown]

42 Summary
• Goal is to automate creation of graphics by statistical procedures
  - Minimum work for user
  - Maximum built-in functionality
• Experimental in SAS 9.1
• Production in SAS 9.2

43 SAS Transactional Storage (aka SAS Database Capabilities)
• Demo Time
• 1. Color_table
  - Remember to start your TableServer
• 2. Customers
  - Remember to start your AppServer (tomcat5)

44 SAS Transactional Storage (aka SAS Database Capabilities)
• A more traditional Database Capability
• From SAS (not Oracle, IBM, or Microsoft)
• Based on open-source “Firebird”
• Real Datatypes – INT, MONEY, VARCHAR
• Real Connectors – JDBC, ODBC, SAS Libname
• Real Transactions – Rollback and Commit
• MultiUser Server

45 What’s New in SAS Grid Automation
Cheryl Doninger, R&D Director, Grid Development
Roger Thompson, Relationship Manager
Merry Rabb, Product Manager, Grid

46 Grid Computing Market Size & Growth
Rapid Adoption of Grid Computing Based on Benefits

47 Grid Adoption is Increasing
• A high percentage of firms using analytical applications are considering grid
• 2/3 of firms surveyed are using or considering grid technology

48 Benefits of Grid Computing
• Faster results
• More executions – more data
• Time to recover from errors
• Better use of resources
• Virtualize resources
• Incremental IT spend

49 Types of Applications Suitable for Grid
• Long running
• Many replicate runs of same fundamental task
  - simulation (what if analysis)
  - optimization (testing lots of scenarios)
  - BY GROUP processing
  - data segmentation
• Independent tasks running against large data sources
  - scoring – risk analysis
  - multiple procedures and data steps

50 SAS Grid Strategy
• Infrastructure benefits SAS applications
  - large data / complex algorithms
• Focus areas
  - Development
  - Run-time
  - System management
• Incremental Releases

51 SAS Grid Roadmap Phase I
• SAS 8.2 functionality
  - %Distribute
  - SAS/CONNECT
  - SAS log

52 SAS Grid Success Stories
• Texas Tech University
• Statistics Canada
• Large Pharmaceutical Company

53 SAS Grid Roadmap Phase II
• SAS 9.1.3 Q3/2005 functionality
  - smarter engines for SAS IDEs
  - SAS/Platform integration
  - SASMC monitoring

54 Business Analytics - Enterprise Miner on SMP

55 Business Analytics - Enterprise Miner on Grid

56 Data Integration – ETL Studio on SMP/Grid

57 Data Integration – ETL Studio on SMP/Grid

58 Business Intelligence – Enabled on SMP/Grid
[Diagram labels: SAS Stored Process, SAS Program, ETL Studio, Enterprise Miner, Web Services]

59 Grid Manager Plugin – job view

60 Grid Manager Plugin – host view

61 SAS 9 Grid Computing Components
Multiple Components Working Together to Provide Grid Computing
[Diagram labels: SAS Applications; Enterprise Miner; Stored Processes; Data Integration; SAS Connect; Multi-Processor SAS; Piping; Distribution; Session Spawning; Grid Enabled Code Generation; Grid Manager Plug-in; Platform Suite for SAS; Grid Monitoring; Grid Management; Job Termination; Dynamic Load Balancing; Job, Queue & Host Management; NEW September 2005]

62 General Layout of a SAS Grid
[Diagram labels: Client Machine; Metadata Server; Grid Control Machine; Grid Node … n; SAS Grid Machine; Grid Mgr plugin; Platform Suite for SAS; LSF; SAS ETL; SAS EM; SAS Foundation]

63 Grid Work Flow
[Diagram: ETL Studio and Enterprise Miner clients, a Connect Client, SAS Servers (Metadata Server, Workspace Server), SAS MC, LSF, and grid nodes Node1, Node2, Node3, … n]
LSF Cluster File:
    Node1 ! ! 1 () (SASMain)
    Node2 ! ! 1 () ()
    Node3 ! ! 1 () (SASMain)
    …
SAS Metadata: SASMain – Server Context; Platform Server Component: sas -noobjectserver
    session   resource   sascmd                 wl options
    -------------------------------------------------------
    p1        SASMain    sas -noobjectserver
Client code:
    grdsvc_enable(p1, "resource=SASMain");
    signon p1;
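The workflow on this slide can be extended to the "independent tasks" case from the earlier list of grid-suitable applications. The sketch below is illustrative only. It assumes SAS/CONNECT and a configured SAS grid with a server context named SASMain (the name used on the slide), and a client session that can already reach the metadata server. The session names task1 and task2 and the two analyses are made up for the example. GRDSVC_ENABLE is called through %SYSFUNC here, which is the form used in released versions; the slide shows an abbreviated call.

```sas
/* Ask the grid to place every SAS/CONNECT session started below */
%let rc = %sysfunc(grdsvc_enable(_all_, resource=SASMain));

/* Two independent tasks, each in its own grid-launched session */
signon task1;
rsubmit task1 wait=no;
   proc means data=sashelp.class;
      var height weight;
   run;
endrsubmit;

signon task2;
rsubmit task2 wait=no;
   proc freq data=sashelp.class;
      tables sex;
   run;
endrsubmit;

/* Block until both tasks finish, then release the sessions */
waitfor _all_ task1 task2;
signoff _all_;
```

Because the tasks share no intermediate data, the grid is free to run them on different nodes at the same time.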

64 Partitioning the Grid
[Diagram: ETL Studio and Enterprise Miner clients, a Connect Client, SAS Servers (Metadata Server, Workspace Server), SAS MC, LSF, and grid nodes Node1, Node2, Node3, … n, partitioned into an EM grid and an ETL grid]
LSF Cluster File:
    Node1 ! ! 1 () (SASMain,EM)
    Node2 ! ! 1 () (SASMain,EM,ETL)
    Node3 ! ! 1 () (SASMain, ETL)
    …
SAS Metadata: SASMain – Server Context; Platform Server Component: sas -noobjectserver; workloads EM, ETL
    session   resource   sascmd                 wl options
    -------------------------------------------------------
    p1        SASMain    sas -noobjectserver    ETL
Client code:
    grdsvc_enable(p1, "resource=SASMain, workload=ETL");
    signon p1;

65 Grid Provides: Speed and Efficiency

66 Analytics are working, so people…
• Build more models
  - For successively refined segments of customers
• Use more data in those models
• Integrate the results into operational systems
• A SAS 9.2 DATA step movie

67 Implications
• More Multi thread enablement within SAS
• Yes, even the DATA STEP
• Saved Programs
• Multi Threaded Server Capabilities
  - Same model, parallel data for throughput
  - Many models, same data – one off scores in operational systems
• Models Management can deploy models to “score servers” without restarting them

68 Bits and Pieces
• Reverse Engineer SAS jobs
• Checkpoint and Restart SAS jobs
• Encode (and protect) your SAS jobs
• ZIP functions
• CRC …

70 Protect your IP
• PROC SCRAMBLE file='myfile.sas' outfile='secret.sas' … ;
• Send secret.sas to your customers
• %include 'secret.sas';
  - Implies nosource; your macros can reset NOMPRINT…

71 Checkpoint/Restart and Parallelization Features in the Core Supervisor
Rick Langston, Core Systems Department

72 Checkpoint/Restart
• Craig R.’s request as per user community
• Job fails – want to restart where it left off
• ETL Studio also wanted a restart facility

73 A simple solution
• Record a checkpoint number, save it in WORK
• If restarting, skip PROC / DATA steps to there
• Tokenize everything
• Execute all global statements

74 To set up for checkpointing
• Use NOWORKINIT, NOWORKTERM
• Have WORK refer to a permanent directory
• Use the CHECKPOINT option

75 Subsequent restarting
• Again use NOWORKINIT, NOWORKTERM
• Again use WORK to the permanent directory
• Use the RESTART option
• Job will restart as of the last successful step

76 Is this what users want?
• We can’t do this without user being proactive
• data temp / set temp issues
• skipped steps may need to be executed
• Output files (flat files – DISP=MOD, databases…)

77 EXECUTE_ALWAYS
• CHECKPOINT / EXECUTE_ALWAYS;
• Use it for a step that must be executed
• For example, SYMPUT and CALL EXECUTE
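The SYMPUT case is worth spelling out: a macro variable lives only in the session that created it, so on a restart the step that sets it has to run again even though it completed successfully the first time. The sketch below illustrates this. It uses the syntax of the feature as later released, which is an assumption relative to these slides (they say the option names were still undecided): the statement is written CHECKPOINT EXECUTE_ALWAYS; and the invocation options are STEPCHKPT and STEPRESTART. The data set names and the cutoff date are hypothetical.

```sas
/* Assumed invocations (released option names, not the provisional ones):
     first run : sas -sysin job.sas -stepchkpt -noworkinit -noworkterm
     restart   : sas -sysin job.sas -stepchkpt -steprestart -noworkinit -noworkterm */

/* Step 1: completed on the first run, skipped on restart;
   its output survives because WORK is not cleared */
data work.orders;
   input order_id order_date :date9.;
   format order_date date9.;
   datalines;
1 15DEC2004
2 20JAN2005
;
run;

/* Step 2: must run on every invocation, because the macro variable
   it creates does not survive from the failed session */
checkpoint execute_always;
data _null_;
   call symputx('cutoff', '01JAN2005'd);
run;

/* Step 3: depends on both the saved table and the macro variable */
data work.recent;
   set work.orders;
   where order_date >= &cutoff;
run;
```

Without the CHECKPOINT EXECUTE_ALWAYS statement, a restart that resumes at step 3 would find &cutoff unresolved.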

79 Example
• Using options debug='checkpoint-implicit';
• Option names still to be decided

82
    data temp1; x=1; run;
    data temp2; x=2; run;
    data temp3; x=3; run;
    data _null_; if "&sysparm."="1" then abort abend 999; run;
    data temp4; x=4; run;

83
• Invoke once with checkpoint-implicit
• Then reinvoke with restart-implicit

85 Additional info
• Planned for 9.2
• Option names still being decided
• Wanting additional input

86 Parallelization Efforts
• Reading in arbitrary SAS code
• Producing metadata in comments
• This could be post-processed by ETL Studio
• This could be post-processed by Grid Computing

87 Parallelization Efforts
• Researching so far
• Hooks in dependency opens
• Catalogs, flat files, SAS data sets, etc.
• Emitting info in comments
• Example of use

93 Exposure to User
• New option, such as DEPMETA=fileref
• SAS program with comments written to this file

94 Questions/comments?

95 Ideas for the Future!
• How can the software learn?
• So the user doesn’t have to learn about the software; they can learn the business!
• Some future ETL Studio job
  - Remembers data volumes from last week’s run
  - Uses that memory to choose a better strategy

96 Your Turn!!
• You tell me next time SAS forgets something it should have remembered
• And why remembering that would help SAS improve next time
Paul.Kent@sas.com
Thanks for listening!

