© Copyright 2014 Wellesley Information Services, Inc. All rights reserved. Best Practices for SAP HANA Modeling and SAP Data Services Data Loading Dr.

© Copyright 2014 Wellesley Information Services, Inc. All rights reserved. Best Practices for SAP HANA Modeling and SAP Data Services Data Loading Dr. Berg Comerit

1 In This Session
- We will explore SAP BusinessObjects Data Services and how to load information into SAP HANA
- You will learn how to create transformations, merges, and joins
- We will look at the best practices of modeling in SAP HANA
- We will see step-by-step how to create calculation, attribute, and analytic views
- At the end of this session, you will know how to load data and create views to analyze the data

2 What We’ll Cover
- BusinessObjects Data Services
  - Data Services Overview
  - Creating Batch Jobs
  - Loading From Flat Files
  - Building Transforms and Using Functions
  - Creating Table Joins
  - Utilizing Data Merging
- SAP HANA
- Wrap-up

3 Data Services Overview
SAP Data Services is a leading technology for enterprise information management, providing solutions for:
- Data integration
- Data quality
- Data profiling
- Text data processing
SAP Data Services transforms, refines, and delivers trusted data for the enterprise data warehouse (EDW).

5 What Are Data Services Batch Jobs?
Batch jobs are used to:
- Extract data from one or many sources
- Transform data to meet the organization’s business requirements
- Load the processed data to a location for use
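
The extract, transform, and load stages of a batch job can be sketched in plain Python. This is only an illustration of the idea, not how Data Services implements it: the column names (CASE_ID, STATUS), the sample rows, and the in-memory list standing in for a target table are all assumptions made for the example.

```python
import csv
import io


def extract(source_text):
    """Extract: read tab-separated rows from a text source."""
    return list(csv.DictReader(io.StringIO(source_text), delimiter="\t"))


def transform(rows):
    """Transform: apply a business rule (here: trim and upper-case STATUS)."""
    return [{**row, "STATUS": row["STATUS"].strip().upper()} for row in rows]


def load(rows, target):
    """Load: append the processed rows to the target (a list standing in for a table)."""
    target.extend(rows)
    return len(rows)


# Hypothetical source data; a real job would read from a file or datastore.
SOURCE = "CASE_ID\tSTATUS\n1\t open \n2\tclosed\n"

target_table = []
loaded = load(transform(extract(SOURCE)), target_table)
```

Keeping the three stages as separate functions mirrors the way a batch job separates sources, transforms, and targets.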

6 Step-by-Step: Creating Batch Jobs
1. Create a new project and give it a relevant project name
2. Right-click on the project to create a new batch job
Giving relevant names to your projects and batch jobs helps keep them organized.

8 Step-by-Step: Loading from Flat Files
1. Select the related batch job to enter its workspace
2. From the ‘Format’ category in the ‘Local Object Library’ panel, right-click on ‘Flat Files’ and select ‘New’

9 Examples of Other Available Data Sources
- Use the local object library to find existing data sources under the ‘Datastore’ category
- Many other data sources can be used in Data Services
- You can upload more files under the ‘Format’ category

10 Formatting the Flat File
3. In the pop-up ‘File Format Editor’, fill in the appropriate fields
- The date format must match the format of the dates in the data
- ‘Tab’ was chosen as the delimiter because the data fields were separated by tabs
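
To see why the delimiter and date format have to match the file, here is a small Python sketch that parses a tab-separated flat file. The date pattern, the CASE_ID column, and the sample row are assumptions for illustration; ODATE and CDATE are the date fields used later in this session.

```python
import csv
import io
from datetime import datetime

# Assumed date pattern; it must match how dates are written in the file.
DATE_FORMAT = "%m/%d/%Y"


def read_flat_file(text, date_columns):
    """Parse tab-delimited text and convert the named columns to dates."""
    rows = []
    for row in csv.DictReader(io.StringIO(text), delimiter="\t"):
        for col in date_columns:
            # strptime raises ValueError if the value does not match DATE_FORMAT
            row[col] = datetime.strptime(row[col], DATE_FORMAT).date()
        rows.append(row)
    return rows


# Hypothetical sample file content.
SAMPLE = "CASE_ID\tODATE\tCDATE\n100\t01/05/2014\t01/19/2014\n"
rows = read_flat_file(SAMPLE, ["ODATE", "CDATE"])
```

If the file used a different pattern (for example day first), the parse would fail or silently swap day and month, which is the kind of error the format setting is meant to prevent.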

11 Defining Table Fields
4. Enter the field properties
Notice the updated view below

12 Preview Data
5. In the Repository under ‘Format’, right-click and select ‘View Data’ to preview the newly added data source
This allows you to check that the source data was populated without error before you use it.

14 Transforms Overview
Transforms are built-in objects that process source data to produce the desired outputs. The most commonly used transform is the Query transform, which enables you to:
- Filter and select data from a source
- Join data from multiple sources
- Map columns from input to output schemas
- Perform data nesting and unnesting
- Add new columns to the output schema
- Assign primary keys to the output schema
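
Three of the Query transform capabilities listed above (filtering, mapping input columns to output columns, and adding a new column) can be sketched in a few lines of Python. The function `query_transform`, the column names, and the sample rows are hypothetical; this is an analogy, not the Data Services API.

```python
def query_transform(rows, where, mapping):
    """Filter rows with `where`, then build each output row from `mapping`.

    `mapping` maps an output column name to a function of the input row,
    which covers both plain column mapping and newly derived columns.
    """
    return [
        {out_col: fn(row) for out_col, fn in mapping.items()}
        for row in rows
        if where(row)
    ]


# Hypothetical input schema: CASE_ID, STATUS, REGION.
source_rows = [
    {"CASE_ID": 1, "STATUS": "CLOSED", "REGION": "East"},
    {"CASE_ID": 2, "STATUS": "OPEN", "REGION": "West"},
]

closed_cases = query_transform(
    source_rows,
    where=lambda r: r["STATUS"] == "CLOSED",           # filter
    mapping={
        "CASE": lambda r: r["CASE_ID"],                # map input to output column
        "REGION_CODE": lambda r: r["REGION"][:1].upper(),  # new derived column
    },
)
```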

15 Adding a Data Flow Object to the Workspace
1. Drag a data flow icon from the tool palette to the workspace
2. Double-click on the data flow to enter its workspace
- The tool palette contains icons that allow the creation of new objects in the workspace.
- When you create a reusable object, such as a data flow, it automatically appears in the local object library.

16 Adding a Data Source to the Workspace
1. Drag a data source (e.g., a flat file) from the local object library onto the workspace
2. Create a connection between the data source and the query

17 Query Editor Overview
3. Double-click on the query transform to open the ‘Query Editor’
The query editor is a graphical interface for carrying out query operations. It contains three areas:
- Schema in area
- Schema out area
- Parameters area

18 Setting Up the Output Table
4. Drag the desired output fields from the ‘Schema In’ section to ‘Schema Out’
You only need to drag the fields that you want to appear in schema out; it is not necessary to drag them all.

19 Creating a New Output Column
1. Right-click on an output field and select ‘New Output Column’
2. Select where to insert the new column
New columns can be created to display results from calculations.

20 Defining Column Properties
3. The ‘Column Properties’ dialog will pop up for you to name the new column and define its properties
Give the column a descriptive name that properly identifies what the column is used for.

21 Using Functions
1. Double-click in the cell under ‘Mapping’
2. Click on ‘Functions’
3. Select the appropriate category and then the specific function
For this demo, we want to calculate the number of days a case was open.

22 Setting Up the Function
4. Define the input parameters for the function
- Use the dropdown list to select the input parameters to avoid typos.
- Notice the updated code in this panel for the NO_DAYS_CASE_OPEN column after defining the input parameters.
- This formula delivers the number of days from ODATE to CDATE, giving us a measurement of how long it takes to close a case.
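
The calculation behind NO_DAYS_CASE_OPEN is a simple date difference. The sketch below shows the same arithmetic in Python rather than in the Data Services function editor; the function name and the sample dates are assumptions for illustration.

```python
from datetime import date


def no_days_case_open(odate, cdate):
    """Number of days from the open date (ODATE) to the close date (CDATE)."""
    return (cdate - odate).days


# Hypothetical case opened on 5 January 2014 and closed on 19 January 2014.
days_open = no_days_case_open(date(2014, 1, 5), date(2014, 1, 19))
```

A negative result would indicate a close date earlier than the open date, which is worth checking for as a data quality issue.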

23 Adding an Output Table to the Workspace
1. Drag and drop a template table into the workspace to be our output table
2. Link the query to the template table
- A template table allows us to view the specific information we want without the risk of altering the source data.
- The data that populates the template table is based on the output schema defined in the query transform.
- A template table is an object that can be used as a target for data when a job executes successfully; it can also be saved in the object library for use as a data source at a later time.

24 Executing a Job
1. Right-click on the job and select ‘Execute’
To analyze any issues that may occur during data loading, click on ‘Enable auditing’ and make sure that ‘Use collected statistics’ is checked.

25 Job Log Overview
The log file displays a list of actions in the job execution. If any errors occur, the error icon will appear. Otherwise, ‘Job is completed successfully’ will be displayed.
The job log has five columns:
- Pid: Process thread identification number of the executing thread
- Tid: Thread identification number of the thread
- Number: Number prefix of the error followed by a number
- Time Stamp: Date and time the thread generated a message
- Message: Error description of the thread

26 Job Log Overview Continued A successful job execution A job with errors will show the error icon Double-click on the error icon to view the list of errors as shown below.

27 How to Preview the Output Table
1. Click on the data flow to open its workspace
2. Click on the magnifying glass of the output table to view its data
Notice that the column created earlier is formatted correctly as a number and that its data is the result of the function defined.

29 Creating Table Joins
A join can be used to combine data from multiple sources into one target. Use the Query transform FROM clause to join the two sources: Query and Join.
In this example, Source 1 has the Car Description for the case, while Source 2 has the Solution to the case. The query transform will combine the data from the two sources in the schema out section to produce a result displaying the overall case solution.
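
As a rough analogy for the join described above, the Python sketch below combines a source holding the car description with a source holding the solution. The join key CASE_ID, the column names, and the sample values are assumptions; the slide does not say which field the two sources share, and this sketch shows an inner join only.

```python
def inner_join(left, right, key):
    """Combine rows from two sources that share the same value in `key`."""
    index = {}
    for row in right:
        index.setdefault(row[key], []).append(row)
    joined = []
    for left_row in left:
        for right_row in index.get(left_row[key], []):
            joined.append({**left_row, **right_row})
    return joined


# Hypothetical rows: Source 1 carries the description, Source 2 the solution.
source_1 = [
    {"CASE_ID": 1, "CAR_DESCRIPTION": "Sedan, engine stalls"},
    {"CASE_ID": 2, "CAR_DESCRIPTION": "Truck, brake noise"},
]
source_2 = [{"CASE_ID": 1, "SOLUTION": "Replaced fuel pump"}]

case_solutions = inner_join(source_1, source_2, "CASE_ID")
```

Case 2 has no matching solution, so an inner join drops it; an outer join would keep it with an empty solution.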

30 Result from a Table Join
1. Once the tables have been joined in the query transform, execute the job as discussed in earlier slides.
2. Enter the data flow workspace and click on the magnifying glass to view the results in the output table.
Notice in the output table below how the Solution column from the Join source is now combined with the fields from the Query transform.

32 Merges Overview
You can merge rows from two or more sources into a single data set. All sources must have the same schema to execute the Merge transform:
- Same number of columns
- Same column names
- Same data type for each column
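
The schema requirement can be made concrete with a short Python sketch. Each source is modeled as a dictionary with a `schema` (a list of column name and type pairs) and its `rows`; this representation, the function name `merge`, and the sample data are assumptions for illustration and not the Merge transform itself.

```python
def merge(*sources):
    """Append the rows of all sources, refusing sources whose schema differs."""
    expected = sources[0]["schema"]
    for source in sources[1:]:
        # Same number of columns, same names, same types, in the same order.
        if source["schema"] != expected:
            raise ValueError("all sources must have the same schema")
    merged_rows = []
    for source in sources:
        merged_rows.extend(source["rows"])
    return {"schema": expected, "rows": merged_rows}


# Hypothetical sources with identical schemas.
SCHEMA = [("CASE_ID", "int"), ("SOLUTION", "varchar")]
east = {"schema": SCHEMA, "rows": [(1, "Replaced fuel pump"), (2, "New brake pads")]}
west = {"schema": SCHEMA, "rows": [(3, "Software update")]}

merged = merge(east, west)
```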

33 How to Create a Merge
1. To merge two sources, add a query transform to each source to format the data identically in both sources
2. Connect the queries to a ‘Merge Transform’
3. When opening the ‘Merge Transform’, notice how all the fields and data types match for all output and input fields.

34 How to Avoid Creating Duplicated Data in Merges
4. To avoid duplicate rows, add a query transform to display distinct rows only
5. Execute the job to complete the merged table
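
Because a merge simply appends rows, a row present in both sources appears twice in the result. The Python sketch below shows the effect of keeping distinct rows only; the function name and sample rows are hypothetical, and it stands in for the distinct-rows option of the extra query transform rather than reproducing it.

```python
def distinct(rows):
    """Keep the first occurrence of each row, preserving the original order."""
    seen = set()
    unique_rows = []
    for row in rows:
        fingerprint = tuple(sorted(row.items()))  # hashable form of the row
        if fingerprint not in seen:
            seen.add(fingerprint)
            unique_rows.append(row)
    return unique_rows


# Hypothetical merged output in which case 1 arrived from both sources.
merged_rows = [
    {"CASE_ID": 1, "SOLUTION": "Replaced fuel pump"},
    {"CASE_ID": 2, "SOLUTION": "New brake pads"},
    {"CASE_ID": 1, "SOLUTION": "Replaced fuel pump"},
]

unique_cases = distinct(merged_rows)
```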

35 Demo of Data Loading with Data Services

36 What We’ll Cover
- BusinessObjects Data Services
- SAP HANA
  - SAP HANA Overview
  - Creating Attribute Views
  - Building Analytic Views
  - Making Calculation Views
- Wrap-Up

37 SAP HANA – In Memory Options
- SAP HANA is sold as an in-memory appliance. This means that both software and hardware are included from the vendors.
- Currently you can buy SAP HANA solutions from Cisco, Dell, Fujitsu, IBM, HP, NEC, Huawei, and others.
- SAP HANA indexes and compresses data from a variety of sources, including ERP, and stores the data in memory.
- SAP HANA can radically change the way databases operate and make systems dramatically faster.
Source: SAP AG, 2014

38 HANA Editions and Components While HANA is sold as an appliance, there are many internal components and the edition you buy may contain different licenses to these components.

39 Some of the Hardware Options Dell R920

40 Example: IBM 3850 X6

41 Hardware Options 2014 Onward
These systems are based on Intel's E7 Ivy Bridge processors with 15 cores per processor (the previous generation had only 10).
Update: Hitachi servers and the Dell R920 are now also available.

43 Attribute Views - Overview
- Master data reporting can be modeled using attribute views
- Can be regarded as master data tables
- Can be linked to fact tables in Analytic Views
- A measure, e.g., weight, can be defined as an attribute
- Table joins and properties:
  - Join type: left outer, right outer, full outer, or text table
  - Cardinality: 1:1, N:1, 1:N
  - Language column
- Some views and functions are shipped with HANA

44 Creating a New Attribute View
1. Open HANA Studio and expand the ‘Content’ folder
2. Right-click on the appropriate package in your system
3. Navigate to New > Attribute View…

45 Naming the New Attribute View
1. Give the view a name
2. Add a description
3. Finish and start adding and joining tables to the view
The name and description should accurately describe the Attribute View you want to create.

46 Adding Tables to the Data Foundation
1. Open the ‘Catalog’ folder
2. Expand the system
3. Expand the ‘Tables’ folder
4. Drag the necessary table to the ‘Data Foundation’

47 Adding More Tables to the Data Foundation
- Add more tables by dragging another table to the data foundation area
- The first table that was added will be on the left in the ‘Details’ panel
- The join type is set using the Properties panel

48 Applying Filters to the View
Filters can be used to limit the data being displayed. Right-click on the attribute you want to filter on and select ‘Apply Filter’ from the context menu.
This example shows the creation of a filter on the ‘VALID_TO’ date field. Setting that value to ‘ ’ forces the result set to only show values that are always valid.

49 Making Attributes Visible to End Users
1 & 2. To make an attribute visible to users, click the circle beside each attribute
3. An attribute can be set as a key or changed to a certain type of label
Save and validate once complete.

51 Analytic View - Overview
- Logically very close to InfoCubes in BW
- Join together one central fact table containing measures for reporting
- Can consist of calculated measures and variables
- Analytic Views do not store data
- Data is found in the column store table or view based on the Analytic View structure
An example of an analytic view might be sales by product, customer, and organizational entity.

52 Starting an Analytic View
- Analytic views are the most common views for reporting purposes
- They are the basic view type used as source data in the SAP BusinessObjects BI tools (or other front-end tools)
- We will join together sales data with product information
- This view will be quite simple, but analytic views can be as complex as you like
Analytic views do not have to make use of attribute views. They can simply be a join of master data tables and a fact table.

53 Adding a New Analytic View
1. Find the appropriate package
2. Right-click and choose ‘New’ > ‘Analytic View’
3. Provide a technical name and a description in the popup that follows
Ensure that the ‘View Type’ dropdown is set to Analytic View.

54 Adding Fields to the Output
- Add tables to the data foundation by clicking and dragging tables to it
- You should also select which attributes will be shown in the output by selecting the gray circles next to each item

55 Setting Attributes and Measures
Select which attributes will be shown in the output by selecting the gray circles next to each field.
In the semantic layer, you can assign the type attribute or measure to each item that was selected for the output. This is necessary for attributes and measures to be displayed and aggregated properly in the reporting layer.

56 Joining Tables
In the ‘Logical Join’, two or more tables must be joined together on fields that are identical or share the same values.
1. Select the ‘Logical Join’ node
2. Drag another view or table into the node
3. Drag from one view to the other on the common field (e.g., Product to Product)
By default this creates a referential join of the table to the ‘Data Foundation’.

57 Creating a New Calculated Column
Now we will add a new calculated field called ‘Net Sales’. Using the ‘Advanced’ tab, you can set the type of value, such as currency or percentage.

58 Demo: Building Attribute and Analytic Views

60 Calculation View - Overview
- Bring together database tables, attribute views, analytic views, and other calculation views
- Provide one source of data for reporting tools
- You can also write SQL statements to make sure a set of fields matches the requirements of other output structures
Calculation views are used to satisfy complex business requirements. An example of a calculation view might be a comparison of actual sales with forecast sales.
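
The actual-versus-forecast example can be pictured as combining two data sets and deriving a variance. The Python sketch below is only an analogy for what such a calculation view would deliver; the function name, the PRODUCT, ACTUAL, FORECAST, and VARIANCE columns, and the figures are all assumptions for illustration.

```python
def compare_actual_to_forecast(actuals, forecasts):
    """Combine two product-to-amount mappings and derive the variance per product."""
    products = sorted(set(actuals) | set(forecasts))
    return [
        {
            "PRODUCT": product,
            "ACTUAL": actuals.get(product, 0),
            "FORECAST": forecasts.get(product, 0),
            # Calculated column: positive means sales exceeded the forecast.
            "VARIANCE": actuals.get(product, 0) - forecasts.get(product, 0),
        }
        for product in products
    ]


# Hypothetical sales figures per product.
actual_sales = {"A": 120, "B": 80}
forecast_sales = {"A": 100, "C": 50}

comparison = compare_actual_to_forecast(actual_sales, forecast_sales)
```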

61 Creating a New Calculation View
A calculation view will now be created to join together other tables and views and use calculations and aggregations to analyze the data.
1. Right-click on the appropriate package
2. In the context menu, click ‘New’ > ‘Calculation View’

62 Naming the New Calculation View
Give the Calculation View a proper name and label.
The ‘Copy From’ option can be used to copy and extend an existing calculation view without editing the original view or having to create a new one each time.

63 Propagate to Semantics
In the projection layer, right-click on the attributes you want to display in the semantic layer and choose ‘Propagate to Semantics’.
If you choose ‘Add to Output’ instead, that field will have to be activated manually in every node.

64 Creating a New Calculation in the View
Calculated columns derive meaningful information, in the form of new columns, from existing columns. You can add your own calculations to the calculation view just as in the analytic view.
1. Give the column a proper name
2. Set the ‘Data Type’
3. Choose a function
4. Select the text within the parentheses
5. Choose an element (or attribute in your table)
6. Validate the syntax

65 Aggregation - Overview
Aggregation node: columns will be rolled up, or aggregated, when placed in this layer.
With an aggregated column on customer and amount, you would get a data set that looks like the following:
[Table: input columns Customer, Product, Amount; aggregated output columns Customer, Amount]
Customer 1’s amounts were added up so there is one less row to display.
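
The roll-up performed by an aggregation node can be sketched in Python as a grouped sum. The sample values are invented for illustration (the figures from the original slide's table are not reproduced here), and the function name `aggregate` is hypothetical.

```python
from collections import defaultdict


def aggregate(rows, group_by, measure):
    """Roll up `measure` by summing it for each distinct value of `group_by`."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[group_by]] += row[measure]
    return [{group_by: key, measure: total} for key, total in totals.items()]


# Hypothetical detail rows with Customer, Product, and Amount columns.
detail_rows = [
    {"Customer": "Customer 1", "Product": "A", "Amount": 100},
    {"Customer": "Customer 1", "Product": "B", "Amount": 50},
    {"Customer": "Customer 2", "Product": "A", "Amount": 75},
]

by_customer = aggregate(detail_rows, "Customer", "Amount")
```

Three detail rows become two aggregated rows because both of Customer 1's amounts are added together once Product is left out of the output.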

66 Adding a Calculated Column to the Aggregation Layer
In the aggregation node, calculated columns can be added as aggregated columns.
If calculations are not added to a projection layer and then sent to an aggregation node, the totals will not work properly in reporting.
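
One plausible reason the order matters is that a row-level calculation and an aggregation do not commute. The Python sketch below uses a hypothetical price-times-quantity calculation (the column names and values are assumptions) to show how calculating after aggregating gives a different, wrong total.

```python
# Hypothetical detail rows; PRICE and QUANTITY are illustrative column names.
sales_rows = [
    {"PRICE": 10, "QUANTITY": 2},
    {"PRICE": 5, "QUANTITY": 3},
]

# Calculate per row first (as in a projection layer), then aggregate.
calculated_then_aggregated = sum(r["PRICE"] * r["QUANTITY"] for r in sales_rows)

# Aggregate each column first, then calculate on the totals.
aggregated_then_calculated = (
    sum(r["PRICE"] for r in sales_rows) * sum(r["QUANTITY"] for r in sales_rows)
)
```

The first result is the true revenue (20 + 15); the second multiplies a summed price by a summed quantity, which has no business meaning.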

67 Assigning Column Types to the View
In the semantics layer, each item needs to be assigned the type attribute or measure.
1. Click on the ‘Semantics’ node
2. Click the ‘Auto Assign’ button to automatically assign the ‘Type’
3. If any of the types are incorrect, you can manually adjust them
You can set each of these types manually, but the automatic assignments are usually correct. Once all assignments are complete, save and validate the view.

69 7 Key Points to Take Home
- SAP Data Services transforms, refines, and delivers trusted data for the Enterprise Data Warehouse
- Multiple data sources can be used for Data Services, including flat files, DTDs, XML schemas, Excel workbooks, and more
- Utilize built-in transforms, which are objects that process source data to bring about desired outputs
- SAP HANA indexes data from a variety of sources and stores the results on a dedicated server
- Attributes add details and can be modeled using Attribute Views
- Analytic Views join together one central fact table consisting of calculated measures and variables for reporting
- Calculation Views bring together database tables, attribute views, analytic views, and other calculation views

70 Where to Find More Information
- Bjarne Berg and Penny Silvia, SAP HANA: An Introduction, SAP Press; 3rd edition (May 1, 2014)
- SAP’s main page for all SAP HANA related information
- Powered by HANA demos
- SAP HANA and In-Memory Computing by SAP HANA Community

71 Your Turn!
How to contact me: Dr. Berg
Please remember to complete your session evaluation

72 Disclaimer SAP, R/3, mySAP, mySAP.com, SAP NetWeaver ®, Duet ™®, PartnerEdge, and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and in several other countries all over the world. All other product and service names mentioned are the trademarks of their respective companies. Wellesley Information Services is neither owned nor controlled by SAP.