Introducing Teradata Aster Discovery Platform Getting Started Ahsan Nabi Khan September 25 th, 2015.

Slides:



Advertisements
Similar presentations
Supercharging Analytics on Big Data Announcing MapReduce-ready Advanced Analytic Functions June 21 st
Advertisements

BigBench: Big Data Benchmark Proposal Ahmad Ghazal, Tilmann Rabl, Minqing Hu, Francois Raab, Meikel Poess, Alain Crolotte, Hans-Arno Jacobsen.
DATA MINING CS157A Swathi Rangan. A Brief History of Data Mining The term “Data Mining” was only introduced in the 1990s. Data Mining roots are traced.
1 ACCTG 6910 Building Enterprise & Business Intelligence Systems (e.bis) Introduction to Data Mining Olivia R. Liu Sheng, Ph.D. Emma Eccles Jones Presidential.
PayDox applications All features can be used independently.
Data Resource Management Data Concepts Database Management Types of Databases Chapter 5 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies,
Data Mining – Intro.
CS157A Spring 05 Data Mining Professor Sin-Min Lee.
Data Warehousing: Defined and Its Applications Pete Johnson April 2002.
WHT/ HPCC Systems Flavio Villanustre VP, Products and Infrastructure HPCC Systems Risk Solutions.
Microsoft ® Application Virtualization 4.6 Infrastructure Planning and Design Published: September 2008 Updated: February 2010.
Page 1 © Hortonworks Inc – All Rights Reserved Hortonworks Naser Ali UK Building Energy Management Group Hadoop: A Data platform for businesses.
This presentation will guide you though the initial stages of installation, through to producing your first report Click your mouse to advance the presentation.
Acceleratio Ltd. is a software development company based in Zagreb, Croatia, founded in We create innovative software solutions for SharePoint,
OM. Brad Gall Senior Consultant
WebFOCUS Developer Studio Update Dimitris Poulos Technical Director September 3, 2015 Copyright 2009, Information Builders. Slide 1.
Analytics Map Reduce Query Insight Hive Pig Hadoop SQL Map Reduce Business Intelligence Predictive Operational Interactive Visualization Exploratory.
Data Mining. 2 Models Created by Data Mining Linear Equations Rules Clusters Graphs Tree Structures Recurrent Patterns.
Marketing Attribution Problematic September 2012 Extract from various presentations: RPM Direct, Kevin.
Appliance-based architectures for high performance data intensive applications Session at Silicon India Rajgopal Kishore Vice President and Global Head.
Session 5: Working with MySQL iNET Academy Open Source Web Development.
Database Systems – Data Warehousing
Guide to Linux Installation and Administration, 2e1 Chapter 3 Installing Linux.
Chapter 7 Microsoft Windows XP. Windows XP Versions XP Home XP Home XP Professional XP Professional XP Professional 64-Bit XP Professional 64-Bit XP Media.
© 2012 The McGraw-Hill Companies, Inc. All rights reserved. 1 Third Edition Chapter 6 Today’s Windows Windows Vista and Windows 7 McGraw-Hill.
Tutorial 11 Installing, Updating, and Configuring Software
Eclipse Overview Introduction to Web Programming Kirkwood Continuing Education Fred McClurg © Copyright 2015, Fred McClurg, All Rights Reserved.
VMWare Workstation Installation. Starting Vmware Workstation Go to the start menu and start the VMware Workstation program. *Note: The following instructions.
Setting Up a Local WordPress Development Environment By Gregory Young Alternative Hosting
Database Design Part of the design process is deciding how data will be stored in the system –Conventional files (sequential, indexed,..) –Databases (database.
Introduction to SQL Server Data Mining Nick Ward SQL Server & BI Product Specialist Microsoft Australia Nick Ward SQL Server & BI Product Specialist Microsoft.
Cisco Confidential © 2010 Cisco and/or its affiliates. All rights reserved. 1 MSE Virtual Appliance Presenter Name: Patrick Nicholson.
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
CS157B Fall 04 Introduction to Data Mining Chapter 22.3 Professor Lee Yu, Jianji (Joseph)
 Database Administration Installing Oracle 11g & Creating Database.
Consul- ting Services Outsour- cing Services Techno- logy Services Local Profes- sional Services Competence Centers Business Intelligence WebTech SAP.
Machine Learning Extract from various presentations: University of Nebraska, Scott, Freund, Domingo, Hong,
Microsoft Management Seminar Series SMS 2003 Change Management.
 Enhancing User Experience  Why it is important?  Discussing user experience one-by-one.
Data Science Background and Course Software setup Week 1.
Intro to Datazen.
Clusterpoint Margarita Sudņika ms RDBMS & NoSQL Databases & tables → Document stores Columns, rows → Schemaless documents Scales UP → Scales UP.
MIS2502: Data Analytics Advanced Analytics - Introduction.
I. Requirements for computer. 2.How to install the VMware player software. 3.How to operate the DAS system.
© 2012 The McGraw-Hill Companies, Inc. All rights reserved. 1 Third Edition Chapter 6 Today’s Windows Windows Vista and Windows 7 McGraw-Hill.
Virtual Machines Module 2. Objectives Define virtual machine Define common terminology Identify advantages and disadvantages Determine what software is.
The Future With Windows XP Microsoft announced that after April 16, 2014 it will not longer support Windows XP. No statement on continued inclusion of.
Chapter 3 Building Business Intelligence Chapter 3 DATABASES AND DATA WAREHOUSES Building Business Intelligence 6/22/2016 1Management Information Systems.
CS570: Data Mining Spring 2010, TT 1 – 2:15pm Li Xiong.
Data Analytics Challenges Some faults cannot be avoided Decrease the availability for running physics Preventive maintenance is not enough Does not take.
HedEx Lite Obtaining and Using Huawei Documentation Easily
Bhakthi Liyanage SQL Saturday Atlanta 15 July 2017
bitcurator-access-webtools Quick Start Guide
Data Mining – Intro.
SNS COLLEGE OF TECHNOLOGY
Guide to Linux Installation and Administration, 2e
MIS2502: Data Analytics Advanced Analytics - Introduction
Microsoft Ignite /11/2018 1:18 AM BRK4017
Spark Presentation.
Our Solutions Focus: Threat Detection and Investigation
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
HC Hyper-V Module GUI Portal VPS Templates Web Console
Data Warehousing and Data Mining
Microsoft Virtual Academy
bitcurator-access-webtools Quick Start Guide
MIS2502: Data Analytics Introduction to Advanced Analytics and R
IBM C IBM Big Data Engineer. You want to train yourself to do better in exam or you want to test your preparation in either situation Dumpspedia’s.
Presentation transcript:

Introducing Teradata Aster Discovery Platform Getting Started Ahsan Nabi Khan September 25 th, 2015

2 When You Need Aster Discovery Platform? 2.DIG DEEP AND FAST: Ad-hoc, interactive exploration of all data within seconds/minutes 1.SCALABLE ANALYTICS: Vast array of analytic algorithms run on commodity hardware as an Integrated Analytics Engine

3 Advanced Analytic Applications: Use Cases Credit, Risk and Fraud Packaging and Advertising Buying Patterns Cyber Defense Fraud and Crime Citizen’s Feedback Call Data Records Service Personalization Friends Graphs Click Stream Opinion, Sentiment, Stars Social Media Telecom Commerce Analysis Federal Analysis

Discovery Process Model

5 Hardware requirements: -6-8 GB memory -20 GB free disk space OS requirement: -64-bit XP, Vista, Win 7, and later Prereq. Software: -7-zip -VMware player Download Aster Discovery Platform from: - Downloading Aster 6.0

6 Installing Aster 6.0 Extract Aster 7-zip file. It contains folders for -Aster Queen VM -Aster Worker VM -Documentation Set IP address for VMNet8 Adapter to

7 Aster Management Console Start Queen and Worker VM on VM Player Point your browser to Username: db_superuser Password: db_superuser On Admin Panel, in Cluster Management, click on Activate Cluster

8 SQL-MR through ACT Use putty.exe to login into Aster Queen node shell -Queen IP address: Username: root (or) aster -Password:aster Access Aster Command Terminal (ACT) -Username: act -Password: beehive Create Tables, Query in SQL-MR

9 Setting Up Aster Lens for Visualization In ACT, put command -\dF Check that CfilterViz.zip and NpathViz.zip functions are installed alongwith prerequisite functions cfilter.zip and npath.viz If not, copy them from Analytic libraries in Aster Queen home folder using FileZilla Run command -\i NpathViz.zip -\i CfilterViz.zip Login Queen root using putty.exe Locate /opt/AsterLens/asterlens Start asterlens by running shell command -./start-asterlens.sh Point browser to Login AsterLens -Username:admin -Password:admin

10 Visualization through AsterLens Create Table Load Data through ncluster_loader Apply Cfilter to find association between Calling_Number and Called_Number Create AsterLens CfilterViz table from Cfilter result Add in AsterLens Catalog Create Table CDR ( Calling_Number VARCHAR(255), Called_Number VARCHAR(255), Minutes REAL ) DISTRIBUTE BY HASH(Calling_Number); ncluster_loader --hostname username beehive -- password beehive --dbname beehive --verbose --skip-rows 1 CDR 26_02_2015.csv –c SELECT * FROM Cfilter( ON (SELECT 1) PARTITION BY 1 PASSWORD('beehive') INPUTTABLE('CDR') OUTPUTTABLE('cdr_26_02_2015') INPUTCOLUMNS(‘calling_number',‘called_number') JOINCOLUMNS(‘calling_number') MAXSET(25) ); create table Aster_Lens.CDR_cfilterviz distribute by hash(col1_item1) as ( select * from CfilterViz( on cdr_26_02_2015 partition by 1 title(' calls') item1_col('col1_item1') item2_col('col2_item2') cnt1_col('cnt1') cnt2_col('cnt2') DIRECTED('true') score_col('cntb') accumulate('col1_item1', 'col2_item2') ) ); insert into Aster_Lens_Catalog values(1, 'aster_lens', ‘CDR_cfilterviz', '26_02_2015 calls');

11 Resulting AsterLens Visualizations

12 Example: k-Means Function What this gives you: -Organizes data into groupings or clusters based on shared attributes -Allows you to understand natural segments Example use cases: -Marketing segmentation -Fraud detection -Computer vision-- object recognition One call for clustering items into natural segments Complete Aster Data Application: Text processing required to prepare data for customer support analysis K-Means identifies hot product issues for proactive response K-Means in Use: Contact Center

13 Example: Basket Generator Function What this gives you? -Creates groupings of related items via single pass over data -Allows you to increase or decrease basket size with a single parameter change Example use cases: -Retail market basket analysis -People who bought x also bought y Extensible market basket analysis Complete Aster Data Application: Evaluate effectiveness of marketing programs Launch customer recommendations feature Evaluate and improve product placement Basket Generator in Use

14 Example: Unpack Function What this gives you: -Translates unstructured data from a single field into multiple structured columns -Allows business analysts access to data with standard SQL queries Example use cases: -Sales data -Stock transaction logs -Gaming play logs Transforming hidden data into analyst accessible columns Complete Aster Data Application: Text processing required to transform/unpack third party sales data Sessionization required to prepare data for path analysis Statistical analysis of pricing Unpack in Use: Pricing Analysis

15 Example: nPath Function for time-series analysis What this gives you: - Pattern detection via single pass over data -Allows you to understand any trend that needs to be analyzed over a continuous period of time Example use cases: - Web analytics– clickstream, golden path - Telephone calling patterns - Stock market trading sequences Uncovering patterns in sequential steps Complete Aster Data Application: Sessionization required to prepare data for path analysis nPath identifies marketing touches that drove revenue nPath in Use: Marketing Attribution

16 Aster Analytics is much more diverse and powerful in finding “interesting” patterns while loses no information assimilated from simpler queries. Simple SQL queries cannot match the analytical power of Aster Analytics. For Big Data Analytics, Teradata and Aster create a powerful combination. Conclusion