Alteryx User Group August 2016
Agenda Demonstration Hands on With Alteryx – Excel Edition Conquering Automatic Data with Alteryx Hands on With Alteryx – Excel Edition
Reminder Before We Get Started Find answers quickly with enhanced search and content recommendations Create valuable connections with peers and industry experts Submit your ideas to help us improve our products Earn reputation and rewards for your contributions Share your knowledge with the community Read insightful blog posts by data gurus Learn from real world examples* Join today: http://community.alteryx.com http://https://community.alteryx.com/t5/Portland-ME/gp-p/portland-me
Use Case 1: Automatic Office Challenge What we will cover: Data input using web download tool. Parsing JSON hierarchies. Handling encoded polylines using Alteryx macros and binary functions. Outputting multiple data sources. Using data in Tableau.
Hands On With Alteryx – Excel Edition What we will cover: Dynamic data input from multiple files and worksheets. Conforming poorly structured data using common Alteryx tools. Using transformation and summary tools Outputting data to multiple sources.
Why Today’s Exercise Matters The New York Times reports that data scientists spend from 50 percent to 80 percent of their time collecting and preparing unruly digital data, even before it can be analyzed.
Step 1: Data Sources/Input Overview Excel is commonly used by business areas to track important data. Very often this data is formatted in a way that makes sense to the user, but is not well suited for databases and analytics. Lets use Alteryx to conform a sample dataset that contains local company and product data. The link below contains the data used in this exercise. There are 10 workbooks each with 5 worksheets. https://console.aws.amazon.com/s3/home?region=us-west-2#&bucket=pmugfiles&prefix= Goal Download sample data from Amazon S3 and use the Alteryx input tool to pull data from all worksheets with the name Company Overview. Use the dynamic input tool to pull product data from all remaining sources.
Step 2: Cleaning Data Overview Goal We now how two sets of data. Company and product. Each set is poorly formed and not suitable for use by reporting tools. Goal Using Alteryx to clean company overview information. Final table should contain 7 columns: Company name, Tax ID, Situs State, Street Address, Zip Code, and Industry. Repeat the process for product information. This time dynamically rename your column headers. Use a join tool to create a third consolidated dataset.
Step 3: Output Data Overview Goal Now to make the data useful for reporting by end users. This is done by outputting the data to storage. Goal Use any Alteryx output tool to store your work to separate data files.
Open Forum/Questions