Infotools Harmoni IHU: Data Sources

Slides:



Advertisements
Similar presentations
Chapter 10 Excel: Data Handling or What do we do with all that data?
Advertisements

CC SQL Utilities.
DATABASE BASICS: INSERTING AND FORMATTING DATA EXCEL 07 SESSION II.
SAITS Learning Opportunities Introduction to Pivots and Dashboards December 12, :00 – 12:00 Cathy Bates.
Access Lesson 2 Creating a Database
1 An Introduction to IBM SPSS PSY450 Experimental Psychology Dr. Dwight Hennessy.
Microsoft Office Word 2013 Expert Microsoft Office Word 2013 Expert Courseware # 3251 Lesson 4: Working with Forms.
Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts
Managing Business Data Lecture 8. Summary of Previous Lecture File Systems  Purpose and Limitations Database systems  Definition, advantages over file.
Unit J: Creating a Database Microsoft Office Illustrated Fundamentals.
Coding for Excel Analysis Optional Exercise Map Your Hazards! Module, Unit 2 Map Your Hazards! Combining Natural Hazards with Societal Issues.
XP New Perspectives on Microsoft Access 2002 Tutorial 51 Microsoft Access 2002 Tutorial 5 – Enhancing a Table’s Design, and Creating Advanced Queries and.
Mail merge letters are used to send the same or similar documents to many different people. Since they contain the recipient’s name, address, and other.
Copyright © 2008 Pearson Prentice Hall. All rights reserved. 1 Import Data From Text Files and Other Sources Importing is the process of inserting data.
Miscellaneous Excel Combining Excel and Access. – Importing, exporting and linking Parsing and manipulating data. 1.
Lesson 17 Getting Started with Access Essentials
1 Lesson 18 Organizing and Enhancing Worksheets Computer Literacy BASICS: A Comprehensive Guide to IC 3, 3 rd Edition Morrison / Wells.
6 th Annual Focus Users’ Conference Manage Integrations Presented by: Mike Morris.
CREATING TEMPLATES CREATING CUSTOM CHARACTERS IMPORTING BATCH DATA SAVING DATA & TEMPLATES CREATING SERIES DATA PRINTING THE DATA.
Chapter 17 Creating a Database.
Page 1 Non-Payroll Cost Transfer Enhancements Last update January 24, 2008 What are the some of the new enhancements of the Non-Payroll Cost Transfer?
1 Chapter 2: Working with Data in a Project 2.1 Introduction to Tabular Data 2.2 Accessing Local Data 2.3 Accessing Remote Data 2.4 Importing Text Files.
FIX Eye FIX Eye Getting started: The guide EPAM Systems B2BITS.
XP Chapter 2 Succeeding in Business with Microsoft Office Access 2003: A Problem-Solving Approach 1 Building The Database Chapter 2 “It is only the farmer.
Microsoft Access. Microsoft access is a database programs that allows you to store retrieve, analyze and print information. Companies use databases for.
0 eCPIC Admin Training: OMB Submission Packages and Annual Submissions These training materials are owned by the Federal Government. They can be used or.
XP. Objectives Sort data and filter data Summarize an Excel table Insert subtotals into a range of data Outline buttons to show or hide details Create.
1 Lesson 13 Organizing and Enhancing Worksheets Computer Literacy BASICS: A Comprehensive Guide to IC 3, 3 rd Edition Morrison / Wells.
Gold – Crystal Reports Introductory Course Cortex User Group Meeting New Orleans – 2011.
Chapter 3 Automating Your Work. It is frustrating when you have to type the same passage of text repeatedly. For example your name and address. Word includes.
Word Create a basic TOC. Course contents Overview: table of contents basics Lesson 1: About tables of contents Lesson 2: Format your table of contents.
Ms. Hall Spring Functions. Excel – Lesson 4 Summarizing Data with Functions A function is a predefined formula that performs a calculation. When.
Emdeon Office Batch Management Services This document provides detailed information on Batch Import Services and other Batch features.
Document Handling Contents: General Structure of Documents
Dreamweaver – Setting up a Site and Page Layouts
Europa Sports Datafeed ftp://
Creating Oracle Business Intelligence Interactive Dashboards
Miscellaneous Excel Combining Excel and Access.
GO! with Microsoft Office 2016
Tutorial 5: Working with Excel Tables, PivotTables, and PivotCharts
Practical Office 2007 Chapter 10
PIVOT TABLE BASICS.
Required Data Files Review
GO! with Microsoft Access 2016
Week 12 Option 3: Database Design
Data File Import / Export
Mail Merge Instructions (Yanick’s Version)
Microsoft Access 2003 Illustrated Complete
(Global / All Markets View)
Boeing Supply Chain Platform (BSCP) Detailed Training
Microsoft Excel All editions of Microsoft office.
MODULE 7 Microsoft Access 2010
Creating and Modifying Queries
NextGen Trustee General Ledger Accounting
Infotools Harmoni IHU: Constructions
Hearth Bulk System Divisional Secretaries Briefing 2011.
Fundamentals of Data Structures
Navya Thum January 30, 2013 Day 5: MICROSOFT EXCEL Navya Thum January 30, 2013.
ICT Spreadsheets Lesson 1: Introduction to Spreadsheets
Computer Science Projects Database Theory / Prototypes
Spreadsheets, Modelling & Databases
Overview of Contract Association Batch Upload
Topic 8 – Pivot tables and Charts Lesson 1 – Pivot tables
Guidelines for Microsoft® Office 2013
Topic 11 Lesson 1 - Analyzing Data in Access
Unit J: Creating a Database
Excel Tips & Tricks July 18, 2019.
Excel-eration Webinar
Viewing the New Interface
Presentation transcript:

Infotools Harmoni IHU: Data Sources August 2017 Presented by: Janine Takle

Data Sources A data source is the starting point for all you do in Infotools Harmoni A source is a collection of data that is either in an imported file, or in a connected data store Each record in a source corresponds to a count in Infotools Harmoni Infotools Harmoni itself is source agnostic

Sources vision Our vision is for any appropriate data source to be analysed, have insights visualised, then distributed, all within Infotools Harmoni Connections that facilitate real time updates are expected to be the way of the future We have a continuous development stream dedicated to adding connections and supporting new sources We are strongly influenced by the needs of our clients when identifying additional sources to support

Source Types

Imported files Imported files capture data for a moment in time Files that can be imported into Infotools Harmoni include: SAV files X-files MDD/DDF – Dimensions files XLSX – Excel files TXT files – tab delimited text CSV files – comma delimited text

Direct connections Direct connections allow updated data to flow into Infotools Harmoni in real time Direct connections are achieved using APIs (Application Programming Interface). If an API is available in a client’s data collection system we can potentially develop a direct connection We currently support direct connections to: Tables in SQL Server Views in SQL Server Projects in Decipher

Warning, from here on in we dive into the detail….

Source Structures

Source content Sources can be divided into 2 types – record level and summary level Record level sources are sources where analysing the count of records is relevant, e.g. where a record represents a respondent, a log entry, a sales transaction, etc. Summary level sources are sources where analysing the count of records is irrelevant, and analysis only has meaning when a measure is applied, e.g. scores, GRPs, Spend, etc. Files in SAV and Dimensions format typically contain record level data with a record representing a respondent

Creating sources SAV files can be generated from a variety of data collection applications, but not all make full use of available features X-files is Infotools’ proprietary format read by our legacy Desktop applications Dimensions files are typically created by Dimensions, although some other market research data collection applications can also create them Xlsx files are primarily created from Excel in Office 2007+ Text files, delimited with a tab or comma, can be generated from a variety of systems, and do not have market research specific structures

SAV Structure Coded responses have labels Looped questions have a separate variable for each loop combination All respondents asked the question have a response Respondents not asked the question don’t have any response Multiple response sets capture yes responses Cleaned data, with only valid respondents

X-File Structure X-files are typically already designed Vaxes must be in indexless format (achieved using SUP -Clean) Each item within a common type needs a unique label Headers need to be new style, i.e. version 4+ Matrices, yngrids and vaxes with predefines are not supported X-files more likely for internal consumption, or customers with existing data services

Before continuing with the next 6 slides of detail, let’s just come up for air and be inspired by Albert Einstein for a minute… While we have some conditions around the structure of the data that can be analysed in IH, the content of the data is limited only by your imagination

Now let’s dive back down into the detail….

Dimensions MDD/DDF Structure The Dimensions converter is currently undergoing an overhaul Supported structures will be confirmed within the next few months

XLSX Structure First row contains headers with unique, non-blank descriptions Fields contain labels, not codes. If codes are used they will need to be renamed to labels after importing into Infotools Harmoni (IH) Key identifiers can be added to the header labels to ensure required Infotools Harmoni types are applied No blank rows between valid rows of data Numeric values don’t contain text, e.g. $, commas, % No fields contain |

TXT Structure First row contains headers with unique, non-blank, descriptions that don’t include tabs Fields are delimited with a tab Records contain labels, not codes. If codes are used they will need to be renamed to labels after importing into IH Key identifiers can be added to the header labels to ensure required IH types are applied No blank rows between valid rows of data Text containing tabs is enclosed in double-quotes Time/Date fields contain descriptions, not numbers Numeric values don’t contain text, e.g. $, tabs, %

CSV Structure First row contains headers with unique, non-blank, descriptions that don’t include commas Fields are delimited with a comma Records contain labels, not codes (if codes are used they will need to be renamed to labels after importing into IH) Key identifiers can be added to the header labels to ensure required IH types are applied No blank rows between valid rows of data Text containing commas is enclosed in double-quotes Time/Date fields contain descriptions, not numbers Numeric values don’t contain text, e.g. $, commas, %

SQL Structure All relevant data exists in a single view or table All data associated with a unique record is reported in a single table or view Key identifiers can be added to field names to advise IH on the default type to import Fields contain labels, not codes (if codes are used they will need to be renamed to labels after importing into IH) Keep in mind that field names in SQL are limited to 128 characters, and cannot contain |

Direct Connection Structure Each Direct Connection is developed to map the data types in the data collection system with the data types available in Infotools Harmoni Direct Connections allow market research specific structures, such as grids and multiple responses, to retain their internal relationships

Source Dictionaries

Source Dictionaries A Dictionary contains meta-data to guide interpretation When loading sources with a dictionary, Infotools Harmoni automatically maps items into Infotools Harmoni item types When loading sources that d on’t contain a dictionary, the user is given the option to override the automated mapping of source variables into Infotools Harmoni types Sources with inherent dictionaries are SAV, X-files, MDD/DDF and Decipher

SAV Dictionary Variables that are a Numeric Type, and have defined Value Labels become standard axes in IH, e.g. Variables that are a Numeric or Date Type, and have no defined Value Labels become measures in IH, e.g. Variables that are a String Type, and don’t have defined Value Labels become text items in IH, e.g.

Dictionary Tricks If the file doesn’t have a dictionary, key identifiers can be used to pre-determine the data type in text file imports The key identifiers are: $ any field starting with $ becomes a measure in IH $weight any field starting with $weight becomes a weight in IH & any field starting with & becomes a text item in IH

Tips & Tricks

Source Tips & Tricks You can see the sources in a Project that’s not loaded by selecting the project and clicking Sources You can see the sources that exist in a loaded Project by clicking Once in the view/add sources menu, you can add more sources to a project by clicking

Source Tips & Tricks Projects can contain multiple sources - harmonization of sources is, of course, one of our key selling points! Infotools Harmoni is source agnostic, so sources in a project can be a mix of imported files and direct connections Sources are specific to a Project

Update Tips & Tricks If a source is updated, or a new source is added, dictionary mappings applied to the previous source are applied to the new one When adding new sources, items that don’t already exist in the project are added to the end If an item does not appear with the expected label or type, Infotools Harmoni provides functionality to change these things within your Project … although sometimes it’s easier to change the source!

Filetype Tips & Tricks When using an x-files source you need to upload: Xbf Xdf Xef Xlk (optional) Excel files must be .xlsx Infotools Harmoni can use sources in any language, including languages requiring multi-byte storage, e.g. traditional Chinese

Tips & Tricks When importing files that do not have an associated dictionary, the elements of standard axes are written into the axis in the order that they appear within the file Multiple response sets in SAV files appear at the bottom of the project Data imported into SQL from Excel, Access or Text, has field names truncated at 64 characters

Helpful resources Infotools Harmoni Knowledge Bank Sample files https://infotools.zendesk.com/hc/en-us/categories/203145547-Infotools-Harmoni Sample files https://infotools.zendesk.com/hc/en-us/articles/115011356528 Infotools Harmoni – client sites https://connect.infotoolsonline.com Infotools Harmoni – internal sites https://internal.infotoolsonline.com

Try it yourself

Try it yourself…? Sample files can be found in https://infotools.zendesk.com/hc/en- us/articles/115011356528 Try importing each of the different sources and see if you can answer the following questions

Can you…? See what’s new in the 201502 SAV file that wasn’t in 201501? Work out why the sample size is so high in the x-files? Import Experience feedback in the XLSX to be a verbatim in Infotools Harmoni Add the TXT file into the same project you imported the SAV files and watch the magic happen Take a copy of the CSV and change the source so Num Bevs Per Week is recognised as a measure in the import screen?

What is this data…? The following slides show some screenshots of sources. For each data source, consider the following questions: Is this data record level or summary level? Is this data suitably formatted to import into Infotools Harmoni? If not, why not?

What is this data…Source 1

What is this data…Source 2

What is this data…Source 3

What is this data…Source 4

What is this data…Source 5

Answers/Hints What is this data Try it yourself Source 5 – summary level, suitably formatted Source 4 – record level, suitably formatted Source 3 – summary level, suitably formatted Source 2 – summary level, not suitably formatted – there are blank lines, merged cells in the headers, headings among the entries in column A, etc. Source 1 – record level, suitably formatted What is this data Hint: Insert a $ into the start of the Num Bevs Per Week header Hint: the colouring indicates the common and non-common items Hint: Insert a & into the start of the Experience feedback header Hint: a population projection has been set in the xlk Hint: use the NEW filter at the top of the project view Try it yourself

Questions