Lesson 1: Introduction to Trifacta Wrangler
Chapter 1A – Course Overview
Lesson 1 – Chapter 1A – Course Overview
7 Structured Lessons:
Lesson 1: Introduction to Trifacta Wrangler
Lesson 2: Getting Started
Lesson 3: Basic Operations
Lesson 4: Advanced Transforms
Lesson 5: More Advanced Transforms
Lesson 6: Tools
Lesson 7: Results and Publishing

A datasource is a reference to a set of data that has been imported into the system. The source is not modified within the application and can be used in multiple datasets. When you use Trifacta to wrangle a source, or file, the original file is not modified, so it can be reused repeatedly, for example to prepare output in multiple ways.

Datasources are created in the Datasources Page, or when a new dataset is created. There are two ways to add a datasource to your Trifacta instance:
You can locate and select a file in HDFS (the Hadoop Distributed File System) using the file browser.
You can upload a local file from your machine. Note that there is a 1 GB file size limit for local files.

Several file formats are supported:
CSV
LOG
JSON
AVRO
EXCEL (if you upload an Excel file with multiple worksheets, each worksheet is imported as a separate source)

Trifacta. Confidential & Proprietary.
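The local-upload rules on this slide (supported formats, the 1 GB limit, and the Excel worksheet behavior) can be illustrated with a small pre-upload check. This is only a sketch and not part of Trifacta: the function name `check_local_upload`, the extension-to-format mapping, and the reading of "1 GB" as 1024**3 bytes are all assumptions made for illustration.

```python
from pathlib import Path

# Assumption: "1 GB" is read as 1024**3 bytes; the slide does not define it exactly.
MAX_LOCAL_UPLOAD_BYTES = 1024 ** 3

# Assumption: a plausible mapping from file extensions to the formats named on the slide.
SUPPORTED_EXTENSIONS = {
    ".csv": "CSV",
    ".log": "LOG",
    ".json": "JSON",
    ".avro": "AVRO",
    ".xls": "EXCEL",
    ".xlsx": "EXCEL",
}


def check_local_upload(path, size_bytes=None):
    """Hypothetical helper: return (ok, message) for a local file.

    If size_bytes is None, the size is read from the file on disk.
    """
    p = Path(path)
    fmt = SUPPORTED_EXTENSIONS.get(p.suffix.lower())
    if fmt is None:
        return False, f"unsupported file format: {p.suffix or '(none)'}"
    if size_bytes is None:
        size_bytes = p.stat().st_size
    if size_bytes > MAX_LOCAL_UPLOAD_BYTES:
        return False, "file exceeds the 1 GB local upload limit"
    if fmt == "EXCEL":
        # Per the slide, each worksheet becomes its own source.
        return True, "EXCEL: each worksheet is imported as a separate source"
    return True, fmt
```

For example, `check_local_upload("sales.csv", 2048)` returns `(True, "CSV")`, while a file with a `.txt` extension is rejected as unsupported. Passing `size_bytes` explicitly lets the check run without the file being present.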