ACIS 1504 - Introduction to Data Analytics & Business Intelligence Text Mining Data Cleaning.

Slides:



Advertisements
Similar presentations
Insert complete company name Creating an Excel Spreadsheet Using Excel 2000.
Advertisements

Dot Plot. Goal We will take two nucleotide base strings and look for common patterns – stretches where the bases match. GAATTCATACCAGATCACCGAAAACTGTCCTCCAA.
1. Write an Excel text function in cell Company Data!A4, which may be copied down, to string together the first and last name with only the first letter.
Computer Science & Engineering 2111 Text Functions 1CSE 2111 Lecture-Text Functions.
Excel Objects, User Interface, and Data Entry. ◦ Application Window  Title Bar  Menu Bar  Toolbars  Status Bar  Worksheet Window  Worksheet Input.
Exploring Office Grauer and Barber 1 Committed to Shaping the Next Generation of IT Experts. Chapter 1 – Introduction to Excel: What is a Spreadsheet?
Understanding Microsoft Excel
Objectives 1.Identify the functions of a spreadsheet 2.Identify how spreadsheets can be used. 3.Explain the difference in columns and rows. 4.Locate specific.
 Basically, a sequence of characters  Character? › Like… a letter › Or a number › Or even blank space.
Structured Query Language - SQL Carol Wolf Computer Science.
Visualizing Multiple Physician Office Locations Exercise 9 GIS in Planning and Public Health Wansoo Im, Ph.D.
1 Computers and Representations Ascii vs. Binary Files Over the last few million years, Earth has experienced numerous ice ages when vast regions of the.
Text Mining & Basic Calculations Supplemental Resources on Class Website.
1 CS110: Lecture 2 Spreadsheets Prepared by Fred Annexstein University of Cincinnati CC Some rights reserved Today’s Topics Basics of Excel Spreadsheets.
Chapter 06: Lecture Notes (CSIT 104) 1 Copyright © 2008 Pearson Prentice Hall. All rights reserved. 1 1 Copyright © 2008 Prentice-Hall. All rights reserved.
Computer Science 1000 Spreadsheets II Permission to redistribute these slides is strictly prohibited without permission.
Chapter 9 Creating Formulas that Manipulate Text Microsoft Office Excel 2003.
DAY 6: MICROSOFT EXCEL – CHAPTER 2 CONTD. MICROSOFT EXCEL – CHAPTER 3 Akhila Kondai September 04, 2013.
Exploring Excel 2003 Revised - Grauer and Barber 1 Committed to Shaping the Next Generation of IT Experts. Chapter 1 – Introduction to Excel: What is a.
Lesson 1 – Microsoft Excel The goal of this lesson is for students to successfully explore and describe the Excel window and to create a new worksheet.
Introduction to Spreadsheet Software. Spreadsheets and Their Uses Examples of Charts Spreadsheet Basics Spreadsheet Map Types of Spreadsheet Data Navigating.
ACOT Intro/Copyright Succeeding in Business with Microsoft Excel
GIS 1 GIS Lecture 4 Geodatabases. GIS 2 Outline Administrative Data Example Data Tables Data Joins Common Datasets Spatial Joins ArcCatalog Geodatabases.
© Paradigm Publishing, Inc Access 2010 Level 2 Unit 1Advanced Tables, Relationships, Queries, and Forms Chapter 1Designing the Structure of Tables.
Miscellaneous Excel Combining Excel and Access. – Importing, exporting and linking Parsing and manipulating data. 1.
ACIS Introduction to Data Analytics & Business Intelligence Clarity.
CS212: DATA STRUCTURES Lecture 10:Hashing 1. Outline 2  Map Abstract Data type  Map Abstract Data type methods  What is hash  Hash tables  Bucket.
ACIS Introduction to Data Analytics & Business Intelligence Business Analytics E-commerce Data Analytics Simplicity.
Introduction to Excel What Can I Do with a Spreadsheet?
1 ADVANCED MICROSOFT EXCEL Lesson 9 Applying Advanced Worksheets and Charts Options.
Microsoft Excel Training Course for WD Early Recruitment Program.
Exploring Office 2003 Vol 1 2/e - Grauer and Barber 1 Committed to Shaping the Next Generation of IT Experts. Chapter 1 – Introduction to Excel: What is.
Cell Alignment By default, text is left aligned and values are right aligned. You can also adjust vertical alignment.
 Agenda: 4/24/13 o External Data o Discuss data manipulation tools and functions o Discuss data import and linking in Excel o Sorting Data o Date and.
GIS 1 GIS Lecture 4 Geodatabases Copyright – Kristen S. Kurland, Carnegie Mellon University.
Data Mining Supplemental Resources on Class Website.
ACIS Introduction to Data Analytics & Business Intelligence Data Mining Accuracy Design Goal.
ACIS Introduction to Data Analytics & Business Intelligence Verifiabilit y.
Chapter 10 Hashing. The search time of each algorithm depend on the number n of elements of the collection S of the data. A searching technique called.
1.  Formatting applied to spreadsheet for the purpose of organizing and clarifying information.  Data presented in a uniform and consistent format 
Introduction to Data Analytics & Business Intelligence Visualizati on:OLAP Pivot Tables.
ACIS Introduction to Data Analytics & Business Intelligence Database s Benefits & Components.
Text Mining Supplemental Resources on Class Website.
CHAPTER 17 INTRODUCTION TO SPREADSHEETS. SPREADSHEETS Application Software designed to aid users in entering, moving,copying, labeling, displaying and.
Lesson 6 Formatting Cells and Ranges. Objectives:  Insert and delete cells  Manually format cell contents  Copy cell formatting with the Format Painter.
Spreadsheets COE 201- Computer Proficiency. Basic Interface Excel Book = Word Document Every book can contain up to 255 different sheets.
Understanding Microsoft Excel Lesson 1 – Microsoft Excel 2013.
CHAPTER 1 – INTRODUCTION TO ACCESS Akhila Kondai September 30, 2013.
Office 2003 Introductory Concepts and Techniques M i c r o s o f t Excel Project 1 Creating a Worksheet and an Embedded Chart.
1 Chapter 2 Basic SQL SELECT Statements. 2 Chapter Objectives Distinguish between an RDBMS and an ORDBMS Identify keywords, mandatory clauses, and optional.
1 CSE 2337 Chapter 7 Organizing Data. 2 Overview Import unstructured data Concatenation Parse Create Excel Lists.
1 PEER Session 02/04/15. 2  Multiple good data management software options exist – quantitative (e.g., SPSS), qualitative (e.g, atlas.ti), mixed (e.g.,
ACIS Introduction to Data Analytics & Business Intelligence Spreadsheet Design Simplicity.
ACIS Introduction to Data Analytics & Business Intelligence Business Intelligence Logical Functions Part 2.
ACIS Introduction to Data Analytics & Business Intelligence Business Intelligence Logical Functions Part 1.
ME 142 Engineering Computation I
Lesson 3: Using Formulas
Miscellaneous Excel Combining Excel and Access.
Creates the file on disk and opens it for writing
Learning How to “Excel”
Spreadsheet/Worksheet/ Excel
After completing this lesson, you will be able to:
MS-Office It is a Software Package It contains some programs like
Understanding Microsoft Excel
Spreadsheets and Charts A Beginner’s Guide
Unit# 6: ICT Applications
Creates the file on disk and opens it for writing
Understanding Microsoft Excel
Lesson 14 Spreadsheet Software.
Exploring Microsoft Excel 2003
Presentation transcript:

ACIS Introduction to Data Analytics & Business Intelligence Text Mining Data Cleaning

Concept Map Text Mining Implementation Mixed Cell References Design: Accuracy Random Search, Left, Right, Mid, Len, & Paste Values

Objectives Define Text Mining Demonstrate Excel features that support text mining.

Segment A: Text Mining

Text Analytics / Text Mining Software that searches vast amounts of textual data (unstructured) identifying patterns.

Nestle Nestle processes Social Media 026?videoId=

Segment B: Text Functions

Text Mining Search Parse Concatenate SEARCH LEFT, MID, RIGHT, LEN &

Name Example Open Grades Textfile.xlsx. Divide Last Name, First Name into two separate columns. 1.Locate the comma (SEARCH) 2.Extract all characters to left of comma (LEFT) 3.Locate end of full name (LEN) 4.Extract almost all characters between comma and end of name (RIGHT)

SEARCH Function

LEFT Function

LEN or Length Function

RIGHT Function

MID Function Extract the first initial of first name.

Concatenate Combine First Name, space and Last Name. & is the concatenate symbol Quotes are required around constant strings of text

Student ID Example Extract each student’s PID from their address. Create a new student identifier by combining the first three letters of the last name with the last four digits of the student ID number.

Segment C: Data Cleaning & Generation

Data Cleaning Delete Unnecessary Columns & Rows Resize Columns Format Numeric Values Separate Distinct Values Shorten Lengthy Values Data Validation for Future Entries Generate Values

Favorite Pie Example

1.Ensure pie flavor data is consistent. 2.Replace confidential clicker ID # with randomly generated 6 digit number. 3.Ensure new ID number is static and unique.

Favorite Pie Example OriginalSortedConsistent

Random Number Functions =RAND() =RANDBETWEEN(low#, high#)

Paste Special - Values MAC: Edit Menu, Paste Special

Exam Feedback Example Open Exam Feedback.xlsx