1
Data Organization
Data files
Databases
Data streams
2
The organization of data
Data/information must be organized
Computer/network accessible
Stored or streaming
3
Stored information
Spreadsheet
  Worksheet oriented
  Cells, columns, rows
  Can be set up to look like a table
Database
  Table oriented
  Column headings describe the fields of each row
  Rows are records
  Tables can be related (relational database)
4
Spreadsheets in Excel
Limited to 64K (65,536) rows and 256 columns in the .xls format (Excel 2003 and earlier)
Slow to compute changes
Accessed via GUI or Visual Basic
Can be exported to a relational database table
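As a rough illustration of exporting worksheet data to a relational table, the sketch below loads CSV text into a table. It assumes the worksheet was first saved as CSV, and it uses Python with the standard-library SQLite module purely as a stand-in for whichever database is in use. The sample rows, the column names, the table name `employees`, and the helper `load_csv_into_table` are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical worksheet contents, as they might look after saving as CSV.
CSV_TEXT = """name,department,salary
Ada,Engineering,5200
Grace,Research,6100
Linus,Engineering,4800
"""

def load_csv_into_table(conn, csv_text, table):
    """Create `table` from the CSV header row and insert one record per data row."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)  # the column headings become the column names
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE "{table}" ({columns})')
    # Each remaining CSV row becomes one record; values are stored as text.
    conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', reader)
    conn.commit()
    return header

conn = sqlite3.connect(":memory:")  # throwaway in-memory database
header = load_csv_into_table(conn, CSV_TEXT, "employees")
row_count = conn.execute('SELECT COUNT(*) FROM "employees"').fetchone()[0]
print(header, row_count)
```

The table is created without column types to keep the sketch short, so every value arrives as text. A real export would normally declare types and validate the headings first.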
5
Excel Worksheet
6
Relational Databases
Licensed databases
  MS Access, MS SQL Server, Oracle
Free databases
  MySQL, PostgreSQL, MaxDB, Firebird, Ingres (http://www.geocities.com/mailsoftware42/db/)
Interfaces
  GUI, SQL, programming languages (Java, Perl, PHP, C++, VB)
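To make the "SQL from a programming language" interface concrete, here is a minimal sketch. Python and SQLite are not among the languages and databases listed on the slide; they are used here only because they run without a server. The `products` table, its rows, and the helper `products_cheaper_than` are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # throwaway in-memory database
conn.execute("CREATE TABLE products (id INTEGER PRIMARY KEY, name TEXT, price REAL)")
conn.executemany(
    "INSERT INTO products (name, price) VALUES (?, ?)",
    [("pencil", 0.50), ("notebook", 2.25), ("backpack", 24.00)],
)

def products_cheaper_than(conn, limit):
    """Return (name, price) records below `limit`, cheapest first."""
    cursor = conn.execute(
        # The ? placeholder lets the driver pass the value safely,
        # instead of pasting it into the SQL string.
        "SELECT name, price FROM products WHERE price < ? ORDER BY price",
        (limit,),
    )
    return cursor.fetchall()

cheap = products_cheaper_than(conn, 5.0)
print(cheap)
```

The same pattern (connect, send SQL, read back rows as records) applies in the listed languages, although each has its own driver API.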
7
Relational Table
8
Relational Table Elements
9
Relating Tables
12
One to One Relationship
13
One to Many Relationship
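A one-to-many relationship is commonly modeled by giving the "many" table a foreign key column that points at the "one" table. The sketch below assumes hypothetical `customers` and `orders` tables (one customer, many orders) and uses Python with SQLite only for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite checks foreign keys only when enabled
conn.executescript("""
CREATE TABLE customers (
    customer_id INTEGER PRIMARY KEY,
    name        TEXT NOT NULL
);
CREATE TABLE orders (
    order_id    INTEGER PRIMARY KEY,
    customer_id INTEGER NOT NULL REFERENCES customers(customer_id),
    item        TEXT NOT NULL
);
INSERT INTO customers VALUES (1, 'Ada'), (2, 'Grace');
INSERT INTO orders VALUES (10, 1, 'keyboard'), (11, 1, 'monitor'), (12, 2, 'mouse');
""")

# Join the two tables on the shared key and count the orders of each customer.
orders_per_customer = conn.execute("""
    SELECT c.name, COUNT(o.order_id)
    FROM customers AS c
    JOIN orders AS o ON o.customer_id = c.customer_id
    GROUP BY c.customer_id
    ORDER BY c.customer_id
""").fetchall()
print(orders_per_customer)

# An order that points at a customer that does not exist should be refused.
try:
    conn.execute("INSERT INTO orders VALUES (13, 99, 'cable')")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True
print(orphan_rejected)
```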
14
Many to Many Relationship
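A many-to-many relationship is usually represented with a third, linking table whose rows pair a key from each side. The sketch below assumes hypothetical `students`, `courses`, and `enrollments` tables; Python with SQLite is used only for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE students (student_id INTEGER PRIMARY KEY, name  TEXT NOT NULL);
CREATE TABLE courses  (course_id  INTEGER PRIMARY KEY, title TEXT NOT NULL);
-- Linking table: one row per (student, course) pairing.
CREATE TABLE enrollments (
    student_id INTEGER NOT NULL REFERENCES students(student_id),
    course_id  INTEGER NOT NULL REFERENCES courses(course_id),
    PRIMARY KEY (student_id, course_id)
);
INSERT INTO students VALUES (1, 'Ada'), (2, 'Grace');
INSERT INTO courses  VALUES (100, 'Databases'), (200, 'Networks');
INSERT INTO enrollments VALUES (1, 100), (1, 200), (2, 100);
""")

def courses_for(conn, student_name):
    """Titles of every course the named student is enrolled in."""
    rows = conn.execute("""
        SELECT c.title
        FROM courses AS c
        JOIN enrollments AS e ON e.course_id = c.course_id
        JOIN students AS s ON s.student_id = e.student_id
        WHERE s.name = ?
        ORDER BY c.title
    """, (student_name,))
    return [title for (title,) in rows]

def students_in(conn, course_title):
    """Names of every student enrolled in the named course."""
    rows = conn.execute("""
        SELECT s.name
        FROM students AS s
        JOIN enrollments AS e ON e.student_id = s.student_id
        JOIN courses AS c ON c.course_id = e.course_id
        WHERE c.title = ?
        ORDER BY s.name
    """, (course_title,))
    return [name for (name,) in rows]

print(courses_for(conn, "Ada"))
print(students_in(conn, "Databases"))
```

The composite primary key on `enrollments` keeps the same student from being paired with the same course twice.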
15
Data Streams
Not stored in a file
Serial data
Real-time
Underlying model to discover changes over time
Drift in the data mining/analysis tools needed
Incremental adaptation of the learning algorithm
2006 inaugural conference: http://www.lsi.us.es/~aguilar/ds/
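One simple way to picture incremental adaptation on a stream is a running estimate that is updated as each value arrives, with a flag raised when a new value departs sharply from it. The sketch below is only an illustration under that assumption, not a method taken from the slides. The function `track_stream`, the parameters `alpha` and `threshold`, and the sample readings are all hypothetical.

```python
def sensor_readings():
    """Stand-in for a live source: values arrive one at a time and are not stored."""
    yield from [10, 10, 11, 10, 20, 20, 21, 20]

def track_stream(stream, alpha=0.5, threshold=5.0):
    """Yield (index, value, estimate) whenever a value deviates from the
    running estimate by more than `threshold`, then adapt the estimate."""
    estimate = None
    for index, value in enumerate(stream):
        if estimate is None:
            estimate = float(value)  # the first value seeds the model
            continue
        if abs(value - estimate) > threshold:
            yield index, value, estimate  # possible drift at this position
        # Incremental update: blend the new value into the estimate,
        # so older data gradually loses influence.
        estimate = (1 - alpha) * estimate + alpha * value

alerts = list(track_stream(sensor_readings()))
print(alerts)
```

With these sample readings only the jump from about 10 to 20 is flagged. The values that follow are not, because the estimate has already started moving toward the new level.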
16
Data Stream Examples
Network packets
AP news wire
Live audio / video
Control system data
Customer click streams
Telephone records
Large sets of web pages
Retail transactions
17
Summary
Data is stored or streaming
Data is organized
Data is accessible programmatically
Need to know
  How data is organized
  How to access the data
  How to apply DI tools to the data