2008. Data East company profile Company of about 85 employees Based in Akademgorodok (Novosibirsk, Russia), Founded from the “Novosibirsk Regional Center.

Slides:



Advertisements
Similar presentations
VORTEX Version Software Application Sociology; Marketing research; Social-psychological research Social-medical research Staff recruitment, staff.
Advertisements

Ch2 Data Preprocessing part3 Dr. Bernard Chen Ph.D. University of Central Arkansas Fall 2009.
1 Copyright Jiawei Han; modified by Charles Ling for CS411a/538a Data Mining and Data Warehousing  Introduction  Data warehousing and OLAP for data mining.
OLAP Tuning. Outline OLAP 101 – Data warehouse architecture – ROLAP, MOLAP and HOLAP Data Cube – Star Schema and operations – The CUBE operator – Tuning.
Copyright © 2006 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. 1 ~ Curve Fitting ~ Least Squares Regression Chapter.
Spatial Dependency Modeling Using Spatial Auto-Regression Mete Celik 1,3, Baris M. Kazar 4, Shashi Shekhar 1,3, Daniel Boley 1, David J. Lilja 1,2 1 CSE.
By: Mr Hashem Alaidaros MIS 211 Lecture 4 Title: Data Base Management System.
DATA MINING RELATIONSHIPS AMONG URBAN SOCIOECONOMIC, LAND COVER, AND REMOTELY SENSED ECOLOGICAL DATA Jeremy Mennis*, Carol, Wessman, and Nancy Golubiewski**,
Introduction to Data Mining with XLMiner
Introduction to GIS and ArcGIS How a GIS works Introduction to ArcGIS The ArcGIS Interface.
IS 466 ADVANCED TOPICS IN INFORMATION SYSTEMS LECTURER : NOUF ALMUJALLY 20 – 11 – 2011 College Of Computer Science and Information, Information Systems.
Curve-Fitting Regression
Spatial Interpolation
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
19 th Advanced Summer School in Regional Science An introduction to GIS using ArcGIS.
Introduction to GIS and ArcGIS How a GIS works Introduction to ArcGIS.
Colliers GIS Mapping & GIS For Commercial Real Estate Colliers GIS
Introducing GIS Getting to Know ArcGIS Desktop. Brief History Recap  Studying the world using maps and globes  Models are now found inside computers.
19 th Advanced Summer School in Regional Science Overview and more advanced directions with ArcGIS.
ESRM 250/CFR 520 Winter 2010 Phil Hurvitz (with thanks to J. Lawler & P. Schiess) Introduction to GIS and ArcGIS 1 of 48.
Marine GIS Applications using ArcGIS Global Classroom training course Marine GIS Applications using ArcGIS Global Classroom training course By T.Hemasundar.
Major Tasks in Data Preprocessing(Ref Chap 3) By Prof. Muhammad Amir Alam.
Classification and Prediction: Regression Analysis
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
Lecture 5 Geocoding. What is geocoding? the process of transforming a description of a location—such as a pair of coordinates, an address, or a name of.
The Future of Data Mining – Predictive Analytics.
Data Mining: Concepts & Techniques. Motivation: Necessity is the Mother of Invention Data explosion problem –Automated data collection tools and mature.
Mary LaMagna Reiter GIS Analyst, Contractor GEOG 596A.
Title: Spatial Data Mining in Geo-Business. Overview  Twisting the Perspective of Map Surfaces — describes the character of spatial distributions through.
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
Spatial Intelligence – New ideas for Integrating GIS Across the Enterprise © GeoAnalytics, Inc. 2008, all rights reserved.
GIS in Business Presentation Retail Information Systems Class October 29, 2003 Colleen M. Schelde ESRI Inc.
Spatial Statistics and Spatial Knowledge Discovery First law of geography [Tobler]: Everything is related to everything, but nearby things are more related.
CIS 9002 Kannan Mohan Department of CIS Zicklin School of Business, Baruch College.
Food Store Location Analysis Albuquerque New Mexico, 2010 Prepared for: Geography 586L - Spring Semester, 2014 Larry Spear M.A., GISP Sr. Research Scientist.
Copyright © 2005, SAS Institute Inc. All rights reserved. 1.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Multivariate Data Analysis CHAPTER seventeen.
Bayesian networks Classification, segmentation, time series prediction and more. Website: Twitter:
11/12/2012ISC471 / HCI571 Isabelle Bichindaritz 1 Prediction.
BI Terminologies.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Chapter 4 – Descriptive Spatial Statistics Scott Kilker Geog Advanced Geographic Statistics.
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
GIS in Business Presentation American Collegiate Retailing Association March 13, 2003 Colleen M. Schelde ESRI Inc.
DATABASES AND DATA WAREHOUSES
Part II: Business environment analysis with ESRI Business Analyst Desktop Getting to Know ESRI Business Analyst Fred L. Miller, PhD Murray State University.
Glenn Meyers ISO Innovative Analytics 2007 CAS Annual Meeting Estimating Loss Cost at the Address Level.
Analytics & Reporting Tool.  Outline how to access SAS OLAP Cubes through SAS AMO  Review SAS OLAP Cube creation and how it relates to integration with.
Review of fundamental 1 Data mining in 1D: curve fitting by LLS Approximation-generalization tradeoff First homework assignment.
Lecture 6: Point Interpolation
Managing Data for DSS II. Managing Data for DS Data Warehouse Common characteristics : –Database designed to meet analytical tasks comprising of data.
STA302: Regression Analysis. Statistics Objective: To draw reasonable conclusions from noisy numerical data Entry point: Study relationships between variables.
INSTITUTO NACIONAL DE ESTATÍSTICA Census 2011 Mapping Portuguese Process United Nations EGM on Contemporary Practices in Census Mapping and Use of GIS.
3/13/2016Data Mining 1 Lecture 1-2 Data and Data Preparation Phayung Meesad, Ph.D. King Mongkut’s University of Technology North Bangkok (KMUTNB) Bangkok.
Chapter Seventeen Copyright © 2004 John Wiley & Sons, Inc. Multivariate Data Analysis.
ArcGIS Online Content & Sharing Deane Kensok. Session Agenda ArcGIS Online ContentArcGIS Online Content –Overview of Online Content –Demo of Online Content.
Data Resource Management – MGMT An overview of where we are right now SQL Developer OLAP CUBE 1 Sales Cube Data Warehouse Denormalized Historical.
CHAPTER 10 DATA EXPLORATION 10.1 Data Exploration Box 10.1 Data Visualization Descriptive Statistics Box 10.2 Descriptive Statistics Graphs.
Data Mining – Intro.
Data Transformation: Normalization
Data Mining: Concepts and Techniques
CH 5: Multivariate Methods
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
GTECH 709 GIS Data Formats GIS data formats
Supporting End-User Access
Multivariate Statistics
Data Transformations targeted at minimizing experimental variance
Analytics, BI & Data Integration
Working with Temporal Data
Presentation transcript:

2008

Data East company profile Company of about 85 employees Based in Akademgorodok (Novosibirsk, Russia), Founded from the “Novosibirsk Regional Center of Geoinformation Technologies of the Russian Academy of Sciences”

Own products and services Services: GIS software development service Data preparation service Products: Extensions for ArcGIS Drive Time Engine Personal Internet Map Server Map Engine Well Tracking

DoubleGis products’ line: –Desktop system –PocketPC application Map Engine

Atlas of Siberian Region - Navigation system for Siberian region - Data East products (CityExplorer, PersonalIMS, etc.) Personal IMS Map Engine

Data preparation service

Partners and customers worldwide ESRI, Inc. (USA) ESRI UK GlobeXplorer, Inc. (USA) NewFields, Inc. (USA) Exponent (USA) InstallShield, Inc. (USA) Schlumberger The Crown Estate (UK) ChevronTexaco (USA) Shell Group De Beers Group USGS (USA) U.S. Army Corps of Engineers (USA) Bowater (Canada) Rotorua District Council (New Zealand) Geoscience Australia (Australia) Bristol City Council (UK) Newcastle City Council (UK) Bureau of Land Management (USA) U.S. Fish and Wildlife Service (USA) Tauw bv (Netherlands) Washington State Department of Ecology (USA) and more…

Data Mining in Geoinformation Systems Data Mining Tasks: Prediction Classification Clustering Associations Discovery Sequence-based Analysis On-Line Analytical Processing (OLAP)

Target variable – sales Properties of stores: Size Number of employees Number of parking spaces Trade area attributes: Demographic variables like income, age, educational obtainment, ethnicity Intersections with competitors Forecast sales for new store location

Step 1: Preparation of datasets The set of objects must be homogeneous The same measurement for different objects should be measured in the same scale The set of measurements should be complete for every object Cannot use the target variable while calculation the values for source variables The number of objects should be reach enough Prediction Task: 7 Steps to Glory

Step 2: Calibration of variables Types of variables: Boolean variable (multi-valued logics is allowed) Nominal variable Ordered nominal variable Discrete variable Continuous variable Continuous variable with constraints Continuous variable of exp-type Prediction Task: 7 Steps to Glory

Step 3: Statistical Analysis Calculate the mean value, the standard deviation for every variable Calculate the correlation matrix Step 4: Normalization of source variables Step 5: Reduction of source variables Step 6: Thinning data and finding outliers Step 7: Constructing a predictor Calculate the predictor with minimal complexity Test the predictor on independent sample dataset Prediction Task: 7 Steps to Glory

Datasets for Analysis Fact table Categorization of columns to be mapped to dimensions of the cube On-Line Analytical Processing

Cube structure: Measures Dimensions categorized in hierarchies Attributes of members Query language: MDX JOLAP Specialized On-Line Analytical Processing

Select a spatial dimension Spatial OLAP for ArcGIS Desktop

Select a geoprocessor Spatial OLAP for ArcGIS Desktop

Specify a request to OLAP provider Spatial OLAP for ArcGIS Desktop

Select dimension members Spatial OLAP for ArcGIS Desktop

Select attributes of feature layer Spatial OLAP for ArcGIS Desktop

Splines for Data Mining under dot.net SDM Data: Core objects (vectors, vector collections) Matrices Solvers of SLAEs SDM Mining: Calibrators Core Data Mining (statistics, outlier analysis, Least Squares fitter) Transformations of variables Approximation (polynomial regression, radial basic functions) SDM Splines: Univariate polynomial splines (interpolation, smoothing, averaging) Multivariate analytic splines (interpolation, smoothing, regression, spline-collocation)

Splines for Data Mining under dot.net

At Data East we are always open for cooperation and new partnership! Address: Data East, LLC P.O. Box 664, Novosibirsk , Russia Phone: +7 (383) Fax: +7 (383) Contact information