Presentation is loading. Please wait.

Presentation is loading. Please wait.

Direct or Remotely sensed

Similar presentations


Presentation on theme: "Direct or Remotely sensed"— Presentation transcript:

1 Direct or Remotely sensed
May be the same data Covariates Direct or Remotely sensed Predictors Remotely sensed Field Data Response, coordinates Sample Data Response, covariates Qualify, Prep Qualify, Prep Qualify, Prep Random split? Randomness Inputs Test Data Training Data Outputs Temp Data Processes Build Model Repeated Over and Over Randomize parameters The Model Statistics Validate Predict Randomness Predicted Values Uncertainty Maps Summarize Predictive Map

2 Direct or Remotely sensed
May be the same data Covariates Direct or Remotely sensed Predictors Remotely sensed Jackknife Field Data Response, coordinates Sample Data Response, covariates Qualify, Prep Qualify, Prep Qualify, Prep Random split? Randomness Cross-Validation Noise Injection Inputs Test Data Training Data Outputs Temp Data Processes Build Model Repeated Over and Over Randomize parameters Monte-Carlo The Model Sensitivity Testing Randomness Statistics Validate Predict Noise Injection Predicted Values Uncertainty Maps Summarize Predictive Map

3 Cross-Validation Split the data into training (build model) and test (validate) data sets Leave-p-out cross-validation Validate on p samples, train on remainder Repeated for all combinations of p Non-exhaustive cross-validation Leave-p-out cross-validation but only on a subset of possible combinations Randomly splitting into 30% test and 70% training is common

4 K-fold Cross Validation
Break the data into K sections Test on 𝐾 𝑖 , Training remainder Repeat for all 𝐾 𝑖 10-fold is common Test 1 2 3 4 Used in rpart() 5 Training 6 7 8 9 10

5 Bootstrapping Drawing N samples from the sample data (with replacement) Building the model Repeating the process over and over

6 Random Forest N samples drawn from the data with replacement
Repeated to create many trees A “random forest” “Splits” are selected based on the most common splits in all the trees Bootstrap aggregation or “Bagging”

7 Boosting Can a set of weak learners create a single strong learner? (Wikipedia) Lots of “simple” trees used to create a really complex tree "convex potential boosters cannot withstand random classification noise,“ 2008 Phillip Long (at Google) and Rocco A. Servedio (Columbia University)

8 Boosted Regression Trees
BRTs combine thousands of trees to reduce deviance from the data Currently popular More on this later

9 Sensitivity Testing Injecting small amounts of “noise” into our data to see the effect on the model parameters. Plant The same approach can be used to model the impact of uncertainty on our model outputs and to make uncertainty maps Note: I call this noise injection to separate it from sensitivity testing for model parameters

10 Jackknifing Trying all combinations of covariates
Given covariates 1,2,3 Try: 1 2 3 1,2 1,3 2,3 1,2, 3

11 Building Models Selecting the method
Selecting the covariates/predictors (“Model Selection”) Optimizing the coefficients/parameters of the model 9bytez.com:Old School Hobbies: Building Models by Hand

12 Direct or Remotely sensed
May be the same data Covariates Direct or Remotely sensed Predictors Remotely sensed Jackknife Field Data Response, coordinates Sample Data Response, covariates Qualify, Prep Qualify, Prep Qualify, Prep Random split? Randomness Cross-Validation Noise Injection Inputs Test Data Training Data Outputs Temp Data Processes Build Model Repeated Over and Over Randomize parameters Monte-Carlo The Model Sensitivity Testing Randomness Statistics Validate Predict Noise Injection Predicted Values Uncertainty Maps Summarize Predictive Map


Download ppt "Direct or Remotely sensed"

Similar presentations


Ads by Google