Download presentation
Presentation is loading. Please wait.
1
1 Finite Population Inference for Latent Values Measured with Error from a Bayesian Perspective Edward J. Stanek III Department of Public Health University of Massachusetts Amherst, MA
2
2 Collaborators Parimal Mukhopadhyay, Indian Statistics Institute, Kolkata, India Viviana Lencina, Facultad de Ciencias Economicas, Universidad Nacional de Tucumán, CONICET, Argentina Luz Mery Gonzalez, Departamentao de Estadística, Universidad Nacional de Colombia, Bogotá, Colombia Julio Singer, Departamento de Estatística, Universidade de São Paulo, Brazil Wenjun Li, Department of Behavioral Medicine, UMASS Medical School, Worcester, MA Rongheng Li, Shuli Yu, Guoshu Yuan, Ruitao Zhang, Faculty and Students in the Biostatistics Program, UMASS, Amherst
3
3 Outline Review of Finite Population Bayesian Models 1.Populations, Prior, and Posterior 2.Notation 3.Example 4.Exchangeable distribution Addition of Measurement Error 1.Latent values and Response 2.Heterogeneous variances 3.Prior distribution of response for a prior point (vector of latent values) 4.Prior and Data- matching the subjects in the data to random variables in the population 5.Subsets of prior points: i.for populations not including some subjects in the data ii.for populations including subjects in data, where the sample doesn’t include the subjects in the data iii.for populations and samples that include subjects in data. 6.Posterior points (corresponding to 5iii) 7.Marginal posterior points (over measurement error among remaining subjects)
4
4 Bayesian Model General Idea Populations # Posterior Populations: Data Prior Posterior # Prior Populations: Prior Probabilities Posterior Probabilities Review
5
5 Bayesian Model Population Notation Population Label Latent Value Labels Parameter Vector Data Vector Review
6
6 Exchangeable Prior Bayesian Model- Example: H=3, N=3, n=2 Populations Data Prior 1053 56 1221053 56 122 Posterior 1053 56 122 Review
7
7 Exchangeable Prior Bayesian Model- Example: H=3, N=3, n=2 Populations Data Prior Posterior 1053 56 122 1053 56 1221053 56 122 105 Suppose the Data is Prior Review
8
8 Exchangeable Prior Bayesian Model- Example: H=3, N=3, n=2 Populations Data Prior Posterior 1053 56 122 1053 56 1221053 56 122 105 Prior Review
9
9 Exchangeable Prior Bayesian Model- Example: H=3, N=3, n=2 Populations Data Prior Posterior 1053 5 56 1221053 56 Posterior Prior Review
10
10 Exchangeable Prior Populations General Idea When N=3 Each Permutation p of subjects in L (i.e. each different listing) Joint Probability Density Must be identical Exchangeable Random Variables The common distribution General Notation Assigns (usually) equal probability to each permutation of subjects in the population. Review
11
11 Exchangeable Prior Populations N=3 Potential Response for Each Listing of subjects Listings Latent Values for Listing Latent Values for permutations of listing Review
12
12 Exchangeable Prior Population Permutations Rose Daisy Lily Listing p=1 Review
13
13 Exchangeable Prior Populations N=3 Permutations Rose Daisy Lily Listing p=1 Review
14
14 Exchangeable Prior Populations N=3 Rose Daisy Lily Listing p=2 Review
15
15 Exchangeable Prior Populations N=3 Permutations of Listings Listing p=1 Listing Listing p=2 Listing p=3 Listing p=4 Listing p=5 Listing p=6 Review
16
16 Exchangeable Prior Populations N=3 Potential Response for Each Listing of subjects Listings Latent Value Vectors for permutations of listing Potential response for Random Variables For Listing p Circled points are equal and have equal probability, for different listings. Listing Review
17
17 Exchangeable Prior Populations N=3 Permutations of Listings Listing p=1 Same Point in Listing Listing p=2 Listing p=3 Listing p=4 Listing p=5 Listing p=6 Review
18
18 Exchangeable Prior Populations N=3 Potential Response for Each Listing of subjects Listings Latent Value Vectors for permutations of listing Potential response for Random Variables For Listing p Circled points are equal and have equal probability, And are the same point for different listings. Same Point in Listing Review
19
19 Bayesian Model Link between Prior and Data Populations Data Prior # Prior Populations: N=3 Suppose n=2 Realizations of are the Data Review
20
20 Bayesian Model Exchangeable Prior Populations N=3 10 5 5 2 2 Listing p=1 Sample Space n=2 Prior Review
21
21 Bayesian Model Exchangeable Prior Populations N=3: Sample Point n=2 10 5 5 2 2 Listing p=3 10 5 5 2 2 Listing p=4 10 5 5 2 2 Listing p=1 Listing p=2 10 5 5 2 2 5 5 2 2 Listing p=5 10 5 5 2 2 Listing p=6 Review
22
22 Exchangeable Prior Populations N=3 Sample Points Rose Daisy Lily Listing p=1 Review
23
23 Exchangeable Prior Populations N=3 Sample Points When 10 5 5 2 2 Listing p=1 Sample Space n=2 when Prior Listing p=1 Review
24
24 Exchangeable Prior Populations N=3: Sample Points n=2 10 5 5 2 2 Listing p=3 10 5 5 2 2 Listing p=4 10 5 5 2 2 Listing p=1 Listing p=2 10 5 5 2 2 5 5 2 2 Listing p=5 10 5 5 2 2 Listing p=6 Positive Prob. Review
25
25 Exchangeable Prior Populations N=3 Sample Points with Positive Probability n=2 10 5 5 2 2 Listing p=3 10 5 5 2 2 Listing p=4 10 5 5 2 2 Listing p=1 Listing p=2 10 5 5 2 2 5 5 2 2 Listing p=5 10 5 5 2 2 Listing p=6 Review
26
26 Exchangeable Prior Populations N=3 Posterior Random Variables Prior Data If permutations of subjects in listing p are equally likely: Random variables representing the data are independent of the remaining random variables. The Expected Value of random variables for the data is the mean for the data. Review where and
27
27 Data without Measurement Error Data (set) Vectors permutation matrix, k=1,…,n! and to be anLet Data (set of vectors) Latent Value For simplicity, denote by
28
28 Data with and without Measurement Error No Measurement Error Latent Values Data With Measurement Error Vectors Sets Data Realization at t Potential Response
29
29 Data with Measurement Error the realization of on occasion t The realization of Sets Data un-observed latent value Assume: Measurement errors are independent between any two subjects Measurement errors are independent when repeatedly measured on a subject
30
30 Measurement Error Model The Data Vectors Potential response Define Latent ValuesResponse Error Variance
31
31 Measurement Error Model Prior Random Variables Populations Prior # Prior Populations: Population h Labels: Parameter Prior Probabilities Assume Random Variables representing a population are exchangeable Defines the axes for points in the prior and the measurement variance indicates initial order vector Latent Values: Measurement Variance:
32
32 Exchangeable Prior Populations N=3 Rose Daisy Lily Single Point
33
33 Measurement Error Model Prior Random Variables Population h, Prior # Prior Populations: Vectors Assume Random Variables representing a population are exchangeable When p=1, define Sets Prior
34
34 Prior Random Variables and Data with Measurement Error If permutations of subjects in listing p are equally likely: Assume Random Variables representing a population are exchangeable in each population Since or initial listing p=1 Prior Data
35
35 Prior Random Variables and Data with Measurement Error Prior Random Variables that will correspond to Latent values for subjects In the data Remaining Prior Random Variables Prior Data
36
36 Prior Random Variables and Data with Measurement Error Prior Random Variables that will correspond to Latent values for subjects In the data Remaining Prior Random Variables Prior Data
37
37 Consider Possible Populations in the Posterior Set of Subjects in Population: Set of Subjects in the Data: Populations possible in the posterior: Only Populations that include the set of subjects in the data: Prior Data
38
38 Possible Points for Possible Populations in the Posterior Set of Subjects in Population: Set of Subjects in the Data: Not in Posterior Populations possible in the posterior: Only Populations that include the set of subjects in the data: Also: Only points where the set of subjects in the data match the subjects representing the point corresponding to the data in the prior For possible population, possible points in the posterior: Prior Data
39
39 Initial Order k=1 Labels Latent Values Data Potential Response Also: the set of subjects in the data must match the set of subjects representing the point corresponding to the data in the prior Prior Data Measurement Variances Possible Points for Possible Populations with Measurement Error Possible Populations in the Posterior
40
40 Possible Points for Possible Populations with Measurement Error Initial Order k=1 Labels Latent Values Data Potential Response Define Prior Data Initial Listing Possible Populations in the Posterior The set of subjects in the data must match the subjects representing the point corresponding to the data in the prior Measurement Variances
41
41 Possible Points for Possible Populations with Measurement Error Initial Order k=1 Labels Latent Values Data Potential Response Define Prior Data Possible Populations in the Posterior The set of subjects in the data must match the subjects representing the point corresponding to the data in the prior Measurement Variances
42
42 Possible Points for Possible Populations with Measurement Error Initial Order k=1 Labels Latent Values Possible Populations in the Posterior Data Potential Response Measurement Variances Prior Data The set of subjects in the data must match the subjects representing the point corresponding to the data in the prior
43
43 Possible Points for Possible Populations with Measurement Error Possible Populations in the Posterior Data Potential Response Prior Data The set of subjects in the data must match the subjects representing the point corresponding to the data in the prior Possible Points in the Population Points in Prior where Subjects match Data or Points in the Posterior Labels Latent Values Measurement Variances
44
44 Possible Points for Possible Populations with Measurement Error Data Potential Response Points in the Posterior Possible Populations in the Posterior Possible Points in the Population Points Not in the Posterior Labels Latent Values Measurement Variances
45
45 Possible Points for Possible Populations with Measurement Error Possible Populations in the Posterior Data Potential Response Possible Points in the Population Labels Latent Values Measurement Variances Points in the Posterior
46
46 Possible Points for Possible Populations with Measurement Error Points in the Posterior Populations in the Posterior Prior Data Posterior
47
47 Possible Points for Possible Populations with Measurement Error Points in the Posterior Populations in the Posterior Prior Data Posterior Random Variables for subject labels (assuming permutations are equally likely in the prior) Posterior
48
48 Possible Points for Possible Populations with Measurement Error Points in the Posterior Populations in the Posterior Prior Data Posterior Random Variables for latent values (assuming permutations are equally likely in the prior) Posterior
49
49 Possible Points for Possible Populations with Measurement Error Points in the Posterior Populations in the Posterior Prior Data Posterior Random Variables for Measurement Variance (assuming permutations are equally likely in the prior) Posterior
50
50 Posterior Random Variables with Measurement Error Subjects Latent values Response
51
51 Posterior Random Variables with Measurement Error Subjects Latent valuesResponse Observed (in the data) Define Response Marginal over Measurement Error for Subjects that are not in the data Since Condition on to obtain the posterior distribution
52
52 Posterior Random Variables with Measurement Error If permutations of subjects in listing p are equally likely: where Posterior
53
53 Posterior Random Variables with Measurement Error If permutations of subjects in listing p are equally likely: where Posterior
54
54 Posterior Random Variables with Measurement Error If permutations of subjects in listing p are equally likely: where Posterior
55
55 Posterior Random Variables with Measurement Error Finite Population Mixed Model for the subjects in the Data: where random subject effect Use this model to obtain the best linear unbiased predictor of the latent value for a subject in the data (which we call the BLUP for a realized subject) Posterior
56
56 Finite Population Mixed Model (FPMM) for Subjects in the Data based on the Posterior Random Variables What is the Latent value for a subject in the data? Data (set) Vectors Latent Value for subject s with label FPMM Data (set of vectors) Latent Values
57
57 Finite Population Mixed Model (FPMM) for Subjects in the Data based on the Posterior Random Variables What is the Latent value for a subject in the data? Data Latent Value for subject s with label FPMM Example (n=2) In the data, the permutation matrices are not random- they just differ. FPMM In the posterior distribution, the permutation matrices are random! … a consequence of using the prior
58
58 Posterior Random Variables with Measurement Error If permutations of subjects in listing p are equally likely: where Posterior Finite Population Mixed Model for the subjects in the Data: where random subject effect Use this model to obtain the best linear unbiased predictor of the latent value for a subject in the data (which we call the BLUP for a realized subject)
59
59 Posterior Random Variables with Measurement Error FPMM to Predict Latent Values Finite Population Mixed Model for the subjects in the Data: where random subject effect Use this model to obtain the best linear unbiased predictor of the latent value for a subject in the data (which we call the BLUP for a realized subject)
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.