Published by Luciano Stickels. Modified over 10 years ago.
1
Texas SDC/BIDC Conference for Data Users May 22, 2013 Austin, TX Techniques for Reallocating State Estimates of the Undocumented Immigrant Population to Small Area Geographies
2
Texas is one of the fastest growing states, with migration making up 45% of this growth. The issue of immigration, especially unauthorized or illegal migration, is critical when planning and considering: – Concerns about border security – Concerns about economic impact on receiving communities – Concerns about resulting shifts in the social characteristics of communities. With the exception of California, sub-state level estimates of the undocumented population are not available. Rationale
3
Conventionally, estimates of the undocumented population are produced using the residual method (Warren 2011; Passel 2010, 2011). – Estimates of legal foreign-born residents are subtracted from estimates of the total foreign-born population. The most commonly used national and state estimates are those of the Pew Hispanic Center, the Dept. of Homeland Security, and R. Warren. Background
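The subtraction behind the residual method can be written out as a minimal sketch. The function name and the population figures below are hypothetical and chosen only for illustration; they are not taken from the Pew, DHS, or Warren estimates.

```python
def residual_estimate(total_foreign_born: float, legal_foreign_born: float) -> float:
    """Residual method: unauthorized = total foreign born - legally resident foreign born."""
    if legal_foreign_born > total_foreign_born:
        raise ValueError("legal foreign-born count cannot exceed the total foreign-born count")
    return total_foreign_born - legal_foreign_born


# Hypothetical inputs (illustrative only, not real estimates).
total_fb = 4_000_000   # survey-based estimate of all foreign-born residents
legal_fb = 2_400_000   # administrative estimate of legally resident foreign born

unauthorized = residual_estimate(total_fb, legal_fb)
print(unauthorized)
```

Both inputs must exist for the geography of interest, which is why the method is hard to apply below the state level.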
5
Source: Pew Hispanic Center, 2011
6
Background Source: Warren, 2010
7
The residual method presents challenges when attempting to produce estimates at lower geographies because the required data are unavailable. Challenge
8
Hill & Johnson (2011) employ a methodology that combines census population data with new administrative data, allowing estimation of the total unauthorized population and its distribution at sub-state level geographies. 80 percent of unauthorized immigrants report filing federal income taxes and about 75 percent report having payroll taxes withheld (Porter 2005; Hill et al. 2010). Estimates suggest over half of unauthorized immigrants already pay income and payroll taxes through withholding, filed tax returns, or both (Orrenius and Zavodny 2012). Literature Review
9
Since immigrants without work authorization do not have valid Social Security numbers, many instead use Individual Taxpayer Identification Numbers (ITINs) issued by the Internal Revenue Service (IRS) when filing tax returns. Hill et al. (2011) have shown a high correlation (0.96 < r < 0.98) between ITIN filers and unauthorized immigrant estimates in the U.S. Literature Review
10
– To reallocate Texas state estimates of the unauthorized to the county level using ITIN data. – To expand upon this new estimation method by employing spatial prediction techniques to refine the distribution of unauthorized immigrants across the state. Objectives
11
R. Warren's 2008 state-level estimates of the unauthorized, 2008 IRS Individual Taxpayer Identification Number (ITIN) administrative data, American Community Survey (ACS) 2008 estimates of relevant sociodemographic characteristics, and U.S. Bureau of Economic Analysis (BEA) local employment data for 2008. Data Sources
12
Not all unauthorized immigrants file tax returns and not all ITIN filers are unauthorized (Hill et al. 2011). Hill & Johnson use regression analysis, incorporating economic and sociodemographic characteristics related to unauthorized immigrant status, to predict a state-level ratio of ITIN filers to unauthorized immigrants. Methodology
13
Model 1 borrows the Hill & Johnson method to identify parameters useful for modeling the ratio of ITIN filers to the unauthorized state estimate. Model 2 is a simple OLS regression that uses the parameters identified in Model 1 to estimate the ITIN-to-unauthorized ratio at the county level. Model 3 is a geographically weighted regression model that incorporates a county-specific ratio estimating ITIN filers as a percentage of unauthorized immigrants by county. The final step involves applying the respective predicted values from each of these models and scaling them to Warren's statewide estimate. Methodology
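The final scaling step can be sketched as follows. This is one plausible reading, not the authors' code: it assumes each county's raw estimate is its ITIN filer count divided by the predicted ITIN-to-unauthorized ratio, and that raw estimates are then rescaled proportionally so they sum to the statewide total. The county labels and all numbers are hypothetical.

```python
def allocate_to_counties(itin_filers, predicted_ratio, state_total):
    """Turn county ITIN counts into estimates that sum to the statewide total.

    Assumption: raw county estimate = ITIN filers / predicted (ITIN / unauthorized) ratio.
    """
    raw = {county: itin_filers[county] / predicted_ratio[county] for county in itin_filers}
    scale = state_total / sum(raw.values())  # proportional adjustment to the control total
    return {county: value * scale for county, value in raw.items()}


# Hypothetical counties and values (illustrative only).
itin_filers = {"A": 1000, "B": 500, "C": 250}
predicted_ratio = {"A": 0.5, "B": 0.25, "C": 0.5}
state_total = 9000  # stands in for the statewide estimate

county_estimates = allocate_to_counties(itin_filers, predicted_ratio, state_total)
print(county_estimates)
```

Proportional scaling preserves each county's share of the raw total, so the models only determine the distribution while the statewide estimate fixes the level.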
14
Run a weighted least squares regression, weighted by foreign-born residents, using a backward elimination stepwise method:
$$(\text{ITIN}/\text{Warren Estimate})_s = X_s \alpha + W_s \beta + Z_s \gamma + \varepsilon_s$$
This ratio is then used as a factor to allocate the unauthorized populations at the county level.
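A minimal sketch of the weighted least squares fit, using NumPy only. It shows the weighting step and omits backward elimination. The predictors, weights, and coefficients are synthetic and hypothetical, and the data are noise-free so that the fit recovers the generating coefficients.

```python
import numpy as np


def weighted_least_squares(X, y, w):
    """Return coefficients minimizing sum_i w_i * (y_i - b0 - x_i . b)^2.

    A column of ones is prepended, so the first coefficient is the constant.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    w = np.asarray(w, dtype=float)
    design = np.column_stack([np.ones(len(y)), X])
    root_w = np.sqrt(w)  # scaling rows by sqrt(w) turns WLS into ordinary least squares
    coef, *_ = np.linalg.lstsq(design * root_w[:, None], y * root_w, rcond=None)
    return coef


rng = np.random.default_rng(0)
n_states = 51
X = rng.uniform(0, 30, size=(n_states, 2))            # hypothetical percentage predictors
foreign_born = rng.uniform(1e4, 1e6, size=n_states)   # hypothetical regression weights
true_coef = np.array([0.2, -0.005, 0.02])             # hypothetical constant and slopes
ratio = true_coef[0] + X @ true_coef[1:]              # noise-free ITIN / estimate ratio

coef = weighted_least_squares(X, ratio, foreign_born)
print(coef)
```

Weighting by foreign-born residents gives states with larger foreign-born populations more influence on the fitted coefficients.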
15
Parameters                      Model 1: Texas-specific statewide model
% born in Central America       -0.006
% not in labor force            -0.013
% manufacturing employment       0.025
% new return                     0.046
Constant                         0.219
R-squared                        0.52
N                                51
Results
19
County-Specific Models
Parameters                      OLS Model    GWR Model (mean)
% born in Central America       -0.122       -0.190
% not in labor force            -0.663       -0.603
% manufacturing employment       1.181        0.649
% new return                     7.282        7.440
Constant                         0.012        0.040
R-squared                        0.15         0.49
N                                254 (both models)
AIC                              195.88       112.11
Results
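The GWR column reports means because geographically weighted regression fits a separate weighted regression at each county. The sketch below illustrates that idea under stated assumptions: a fixed-bandwidth Gaussian kernel, synthetic coordinates, and synthetic noise-free data. The presentation does not state the kernel, bandwidth, or software used.

```python
import numpy as np


def gwr_coefficients(coords, X, y, bandwidth):
    """Fit one distance-weighted regression per location.

    Returns an (n, p + 1) array: row i holds the constant and slopes estimated
    at location i, with neighbors weighted by a Gaussian kernel of distance.
    """
    coords = np.asarray(coords, dtype=float)
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    design = np.column_stack([np.ones(len(y)), X])
    local = np.empty((len(y), design.shape[1]))
    for i, point in enumerate(coords):
        dist = np.linalg.norm(coords - point, axis=1)
        w = np.exp(-0.5 * (dist / bandwidth) ** 2)  # nearer counties get more weight
        root_w = np.sqrt(w)
        coef, *_ = np.linalg.lstsq(design * root_w[:, None], y * root_w, rcond=None)
        local[i] = coef
    return local


rng = np.random.default_rng(1)
n = 60
coords = rng.uniform(0, 10, size=(n, 2))   # hypothetical county centroids
X = rng.uniform(0, 1, size=(n, 2))         # hypothetical predictors
true_coef = np.array([0.04, -0.2, 0.6])    # hypothetical, spatially constant
y = true_coef[0] + X @ true_coef[1:]       # noise-free response

local_coef = gwr_coefficients(coords, X, y, bandwidth=3.0)
mean_coef = local_coef.mean(axis=0)        # analogous to a "GWR Model (mean)" column
print(mean_coef)
```

Because the synthetic relationship is the same everywhere, every local fit returns the same coefficients. With real data the rows of `local_coef` would differ by county, and a lower AIC than OLS would indicate that this spatial variation improves fit.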
22
Estimates of the Unauthorized Immigrant Population, 2008
24
The GWR model was a better fit than the OLS model. Higher unauthorized estimates were found in areas characterized by agriculture, urbanicity, high employment, fast Hispanic population growth, and substantial foreign-born populations. These areas include counties in the Dallas-Fort Worth-Arlington, Houston-Baytown-Sugar Land, and Austin-Round Rock metropolitan areas, large border counties, and counties in parts of East Texas. When examined as a percentage of the county population, Panhandle counties and counties in the Dallas and border areas have higher percentages. Results
25
– Estimate models specific to Texas – Explore trends from available data – Explore other spatial techniques. Future Directions
26
Laura Hill & Hans Johnson (Public Policy Institute of California) and Robert Warren. Acknowledgements
27
Contact Office: (512) 463-8390 or (210) 458-6530 E-mail: State.Demographer@osd.state.tx.us Website: http://osd.state.tx.us Office of the State Demographer