Skip to content

R script that analyzes the relationship between subjects who grew up in lower-income families on the basis of certain economic factors and proportion of individuals reporting incomes in the top quintile by age 30.

Notifications You must be signed in to change notification settings

adhaase/upward-mobility

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

The Potential of Upward Mobility in Lower (income) Class Adults

0. Contents

  • Two main R scripts: Region Analysis & Model Validation (test/training sets)
  • second data set based on: Chetty, Ehrlinger, Mitchum, & Dweck Study Created two R scripts that analyzed the relationship between subjects who grew up in lower-income families on the basis of location and proportion of individuals reporting incomes in the top quintile by age 30; the second script focuses on individual elements (Income, Education, etc.) that potentially contribute to economic mobility - models have been fit for a subset of these variables, training/test data sets used for analysis.

I. Content Overview

I.1 - Region Analysis

We are provided a dataset concerning 40 subjects who grew up in lower-income families. Specifically, this dataset presented variables labeled: zone, state, region, N, n, n.lowstart, and p.upmover. In each sample, n is the overall number of subjects from the indicated “zone” and n.lowstart is the number of these subjects whose family income was in the bottom quintile of U.S. household incomes. The proportion of these individuals reporting incomes in the top quintile as of 2010 is presented as "p.upmover". This fraction is also a reflection of economic mobility that will be heavily utilized in the analysis phase.

Interpretation of the upward mobility relationship will formulate from two main statistical outlets. I first studied upward mobility in each of the 40 commuting zones using the Wilson confidence interval for the 95% case. From there I then investigated how geography in terms of region (West, Midwest, Northeast, South) relates to upward mobility by using a Likelihood Ratio test over region and p.upmover. This R script essentially outlines the relationship between these variables to deduce the possibility of financial "upward mobility" in the life of a relatively lower income individual.

I.2 - Model Validation

Analyzed which characteristics of communities that contribute to economic upward mobility. Used data assembled by Chetty and coauthors (2014) to predict the likelihood of "upmoving" by commuting zone, using prior information about those commuting zones that's available in Chetty et al's publicly released data. (EMD study)

Statistical Methods used in analysis:

  • Created 95% confidence intervals utilizing the Wilson method
  • Conducted hypothesis testing via the Likelihood Ratio test (as the dataset provided full binomial data, something like ANOVA/Tukey, etc. would have been unnecessary)

New data set based on: Ehrlinger, Mitchum, & Dweck Study

  • Creation of test/training sets. Initial analysis performed on training set.
  • Fitted 6 models with LR-test and ANOVA.
  • Reinforced modeling with natural spline bases
  • Used test data set and Holm-Bonferroni method to confirm significance and model results

About

R script that analyzes the relationship between subjects who grew up in lower-income families on the basis of certain economic factors and proportion of individuals reporting incomes in the top quintile by age 30.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages