15 Lecture 6: The Multiple Regression Model II

Slides

5 The Simple Regression Model (link)

15.1 Introduction

## 
## Attaching package: 'ggpubr'

## The following objects are masked from 'package:tidylog':
## 
##     group_by, mutate

We continue studying the simple regression model.

Figure 15.1: Slides for 7 The Multiple Regression Model.

15.2 Vignette 6.1

Once again, let’s simulate some data. Maybe we are interested in urban and rural towns (70% are urban) :

df <- tibble(urban = sample(c(0,1),500,replace=T,prob=c(.3,.7))) %>%
  ## Urban towns spend, on average, $3 million more on wages than rural towns
  mutate(expen_wages = 3*urban+runif(500,min=0,max=4)) %>%
  ## Urban towns are also have greater incomes (e.g., from taxes), but these are reduced by their high wage expenditures:
  mutate(log_income = 1 + 2*urban - .3*expen_wages + rnorm(500,mean=2)) ## <- Population Eq.

Now we can estimate the effect of wage expenditure on income:

model_a <- lm(log_income ~ expen_wages, data = df) 
summary(model_a)

## 
## Call:
## lm(formula = log_income ~ expen_wages, data = df)
## 
## Residuals:
##     Min      1Q  Median      3Q     Max 
## -3.7311 -0.8162 -0.0090  0.8356  3.7481 
## 
## Coefficients:
##             Estimate Std. Error t value Pr(>|t|)    
## (Intercept)  2.77873    0.12406  22.399  < 2e-16 ***
## expen_wages  0.07650    0.02735   2.797  0.00536 ** 
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 1.161 on 498 degrees of freedom
## Multiple R-squared:  0.01547,    Adjusted R-squared:  0.01349 
## F-statistic: 7.823 on 1 and 498 DF,  p-value: 0.005358

Wait what? (Interpret a log ~ level)

15.3 Vignette 6.2

Let’s see… How can we remove everything from wages that is explained by urban? How can we remove everything from income that is explained by urban?

df %>% group_by(urban) %>%
  summarise(income_urb= mean(log_income))

## summarise: now 2 rows and 2 columns, ungrouped

## # A tibble: 2 × 2
##   urban income_urb
##   <dbl>      <dbl>
## 1     0       2.40
## 2     1       3.41