Sum of Squares
What is Sum of Squares?
Sum of squares is a concept frequently used in regression analysis.
Depending on which differences we choose to square, we end up with several different sums of squares.
Total Sum of Squares
The total sum of squares ($SS_{tot}$) is the sum of the squared differences between the observed dependent variable $y_i \in Y$ and its mean $\bar{y}$:
$$ SS_{tot} = \sum_{i=1}^{n} (y_i - \bar{y})^2 $$
where $n$ is the number of observations.
Graphically, in a simple linear regression with one independent variable, $SS_{tot}$ is the sum of the areas of the purple squares in the figure below.
Residual Sum of Squares
Also known as sum of squared errors (SSE).
The residual sum of squares ($SS_{res}$) is the sum of the squared differences between the observed dependent variable $y_i \in Y$ and the predicted value $\hat{y}_i$ from the regression line:
$$ SS_{res} = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 $$
Graphically, in a simple linear regression with one independent variable, $SS_{res}$ is the sum of the areas of the red squares in the figure below.
Explained Sum of Squares
Also known as model sum of squares.
The explained sum of squares ($SS_{exp}$) is the sum of the squared differences between the predicted value $\hat{y}_i$ from the regression line and the mean $\bar{y}$:
$$ SS_{exp} = \sum_{i=1}^{n} (\hat{y}_i - \bar{y})^2 $$
Graphically, in a simple linear regression with one independent variable, $SS_{exp}$ is the sum of the areas of the blue squares in the figure below.
Relationship Between the Sums of Squares
For linear regression models using Ordinary Least Squares (OLS) estimation, the following relationship holds:
$$ SS_{tot} = SS_{exp} + SS_{res} $$
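As a quick numerical illustration (a minimal sketch with made-up data, not from the original text), the following Python snippet fits a simple OLS line with numpy and checks the decomposition above:

```python
import numpy as np

# Made-up example data (for illustration only)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

# Fit a simple linear regression by OLS; np.polyfit minimizes SS_res
beta1, beta0 = np.polyfit(x, y, deg=1)  # returns [slope, intercept]
y_hat = beta0 + beta1 * x
y_bar = y.mean()

ss_tot = np.sum((y - y_bar) ** 2)      # total sum of squares
ss_res = np.sum((y - y_hat) ** 2)      # residual sum of squares
ss_exp = np.sum((y_hat - y_bar) ** 2)  # explained sum of squares

# For an OLS fit, SS_tot = SS_exp + SS_res (up to floating-point error)
print(ss_tot, ss_exp + ss_res)
```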
Ordinary Least Squares
Least squares is a common estimation method for linear regression models.
The idea is to fit a model that minimizes some sum of squares (hence the name: the least squares).
Ordinary least squares (OLS) minimizes the residual sum of squares.
$$ \hat{\beta} = \underset{\beta}{\operatorname{argmin}} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 $$
Since we are minimizing the sum of squares with respect to the parameters, we set the partial derivatives equal to zero and solve. In simple linear regression, for example:
\[\begin{align*} &\frac{\partial}{\partial \beta_0} \sum_{i=1}^{n} (y_i - \beta_0 - \beta_1 x_i)^2 = 0 \\[1em] &\frac{\partial}{\partial \beta_1} \sum_{i=1}^{n} (y_i - \beta_0 - \beta_1 x_i)^2 = 0 \end{align*}\]
Provided certain conditions hold (in matrix form, that $X^T X$ is invertible), there is a closed-form solution for $\hat{\beta}$:
$$ \hat{\beta} = (X^T X)^{-1} X^T y $$
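For the simple linear regression above, setting the two partial derivatives to zero and solving gives the standard scalar form of the same estimator (a well-known result, spelled out here as a worked step):

\[\begin{align*} \hat{\beta}_1 &= \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2} \\[1em] \hat{\beta}_0 &= \bar{y} - \hat{\beta}_1 \bar{x} \end{align*}\]

And a minimal numpy sketch of the matrix formula, again with made-up data (the explicit inverse is written out only to mirror the formula; `np.linalg.lstsq` is the numerically safer choice in practice):

```python
import numpy as np

# Made-up example data (for illustration only)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

# Design matrix with an intercept column
X = np.column_stack([np.ones_like(x), x])

# Closed-form OLS estimate: beta_hat = (X^T X)^{-1} X^T y
beta_hat = np.linalg.inv(X.T @ X) @ X.T @ y
print(beta_hat)  # [intercept, slope]

# Equivalent, numerically more stable alternative
beta_lstsq, *_ = np.linalg.lstsq(X, y, rcond=None)
print(beta_lstsq)
```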
If the errors are normally distributed given the covariates, i.e. $\varepsilon_i \mid X_i \sim N(0, \sigma^2)$, then the OLS estimator coincides with the maximum likelihood estimator (MLE).
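A short sketch of why (a standard argument, added here for completeness): under normal errors, the log-likelihood of the data is

\[\ell(\beta, \sigma^2) = -\frac{n}{2} \log(2\pi\sigma^2) - \frac{1}{2\sigma^2} \sum_{i=1}^{n} (y_i - x_i^T \beta)^2\]

For any fixed $\sigma^2$, maximizing $\ell$ over $\beta$ is the same as minimizing $\sum_{i=1}^{n} (y_i - x_i^T \beta)^2 = SS_{res}$, which is exactly the OLS objective.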
Properties of OLS
Consistent
OLS estimators are consistent:
\[\hat{\beta} \xrightarrow{p} \beta\]
Asymptotically Normal
OLS estimators are asymptotically normal:
\[\frac{\hat{\beta} - \beta}{\hat{\text{se}}(\hat{\beta})} \leadsto N(0, 1)\]
Hence you can construct normal-based confidence intervals for $\beta$.
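As a sketch of how that works in practice (assuming homoskedastic errors and the usual plug-in variance estimate $\hat{\sigma}^2 (X^T X)^{-1}$, which is not spelled out above), a 95% normal-based interval for each coefficient could be computed like this:

```python
import numpy as np

# Made-up example data (for illustration only)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

X = np.column_stack([np.ones_like(x), x])
n, p = X.shape

# OLS fit and residuals
beta_hat = np.linalg.inv(X.T @ X) @ X.T @ y
residuals = y - X @ beta_hat

# Estimated error variance and standard errors of the coefficients
sigma2_hat = residuals @ residuals / (n - p)
se_hat = np.sqrt(np.diag(sigma2_hat * np.linalg.inv(X.T @ X)))

# 95% normal-based confidence intervals: beta_hat +/- z_{0.975} * se
z = 1.96
lower = beta_hat - z * se_hat
upper = beta_hat + z * se_hat
print(np.column_stack([lower, upper]))  # one row per coefficient
```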