Partitioning the Sums of Squares

The sum of the squared deviations of Y from the mean of Y (YM) is called the sum of squares total and is referred to as SSY. SSY can be partitioned into the sum of squares predicted and the sum of squares error. This is analogous to the partitioning of the sums of squares in the analysis of variance.

The table below shows an example of this partitioning.

  X     Y    Y-YM   (Y-YM)²     Y'    Y'-YM   (Y'-YM)²   Y-Y'   (Y-Y')²
  2     5    -2.5     6.25     4.8     -2.7     7.29      0.2     0.04
  3     6    -1.5     2.25     6.6     -0.9     0.81     -0.6     0.36
  4     9     1.5     2.25     8.4      0.9     0.81      0.6     0.36
  5    10     2.5     6.25    10.2      2.7     7.29     -0.2     0.04
Sum:   30     0.0    17.00    30.0      0.0    16.20      0.0     0.80


The regression equation is:

Y' = 1.8X + 1.2

and YM = 7.5.
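
To see where these numbers come from, here is a minimal sketch in plain Python (not part of the original text; the variable names are illustrative) that recovers the slope, the intercept, and YM from the data with the usual least-squares formulas:

X = [2, 3, 4, 5]
Y = [5, 6, 9, 10]
n = len(X)
XM = sum(X) / n                     # mean of X = 3.5
YM = sum(Y) / n                     # mean of Y = 7.5

# Least-squares slope b = Σ(X - XM)(Y - YM) / Σ(X - XM)², intercept a = YM - b*XM
b = sum((x - XM) * (y - YM) for x, y in zip(X, Y)) / sum((x - XM) ** 2 for x in X)
a = YM - b * XM
print(round(b, 2), round(a, 2), YM)          # 1.8 1.2 7.5

# Predicted values Y' = bX + a, matching the Y' column of the table
Y_pred = [b * x + a for x in X]
print([round(yp, 1) for yp in Y_pred])       # [4.8, 6.6, 8.4, 10.2]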

Defining SSY, SSY' and SSE as:

SSY = Σ(Y - YM)²
SSY' = Σ(Y' - YM)²
SSE = Σ(Y - Y')²
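
Applying these definitions to the table gives SSY = 6.25 + 2.25 + 2.25 + 6.25 = 17.00, SSY' = 7.29 + 0.81 + 0.81 + 7.29 = 16.20, and SSE = 0.04 + 0.36 + 0.36 + 0.04 = 0.80.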

You can see from the table that SSY = SSY' + SSE (17.00 = 16.20 + 0.80), which means that the sum of squares for Y is partitioned into the sum of squares explained (predicted) and the sum of squares error. The ratio SSY'/SSY is the proportion of the variation in Y that is explained and is equal to r². For this example, r = .976, so r² = .976² = .95; likewise, SSY'/SSY = 16.2/17 = .95.
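
As a rough numerical check, the following self-contained plain-Python sketch (again not part of the original text; names such as SSY_pred are illustrative) verifies both the partition SSY = SSY' + SSE and the fact that SSY'/SSY equals r²:

from math import sqrt

X = [2, 3, 4, 5]
Y = [5, 6, 9, 10]
XM, YM = sum(X) / len(X), sum(Y) / len(Y)
Y_pred = [1.8 * x + 1.2 for x in X]                        # Y' from the regression equation

SSY      = sum((y - YM) ** 2 for y in Y)                   # Σ(Y - YM)²  = 17.00
SSY_pred = sum((yp - YM) ** 2 for yp in Y_pred)            # Σ(Y' - YM)² = 16.20
SSE      = sum((y - yp) ** 2 for y, yp in zip(Y, Y_pred))  # Σ(Y - Y')²  = 0.80

# Partition: SSY = SSY' + SSE
print(round(SSY, 2), round(SSY_pred + SSE, 2))             # 17.0 17.0

# Proportion explained equals r² (Pearson correlation between X and Y)
r = (sum((x - XM) * (y - YM) for x, y in zip(X, Y))
     / sqrt(sum((x - XM) ** 2 for x in X) * sum((y - YM) ** 2 for y in Y)))
print(round(SSY_pred / SSY, 2), round(r ** 2, 2), round(r, 3))   # 0.95 0.95 0.976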