Monday, July 02, 2007

Linear least-squares error estimation

Let z be some unknown function of x and y. Assume the function is close to linear, so we want a function
    f(x, y) = a x + b y + c
that approximates z by minimizing the total square error over a collection of N data points (xi, yi, zi). We will need to accumulate the following sums. This can be done incrementally in real time if the data arrive that way, and we can use exponentially weighted sums to give more importance to recent data points, if that makes sense.
Sx  = ∑i xi
Sy  = ∑i yi
Sz  = ∑i zi
Sxx = ∑i xi²
Syy = ∑i yi²
Sxy = ∑i xi yi
Sxz = ∑i xi zi
Syz = ∑i yi zi
N   = ∑i 1 (or the sum of the weights, with exponential weighting if desired)
Then the total square error is
    E = ∑i (zi − a xi − b yi − c)²
and we minimize it by choosing (a, b, c) where all three partial derivatives vanish:
    ∂E/∂a = ∂E/∂b = ∂E/∂c = 0

For example, ∂E/∂a = −2 ∑i xi(zi − a xi − b yi − c); setting each derivative to zero and dividing by −2 gives three linear equations in (a, b, c):
0 = Sxz − a Sxx − b Sxy − c Sx
0 = Syz − a Sxy − b Syy − c Sy
0 = Sz − a Sx − b Sy − c N
Then we can obtain (a, b, c) from linear algebra by solving this symmetric 3×3 system:
[ a ]   [ Sxx Sxy Sx ]⁻¹ [ Sxz ]
[ b ] = [ Sxy Syy Sy ]   [ Syz ]
[ c ]   [ Sx  Sy  N  ]   [ Sz  ]
Based on all this, we can write a linear least-squares estimator class in Python.
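Here is one way such a class might look. This is a minimal sketch, not a definitive implementation: the class name `PlaneFitter` and the `decay` parameter are assumptions of the sketch, with `decay` in (0, 1] providing the exponential weighting mentioned above (decay = 1.0 means plain unweighted sums). It solves the 3×3 system by Cramer's rule to avoid any dependencies.

```python
class PlaneFitter:
    """Incrementally fits f(x, y) = a*x + b*y + c by least squares.

    Name and decay parameter are illustrative assumptions; decay in
    (0, 1] exponentially down-weights older points (1.0 = no weighting).
    """

    def __init__(self, decay=1.0):
        self.decay = decay
        self.sx = self.sy = self.sz = 0.0
        self.sxx = self.syy = 0.0
        self.sxy = self.sxz = self.syz = 0.0
        self.n = 0.0

    def add(self, x, y, z):
        """Fold one data point into the running sums."""
        d = self.decay
        self.sx = d * self.sx + x
        self.sy = d * self.sy + y
        self.sz = d * self.sz + z
        self.sxx = d * self.sxx + x * x
        self.syy = d * self.syy + y * y
        self.sxy = d * self.sxy + x * y
        self.sxz = d * self.sxz + x * z
        self.syz = d * self.syz + y * z
        self.n = d * self.n + 1.0

    def fit(self):
        """Solve the 3x3 normal equations for (a, b, c) by Cramer's rule."""
        m = [[self.sxx, self.sxy, self.sx],
             [self.sxy, self.syy, self.sy],
             [self.sx,  self.sy,  self.n]]
        r = [self.sxz, self.syz, self.sz]

        def det3(t):
            # Determinant of a 3x3 matrix by cofactor expansion.
            return (t[0][0] * (t[1][1] * t[2][2] - t[1][2] * t[2][1])
                    - t[0][1] * (t[1][0] * t[2][2] - t[1][2] * t[2][0])
                    + t[0][2] * (t[1][0] * t[2][1] - t[1][1] * t[2][0]))

        d = det3(m)
        if d == 0.0:
            raise ValueError("degenerate data: need 3+ non-collinear (x, y) points")
        coeffs = []
        for k in range(3):
            # Replace column k with the right-hand side, per Cramer's rule.
            mk = [row[:] for row in m]
            for i in range(3):
                mk[i][k] = r[i]
            coeffs.append(det3(mk) / d)
        return tuple(coeffs)  # (a, b, c)
```

As a sanity check, feeding in points that lie exactly on z = 2x + 3y + 1 should recover (a, b, c) = (2, 3, 1).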
