cstef
/posts/lagrange
TOP

Lagrange Interpolation Primer

A quick and simple explaination on the concept of polynomial interpolation, here Lagrange's in particular.

29/11/2024 (modified 17/12/2024) ยท 4 min read ยท sapling ๐Ÿชด

You may remember, when in High School, being asked to find the equation of a line that goes through two points. The process was straightforward: you would calculate the slope aa and the intercept bb of the line f(x) = a x + bf(x) = a x + b.

This is a simple example of polynomial interpolation, where we are trying to find a polynomial that goes through a set of points. While the process is simple for a line, it becomes slightly more complex for higher degree polynomials.

Polynomials

One of the most, if not the most important property of polynomials, is that each one of them can uniquely be described by n+1n+1 points for a polynomial of degree nn, noted f(x) in PP_nf(x) in PP_n.

The most straightforward way of seeing this is by setting an equation for each point A_i (x_i, y_i)A_i (x_i, y_i) we have:

P_n (x) = u_0 + u_1 x + ... + u_n x^n = f(x) \

caP_n (x) = u_0 + u_1 x + ... + u_n x^n = f(x) \

ca

We can see that we have n+1n+1 equations for n+1n+1 factors of xx, noted u_i | 0 <= i <= nu_i | 0 <= i <= n, which could also be represented with the following Vandermonde matrix equation:

mat(
    1, x_0, x_0^2, dots.c, x_0^(n);
    1, x_mat(
    1, x_0, x_0^2, dots.c, x_0^(n);
    1, x_

Written as V u = yV u = y, we can solve this by inverting VV:

u = V^(-1) yu = V^(-1) y

This is doable by applying the Jordan-Gauss algorithm to the augmented matrix V | IV | I, where II is the identity matrix (diagonal filled with 11s), until we only have pivots equal to 11:

mat(
    1, x_0, x_0^2, dots.c, x_0^(n) space, spamat(
    1, x_0, x_0^2, dots.c, x_0^(n) space, spa

If you prefer to see this in a graphical way, letโ€™s consider the following example:

We have two points A_0 (1, 2)A_0 (1, 2) and A_1 (-2, -3)A_1 (-2, -3). We can represent them as follows:

#set text(size: 10pt)

#canvas({
    import draw: #set text(size: 10pt)

#canvas({
    import draw:

It is visually obvious that we can only draw a single line that goes through both points.

#set text(size: 10pt)

#canvas({
    import draw: #set text(size: 10pt)

#canvas({
    import draw:

A line is essentially just f(x) = a x + bf(x) = a x + b, which is of degree 11. We can guess and deduce we need at least n+1n+1 points to describe a polynomial function f(x) in PP_nf(x) in PP_n.

The same goes if we add a third point A_2 (0, 4)A_2 (0, 4):

#set text(size: 10pt)
#let f(x) = -(11/6) * calc.p#set text(size: 10pt)
#let f(x) = -(11/6) * calc.p

If we did not add this third point and still tried to find a polynomial P_2 (x)P_2 (x), we would have an infinite number of solutions:

#set text(size: 10pt)
#let f(x, b) = ((3 * b - 5) #set text(size: 10pt)
#let f(x, b) = ((3 * b - 5)

But now the question is: how do we actually find the polynomial that goes through all the points? This is where Lagrange interpolation comes into play.

Lagrange Interpolation

Letโ€™s take A_0 (0, 1)A_0 (0, 1), A_1 (1, 3)A_1 (1, 3), A_2 (2, 2)A_2 (2, 2) and A_3 (3, 4)A_3 (3, 4) to demonstrate this method.

#set text(size: 10pt)

#canvas({
    import draw: #set text(size: 10pt)

#canvas({
    import draw:

The main principle behind this is to split the wanted function into multiple sub-functions l_i (x)l_i (x), that each contribute to one given point, also called โ€œnodeโ€.

We want to construct l_i (x)l_i (x) so that:

l_i (x) = cases(
    space 1 "if" x = x_i,
    spal_i (x) = cases(
    space 1 "if" x = x_i,
    spa

Making a function equal to 00 at a certain point x_jx_j is as simple as multiplying it by (x - x_j)(x - x_j). Based on this property, we can already โ€œguessโ€ l_1^* (x)l_1^* (x):

l_1^*(x) = (x - 0)(x - 2)(x - 3)l_1^*(x) = (x - 0)(x - 2)(x - 3)

#set text(size: 10pt)

#canvas({
    import draw: #set text(size: 10pt)

#canvas({
    import draw:

Why does multiplying by (x-x_j)(x-x_j) add a root ?

For a function f(x) = (x - x_j) dot g(x) | g(x) <- RR[x]f(x) = (x - x_j) dot g(x) | g(x) <- RR[x], when x = x_jx = x_j:

f(x_j) = (x_j - x_j) dot g(x) = 0 dot g(x) = 0f(x_j) = (x_j - x_j) dot g(x) = 0 dot g(x) = 0

Thatโ€™s a good start! But we have a problem: l_1^* (x)l_1^* (x) is not equal to 11 at x = 1x = 1. We can fix this by dividing l_1^* (x)l_1^* (x) by l_1^* (1)l_1^* (1):

l_1 (x) &= (l_1^* (x)) / (l_1^* (1)) \
        &= l_1 (x) &= (l_1^* (x)) / (l_1^* (1)) \
        &=

And we effectively have l_1 (1) = 1l_1 (1) = 1. We can now repeat this process for l_0 (x)l_0 (x), l_2 (x)l_2 (x) and l_3 (x)l_3 (x):

#import "@preview/cetz:0.3.1": *
#import "@preview#import "@preview/cetz:0.3.1": *
#import "@preview

The polynomial f(x) in PP_3f(x) in PP_3 is computed by summing all the l_i (x)l_i (x) together and multiplying them by the corresponding y_iy_i:

f(x) &= y_0 l_0 (x) + y_1 l_1 (x) + y_2 l_2 (x) + f(x) &= y_0 l_0 (x) + y_1 l_1 (x) + y_2 l_2 (x) +

#import "@preview/cetz:0.3.1": *
#import "@preview#import "@preview/cetz:0.3.1": *
#import "@preview

And we have our final polynomial which effectively goes through all the points:

#set text(size: 10pt)

#canvas({
    import draw: #set text(size: 10pt)

#canvas({
    import draw:

Generalizing this process, we can write the function f(x) in PP_n (x)f(x) in PP_n (x) that goes through n+1n+1 points A_i (x_i, y_i)A_i (x_i, y_i):

f(x) &= sum_(i=0)^n y_i l_i (x) \
        &= sum_(f(x) &= sum_(i=0)^n y_i l_i (x) \
        &= sum_(