Interpolation And Curve Fitting

steve_bank · Jul 12, 2021

I was going to write code for Lagrange interpolaton but found this onthe net. For some reason he made the data structure integer insteadof double like the other variables and it caused compiler warnings.Subtracting double from int, so I chaged z,y to double.

Before PCs I hadthis coded in a progamable calculator for inerpolating tables andmeasurement data.

https://www.geeksforgeeks.org/lagranges-interpolation/
http://www-classes.usc.edu/engr/ce/108/lagrange.pdf

#include <iostream>
#include "steve.h"

// C++ program forimplementation of Lagrange's Interpolation
//#include<bits/stdc++.h>
using namespace std;

// To represent adata point corresponding to x and y = f(x)
struct Data
{
double x, y;
};

// function tointerpolate the given data points using Lagrange's formula
// xi corresponds tothe new data point whose value is to be obtained
// n represents thenumber of known data points
doubleinterpolate(Data f[], double xi, int n)
{
double result =0; // Initialize result

for (int i = 0;i < n; i++)
{
// Computeindividual terms of above formula
double term= f.y;
for (int j =0; j < n; j++)
{
if (j !=i)
term= term * (xi - f[j].x) / (f.x - f[j].x);
}

// Addcurrent term to result
result +=term;
}

return result;
}

// driver functionto check the program
int main()
{
// creating anarray of 4 known data points
Data f[] = {{0,2}, {1,3}, {2,12}, {5,147} };

// Using theinterpolate function to obtain a data point
// correspondingto x=3
cout <<"Value of f(3) is : " << interpolate(f, 3, 5);

return 0;
}

lpetrich · Jul 12, 2021

That should be easy to code. For {x} to {y}, Lagrange's formula is

\( y = \sum_i \frac{\prod_{j \ne i} (x - x_j)}{\prod_{j \ne i} (x_i - x_j)} y_i \)

For less calculation and better cancellation, using Newton's divided-difference formula:

First calculate a tableau from the interpolation points:
\( y_{1,2} = \frac{y_2 - y_1}{x_2 - x_1} ,\ y_{1,2,3} = \frac{y_{2,3} - y_{1,2}}{x_3 - x_1} ,\ y_{1,2,3,4} = \frac{y_{2,3,4} - y_{1,2,3}}{x_4 - x_1} ,\ \text{etc.} \)

Then do
\( y = y_1 + y_{1,2} (x - x_1) + y_{1,2,3} (x - x_1) (x - x_2) + \dots \)

One can also do backward differences, using the largest-index end of the tableau instead of the smallest-index one.

Swammerdami · Jul 25, 2021

Just for fun, I'll write up interpolation methods used in image (or signal) processing during affine transforms (scaling, rotation, etc.). An application like Photoshop offers a pull-down menu so you can choose an interpolation method when you do an affine transform.

Interpolation can be viewed as deriving a continuous signal g(x), x is real, from a discrete signal f(j), j an integer. In practice the desired result of interpolation will be another discrete signal h(k), k an integer where h(k) = g(x).

Interpolation methods commonly used for images are, in increasing order of computational cost:
* Nearest neighbor
* Linear
* Cubic
* Cubic spline

The first two methods are too trivial to discuss. "Cubic interpolation" on images is often called "Bicubic interpolation" (and Linear called Bilinear) because the interpolation method is operated TWICE, once along an X axis, once along a Y axis. But in this post I'll just consider a single dimension.

[ETA: The above description is wrong. You do FOUR 4-point interpolations along an X-axis to get four interpolated points for input to the final (fifth) interpolation along the Y axis.]

It is easy to conflate "cubic" and "cubic spline" interpolation methods, especially since BOTH methods involve splines of cubic polynomials. But the two methods are very different.

In the cubic spline method, the SIGNAL is interpolated to be a spline of cubics; in the cubic method it is an INTERPOLATION FUNCTION which is set to a spline of cubics. I never pursued Cubic Spline Interpolation: Its computational cost seemed far too huge for processing large images quickly and, as discussed below, the higher-speed Cubic interpolation gives excellent results.

All methods except cubic spline can be described via an interpolation function, w(i). A SINGLE interpolation function is set in advance and used for all interpolations. (Unlike cubic spline interpolation where the proper cubic is computed for each pair of adjacent samples.) The derived continuous function g(x) is simply the convolution of f() and w(), i.e. for 0 < y < 1,
. . . . . g(j+y) = w(-1-y)*f(j-1) + w(-y)*f(j) + w(1-y)*f(j+1) + w(2-y)*f(j+2)
This equation shows the convolution input as having 4 points {f(j-1), f(j), f(j+1), f(j+2)} — these are the four points in the original discrete signal which are closest to f(j+y).

In nearest neighbor, linear, and cubic interpolations, the convolution input has respectively 1, 2, and 4 points.

If the signal is band-limited and sampled above the Nyquist rate, then it is a theorem of Fourier theory that the ideal interpolation function will be the

Sinc function.

Whew! After all this prelude, we see that some solutions to the interpolation problem can be viewed as simply finding an approximation to the Sinc function, and using that approximation as our interpolation function! We can't use the sinc function itself: it's infinite, with small ripples extending forever. (Conceptually, our interpolation function will be infinite, but its value will be zero everywhere outside the central kernel area.)

Images are NOT band-limited — they contain sharp edges — but it turns out the cubic interpolation method we outline works rather well on many real images.

The Interpolation function — remember that this is ideally an approximation to sinc() — is implicitly a square wave for nearest neighbor interpolation, and a triangle wave for linear interpolation. For the remainder of this post we consider ONLY cubic interpolation, where the interpolation function has width four and looks like a finite approximation to sinc(). Because the very same interpolation function is always used it can be pre-computed, at some granularity, and the convolution filter values come from table look-ups.

So: How is that function derived? I think it's rather cute.

The filter is a spline of four cubic polynomials, but it is symmetric so only two cubics are derived. A cubic ax^3 + bx^2 + cx + d has 4 degrees of freedom; two cubics mean 8 degrees of freedoms, but seven of those freedoms are made to disappear via constraints:
* We want the function to be continuous — this constraint kills 2 freedoms
* We choose the function to be differentiable everywhere! — this constraint kills 3 freedoms
* We want g(x) = f(j), i.e. the interpolation doesn't change known samples — this constraint kills 2 freedoms.

This leaves 1 degree of freedom I call "sharpness": it effects the magnitude of the side-lobes. With the usual parametric formula, the parameter 0.5 is usually used, but in the experiment described below I found that the "sharper" 0.6 worked somewhat better.

This concludes my explication of cubic interpolation. Raise your hand if this makes any sense at all!

~ ~ ~ ~ ~ ~

Now let's talk about rotating an image. (Or more generally, any affine transform with a rotational component.)

The obvious way to rotate is simply to map the target pixel to a location among the input pixels, extract the 4x4 pixel array near that location, and apply the 4-point interpolation twice, once on an X axis, once on a Y axis. Straightforward ... but there is a problem. Or rather, there was a problem in the olden days when memory was very limited.

With limited memory, we'd like to do the image processing just a few lines at a time, but when rotating by, say, 45 degrees, the input and output images do not line up. We'd have to either hopscotch through the input or hopscotch through the output.

Paeth's clever approach to the rescue! The idea is to replace Rotation with a 3-step process: Shear, Transpose by 90 degrees, Shear again. (Never mind that the Transpose has a similar problem to the hopscotching just mentioned!) This is called Paeth's Two-Shear algorithm.

Much superior, depending on the rotation angle, may be Paeth's Three-Shear algorithm, which has 5 steps: Shear, Transpose by 90 degrees, Shear, Transpose by 90 degrees, Shear. No, I'm too lazy to go into all the details!

Anyway, I did some experiments. I started with a particular image, rotated it a few dozens of times by some arbitrary angles which summed to a multiple of 360 degrees, and then compared the result with the original image. (I did this three times, using Paeth 2-Shear, Paeth 3-shear and the pristine Rotate.) Paeth 2-shear didn't do well, but with the other techniques the resultant image was very close to the original, despite that it had gone through dozens of interpolations. (It did get a bit blurry with the standard cubic parameter of 0.5, but not with 0.6.) Fun!

I'm pretty sure nobody here cares about, or even reads, these posts of mine; but it's fun for me to recollect and write about some of the things I worked with almost three decades ago!

lpetrich · Jul 25, 2021

It's three shears, and shears in alternating directions.

{{1, s}, {0, 1}} . {{1, 0}, {t, 1}} . {{1, s}, {0, 1}}

where
s = - tan(a/2)
t = sin(a)
for angle a.

Swammerdami · Jul 27, 2021

Swammerdami said:
[I've prefixed numbers to three sentences, for reference]

(1) Paeth's clever approach to the rescue! The idea is to replace Rotation with a 3-step process: Shear, ~~Transpose~~ Rotate by 90 degrees, Shear again.... This is called Paeth's Two-Shear algorithm.

(2) Much superior, depending on the rotation angle, may be Paeth's Three-Shear algorithm, which has 5 steps: Shear, Rotate by 90 degrees, Shear, Rotate by 90 degrees, Shear.

(3) I'm pretty sure nobody here cares about, or even reads, these posts of mine; but it's fun for me to recollect and write about some of the things I worked with almost three decades ago!

Evidence for the hypothesis in (3) might be partially confirmed by the following response to (1) and (2) !

lpetrich said:
It's three shears ...

That two shears (plus the 90-degree rotate) can implement rotate is shown by this matrix equation
. . . . . [1/c s/c; 0 1] [0 -1; 1 0] [c s; 0 1] = [s (s^2-1)/c; c s]
That (s^2-1)/c = -c is a well-known identity. (And yes, quibblers will note an irrelevant sign reversal. I'm using the actual matrices implied in source code mentioned below.)

If two shears can do a rotate, why do Paeth et al recommend three? I won't embark on a discussion, but I think you'll find there is a numeric issue (the interpolations become sub-optimal) vaguely related to

Gimbal lock !

An extra degree of freedom is available when using three_shear instead of two_shear. Optimizing that parameter is left as an exercise!

Just now I browsed some old source folders and found two_shear.c and three_shear.c, both timestamped Jul 20 1997. That was several laptops ago, but they've kept their timestamps during several tar cycles!

Interpolation And Curve Fitting

steve_bank

Diabetic retinopathy and poor eyesight. Typos ...

lpetrich

Contributor

Swammerdami

Squadron Leader

lpetrich

Contributor

Swammerdami

Squadron Leader