I am trying to do a polyfit on a set of data using the numpy package.
The following is the code, it can run successfully. The fitted line seems to fit the data when the o
TL;DR: In this case the warning means: use a lower order!
To quote the documentation:
Note that fitting polynomial coefficients is inherently badly conditioned when the degree of the polynomial is large or the interval of sample points is badly centered. The quality of the fit should always be checked in these cases. When polynomial fits are not satisfactory, splines may be a good alternative.
In other words, the warning tells you to double-check the results. If they seem fine, don't worry. But are they fine? To answer that, you should evaluate the resulting fit not only on the data points used for fitting (these often match rather well, especially when overfitting). Consider this:
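import numpy as np
import matplotlib.pyplot as plt

# x and y are the sample data from the question
xp = np.linspace(-1, 1, 10000) * 2 * np.pi  # fine grid over [-2*pi, 2*pi]
fig, ax = plt.subplots(3, 3)  # one panel per polynomial order
for n in range(3):
    for k in range(3):
        order = 20*n+10*k+1
        print(order)
        z = np.polyfit(x, y, order)
        p = np.poly1d(z)
        ax[n,k].scatter(x, y, label="Real data", s=1)
        ax[n,k].plot(xp, p(xp), label="Polynomial with order={}".format(order), color='C1')
        ax[n,k].legend()
plt.show()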
Here we evaluate the polyfit on points spaced much more finely than the sample data. This is the result:
You can see that for orders 40 and above the results really shoot off. This coincides with the warnings I get.
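If you would rather check this programmatically than by reading the console, here is a minimal sketch that records whether `np.polyfit` emits a `RankWarning` for a given order. The helper name `polyfit_raises_rank_warning` and the synthetic sine data are my own assumptions for illustration, not the data from the question. The warning is matched by class name because, as far as I know, the class lives in different places depending on the numpy version.

```python
import warnings

import numpy as np


def polyfit_raises_rank_warning(x, y, order):
    """Return True if np.polyfit emits a RankWarning for this order.

    Hypothetical helper, not part of numpy.
    """
    with warnings.catch_warnings(record=True) as caught:
        # Record every warning, even ones already shown once.
        warnings.simplefilter("always")
        np.polyfit(x, y, order)
    # Compare by name so the check does not depend on where the
    # RankWarning class is defined in the installed numpy version.
    return any(w.category.__name__ == "RankWarning" for w in caught)


if __name__ == "__main__":
    # Assumed stand-in data: a noisy sine on [-2*pi, 2*pi].
    rng = np.random.default_rng(0)
    x = np.linspace(-2 * np.pi, 2 * np.pi, 100)
    y = np.sin(x) + 0.1 * rng.standard_normal(x.size)

    for order in (1, 11, 21, 41, 61):
        flagged = polyfit_raises_rank_warning(x, y, order)
        print("order={:2d}  RankWarning={}".format(order, flagged))
```

Swap in your own `x` and `y` to see at which order the warning first appears for your data. A flagged order is a cue to inspect the fit on a fine grid as above, not proof that the fit is unusable.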