
[HELP] Linear regression with errors #103

Open
mpound opened this issue Mar 4, 2021 · 5 comments
mpound commented Mar 4, 2021

Hi Samuel. I have run through the notebooks modopt_example.ipynb, inverse_problems_1.ipynb, and sparsity_1.ipynb in the CosmoStat/Tutorials ada branch, playing with the parameters to improve my understanding. I am wondering how to take the next step of fitting the curve when there are known random errors in y (i.e. a weighted fit). Is it a term added to the cost function? Is there a ModOpt class to handle this?
thanks,
Marc

mpound commented Mar 5, 2021

OK, H changes to H = (X^T W X)^{-1} X^T W, where W is the (diagonal) matrix of weights.
I'm not yet reproducing the correct answer with W = I, but I think the above is correct.
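A quick sanity check of that formula in plain NumPy (my own toy data, not from the notebooks): with W set to the identity, the weighted estimator should reduce to the ordinary normal equation:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 20
t = np.linspace(0, 1, n)
X = np.column_stack([np.ones(n), t])        # design matrix for y = a + b*t
y = 2.0 + 3.0 * t + rng.normal(0, 0.1, n)   # noisy line

# Ordinary least squares: (X^T X)^{-1} X^T y
beta_ols = np.linalg.pinv(X.T @ X) @ X.T @ y

# Weighted least squares with W = I should give the same answer
W = np.eye(n)
beta_wls = np.linalg.pinv(X.T @ W @ X) @ X.T @ W @ y

assert np.allclose(beta_ols, beta_wls)
```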

sfarrens commented Mar 8, 2021

Hi @mpound, apologies for the delay in addressing this issue. I was not available last week.

I have not really thought about a weighted-fit example. I mainly use these basic regression problems as a very simple way to illustrate how convex optimisation works. I assume that, given the points in x and y and the weights, the problem can still be solved analytically, in which case it would simply be a matter of defining the gradient appropriately in ModOpt. I will try to find some time to play with this.
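For reference, the weighted cost would be 0.5 * ||W^{1/2} (X w - y)||^2, whose gradient is X^T W (X w - y). A rough plain-NumPy sketch of a gradient-descent solver (not the ModOpt API, just to show the idea; the data here are made up):

```python
import numpy as np

def wls_grad(X, W, y, w):
    # Gradient of 0.5 * ||W^{1/2} (X @ w - y)||^2 with respect to w
    return X.T @ W @ (X @ w - y)

def wls_gradient_descent(X, W, y, n_iter=3000):
    # Fixed step size from the Lipschitz constant of the gradient
    step = 1.0 / np.linalg.norm(X.T @ W @ X, 2)
    w = np.zeros(X.shape[1])
    for _ in range(n_iter):
        w -= step * wls_grad(X, W, y, w)
    return w

# Toy heteroscedastic data: the second half of the points is noisier
rng = np.random.default_rng(1)
t = np.linspace(0, 1, 20)
X = np.column_stack([np.ones_like(t), t])
sigma = np.where(t > 0.5, 0.3, 0.1)
y = 1.0 + 2.0 * t + rng.normal(0, sigma)
W = np.diag(1.0 / sigma**2)

w_gd = wls_gradient_descent(X, W, y)
# Should converge to the analytic weighted solution
w_exact = np.linalg.pinv(X.T @ W @ X) @ X.T @ W @ y
```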

mpound commented Mar 8, 2021

Hi Samuel. Yes, the simple regression problem could be solved in other ways. I am trying to teach myself with the simple, understandable case before moving on to more complex ones. I am not used to thinking of things in linear-algebra terms, so I have to familiarize myself with that. Ultimately, my goal is to fit a combination of linear functions at every pixel in a spatial map, adding regularization to ensure a spatially smooth solution.

The science case is fitting an H_2 excitation diagram with two linear functions of temperature at every pixel, given spatial maps of line intensity. The fit can be sensitive to measurement errors (it is ill-conditioned), so I'm hoping the ModOpt package can help.
Example: (attached image: two-temperature excitation-diagram fit)

sfarrens self-assigned this Mar 8, 2021
sfarrens commented Mar 8, 2021

@mpound That makes sense. I will try to find some time to add a simple example using weighted data points so that you can have a better idea of how to manipulate the appropriate ModOpt objects for your application. This could be helpful for other users also.

mpound commented Mar 9, 2021

@sfarrens Here is a sample method for applying weights to a fit. I tested it on the linear and polynomial fit examples in inverse_problems_1.ipynb.

    import numpy as np

    # Implements the weighted normal equation for a design matrix H,
    # a diagonal weight matrix W, and data points y,
    # e.g. W = np.diag(1 / sigma_y ** 2) for measurement errors sigma_y.
    def normal_eq_wt(H, W, y):
        q = H.T @ W @ H
        # pinv handles the (possibly ill-conditioned) inversion
        qinv = np.linalg.pinv(q)
        return qinv @ H.T @ W @ y
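And a usage sketch on made-up data (repeating the function so the snippet is self-contained): weighting down the noisy points recovers the underlying line, and with W = I the result matches np.linalg.lstsq:

```python
import numpy as np

def normal_eq_wt(H, W, y):
    # Weighted normal equation: (H^T W H)^{-1} H^T W y
    return np.linalg.pinv(H.T @ W @ H) @ H.T @ W @ y

rng = np.random.default_rng(42)
t = np.linspace(0, 1, 50)
H = np.column_stack([np.ones_like(t), t])   # design matrix for y = a + b*t
sigma_y = np.full_like(t, 0.05)
sigma_y[40:] = 0.5                          # last ten points are much noisier
y = 1.0 + 2.0 * t + rng.normal(0, sigma_y)

W = np.diag(1.0 / sigma_y**2)
coeffs = normal_eq_wt(H, W, y)              # should be close to [1, 2]

# Unweighted case agrees with numpy's least-squares solver
unweighted = normal_eq_wt(H, np.eye(len(t)), y)
assert np.allclose(unweighted, np.linalg.lstsq(H, y, rcond=None)[0])
```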
