Let be a function where are the independent variables and are the p parameters lying in the domain A. If A is not the whole space , the problem is said to be constrained.
If a situation can be observed by a set of events , i.e. a set of couples representing the measured dependant and variables, it is possible to deduce the value of the parameters of your model corresponding to that situation. As the measurements are generally given with some error, it is impossible to get the exact value of the parameters but only an estimation of them. Estimating is in some sense finding the most likely value of the parameters. Much more events than parameters are in general necessary.
In a linear problem, if the errors on the observations have a gaussian distribution, the ``Maximum Likelihood Principle'' gives you the ``best estimate'' of the parameters as the solution of the so-called ``Least Squares Minimization'' that follows:
with
The expected variance of the so--computed estimator is minimum among all approximation methods and is therefore called in statistics an ``efficient estimator''.
The quantities
are named the residuals and the weight of the observation that can be, for instance, the inverse of the computed variance of the observation.
If depends linearly on each parameter , the problem is also known as a Linear Regression and is solved in MIDAS by the command REGRESSION. This chapter deals with which have a non-linear dependance in a.
Let us now introduce some mathematical notations. Let and be respectively the gradient and the Hessian matrix of the function . They can be expressed by
where is the residuals vector
the Jacobian matrix of i.e.
and is
with , the Hessian matrix of .
In the rest of the chapter, all the functions are supposed to be differentiable if they are applied the derivation operator even when this condition is not necessary for the convergence of the algorithm.
A certain number of numerical methods have been developed to solve the
non--linear least squares problem, four have so far been implemented in
MIDAS. A complete description of these algorithms can be found in [1]
and [3], the present document will only give a basic introduction.