When using SVD decomposition to strengthen the constraints, the resulting Fundamental (or Essential) matrix fully meets the requirements to be considered Fundamental (or Essential). However, it is merely a matrix that is more similar under a specific norm (in this case, Frobenius) to the one obtained from the linear system.
Therefore, this solution is not optimal either, as it does not account for how the error propagates from the input points within the transformation: it is still fundamentally an algebraic solution rather than a geometric one.
A preliminary technique that minimizes geometric error involves leveraging the distance between points and the epipolar lines generated through the Fundamental matrix (epipolar distance).
Even intuitively, the distance between a point and the epipolar line
can be used as a metric to estimate the geometric error:
Since it is possible to compute this error for both the first and the second image, it is appropriate to minimize both contributions together. Through this metric, it is possible to define a cost function that minimizes the error symmetrically (symmetric transfer error) between the two images:
| (9.63) |
Alternatively to the Symmetric Transfer Error, the first-order approximation of the distance between the points and the function is often used in the literature (Sampson error, section 3.3.7).
It is possible to define an approximate distance between the homologous image points
and the variety
through the metric
| (9.64) |
| (9.65) |
Both the Symmetric Transfer Error and the Sampson distance, although superior metrics compared to the algebraic estimate, do not yield the optimal estimator. The Maximum Likelihood Estimation (MLE) for the Fundamental matrix would be obtained by using a cost function of the form
To solve this problem, it is necessary to combine the issue of calculating the Essential or Fundamental matrix with that of three-dimensional reconstruction, and to set the three-dimensional coordinate of the observed point
as the auxiliary variable, rather than the projections.
The Essential matrix can be obtained given the knowledge of the intrinsic parameters of the two sensors. In this case, it is indeed possible to exploit the nonlinear system that projects the auxiliary variable
onto the respective observations from the two sensors:
When intrinsic parameters are not available, in the case of estimating the Fundamental matrix, it is not possible to perform a true three-dimensional reconstruction of the scene due to the lack of these parameters. However, it is possible to exploit fictitious perspective projections by setting
and obtaining constraints of the form:
By incorporating the constraints (9.69) into the equation (9.66), the objective of deriving the Fundamental matrix is once again transformed into that of extracting the parameters of the projective matrix .
Through the camera matrix
, a fictitious camera matrix, it is finally possible to derive
by directly applying the definition (9.41), where, however, the matrix
is not a rotation matrix.
The maximum likelihood estimation of the fundamental matrix, corrected from a probabilistic standpoint, nonetheless requires a substantial amount of resources: in addition to the 12 global unknowns necessary to estimate (compared to the 5 of the essential matrix), for each pair of points to be minimized, 3 additional unknowns are incorporated into the problem.
Finally, as a final warning, for the optimal estimation of matrices in the presence of potential outliers in the scene, techniques such as RANSAC are widely employed (see section 3.12).
Paolo medici