match_rel_pose_ransacT_match_rel_pose_ransacMatchRelPoseRansacMatchRelPoseRansac — Compute the relative orientation between two cameras by automatically
finding correspondences between image points.
Note, that the essential matrix is a projective entity and thus is
defined up to a scaling factor. From this follows that the
translation vector of the relative pose can only be determined up to
scale too. In fact, the computed translation vector will always be
normalized to unit length. As a consequence, a subsequent
threedimensional reconstruction of the scene, using for instance
vector_to_rel_posevector_to_rel_poseVectorToRelPoseVectorToRelPoseVectorToRelPose, can be carried out only
up to a single global scaling factor.
The matching process is based on characteristic points, which can be
extracted with point operators like points_foerstnerpoints_foerstnerPointsFoerstnerPointsFoerstnerPointsFoerstner or
points_harrispoints_harrisPointsHarrisPointsHarrisPointsHarris.
The matching itself is carried out in two steps: first, gray value
correlations of mask windows around the input points in the first
and the second image are determined and an initial matching between
them is generated using the similarity of the windows in both images.
Then, the RANSAC algorithm is applied to find the relative pose
that maximizes the number of correspondences under the epipolar constraint.
To increase the speed of the algorithm, the search area for the
matchings can be limited. Only points within a window of points are considered. The offset of the
center of the search window in the second image with respect to the
position of the current point in the first image is given by
RowMoveRowMoveRowMoveRowMoverowMove and ColMoveColMoveColMoveColMovecolMove.
If the second camera is
rotated around the optical axis with respect to the first camera
the parameter RotationRotationRotationRotationrotation may contain an estimate for the
rotation angle or an angle interval in radians. A good guess will
increase the quality of the gray value matching. If the actual
rotation differs too much from the specified estimate the matching
will typically fail. In this case, an angle interval should be
specified, and RotationRotationRotationRotationrotation is a tuple with two elements. The
larger the given interval the slower the operator is since the
RANSAC algorithm is run over all angle increments within the
interval.
The parameter EstimationMethodEstimationMethodEstimationMethodEstimationMethodestimationMethod decides whether the relative
orientation between the cameras is of a special type and which algorithm is
to be applied for its computation.
If EstimationMethodEstimationMethodEstimationMethodEstimationMethodestimationMethod is either 'normalized_dlt'"normalized_dlt""normalized_dlt""normalized_dlt""normalized_dlt" or
'gold_standard'"gold_standard""gold_standard""gold_standard""gold_standard" the relative orientation is arbitrary.
Choosing 'trans_normalized_dlt'"trans_normalized_dlt""trans_normalized_dlt""trans_normalized_dlt""trans_normalized_dlt" or 'trans_gold_standard'"trans_gold_standard""trans_gold_standard""trans_gold_standard""trans_gold_standard"
means that the relative motion between the cameras is a pure translation.
The typical application for this special motion case is the
scenario of a single fixed camera looking onto a moving conveyor belt.
In order to get a unique solution in the correspondence problem the minimum
required number of corresponding points is six in the general case and three
in the special, translational case.
The relative pose is computed by a linear algorithm if
'normalized_dlt'"normalized_dlt""normalized_dlt""normalized_dlt""normalized_dlt" or 'trans_normalized_dlt'"trans_normalized_dlt""trans_normalized_dlt""trans_normalized_dlt""trans_normalized_dlt" is chosen.
With 'gold_standard'"gold_standard""gold_standard""gold_standard""gold_standard" or 'trans_gold_standard'"trans_gold_standard""trans_gold_standard""trans_gold_standard""trans_gold_standard"
the algorithm gives a statistically optimal result, and returns as well the
covariance of the relative pose CovRelPoseCovRelPoseCovRelPoseCovRelPosecovRelPose.
Here, 'normalized_dlt' and 'gold_standard' stand for
direct-linear-transformation and gold-standard-algorithm respectively.
Note, that in general the found correspondences differ depending on the
deployed estimation method.
The value ErrorErrorErrorErrorerror indicates the overall quality of the estimation
procedure and is the mean euclidian distance in pixels between the
points and their corresponding epipolar lines.
Point pairs consistent with the mentioned constraints are considered to be
in correspondences. Points1Points1Points1Points1points1 contains the indices of the
matched input points from the first image and Points2Points2Points2Points2points2 contains
the indices of the corresponding points in the second image.
For the operator match_rel_pose_ransacmatch_rel_pose_ransacMatchRelPoseRansacMatchRelPoseRansacMatchRelPoseRansac a special
configuration of scene points and cameras exists: if all 3D points lie in a
single plane and additionally are all closer to one of the two cameras then
the solution in the essential matrix is not unique but twofold.
As a consequence both solutions are computed and returned by the operator.
This means that the output parameters RelPoseRelPoseRelPoseRelPoserelPose, CovRelPoseCovRelPoseCovRelPoseCovRelPosecovRelPose
and ErrorErrorErrorErrorerror are of double length and the values of the second
solution are simply concatenated behind the values of the first one.
List of values: 'gold_standard'"gold_standard""gold_standard""gold_standard""gold_standard", 'normalized_dlt'"normalized_dlt""normalized_dlt""normalized_dlt""normalized_dlt", 'trans_gold_standard'"trans_gold_standard""trans_gold_standard""trans_gold_standard""trans_gold_standard", 'trans_normalized_dlt'"trans_normalized_dlt""trans_normalized_dlt""trans_normalized_dlt""trans_normalized_dlt"
Richard Hartley, Andrew Zisserman: “Multiple View Geometry in
Computer Vision”; Cambridge University Press, Cambridge; 2003.
Olivier Faugeras, Quang-Tuan Luong: “The Geometry of Multiple
Images: The Laws That Govern the Formation of Multiple Images of a
Scene and Some of Their Applications”; MIT Press, Cambridge, MA;
2001.