Transformation decomposition - 5.2.1 3D CAD program

5.2.1 3D CAD program

5.3 Transformation decomposition

5.3.1 Problem formulation

The coordinates of the features of the recognized objects are known in two coordinate systems, see Figure 5.1:

• Common projective frame of the scene, in this coordinate system every feature is described with its projective coordinates. This system is common for every object in the scene though contains no Euclidean information.

This information is the output of the projective reconstruction.

• Euclidean frame of an object (attached to the object) as stored in the object database. This information is object dependent and contains metrical data. Every object has its ows Euclidean frame.

Using this twofold description of the recognized features makes it possible to determine the relative Euclidean trans-formation (position, orientation) between object frames as occurs in the scene. If one of the frames is absolute (known in world reference frame), then it is possible to describe the scene in the absolute Euclidean coordinate frame.

The candidate collineations between the common projective space of the scene and the local frame of the recog-nized object are already determined as the output of verification step of the object recognition, see Section 4.4. From the members of the given cluster in consolidation (see Section 4.5) the collineation is updated from the candidates, therefore this is the most accurate estimation that is available. But there is no internal constraint that could be applied to the elements of the collineation during projective reconstruction. This means that the elements depend only on the data from which they are estimated, there is no inter-dependency between elements.

Y Z

Y X

Y X Z

H

D

Frame A

Frame B

Common projective frame

Figure 5.1: Reference frames used in the displacement calculation

However the calculation of the Euclidean transformation between objects allows introducing additional con-straints. Let the collineations of two recognized objects be HA and HB, respectively. Let us suppose, that the collineations describe the mapping from scene frame into Euclidean object frames. In this case the displacement that describes the mapping from metric frame of the object A into metric frame of object B, can be calculated as D=H⁻_B¹H_A. But matrix

D4×4=s





 Ω t z^T 1







describes a metric (Euclidean) transformation, therefore there are some constraints that must be fulfilled:

• TheΩshould be a rotation matrix

– Orthogonality condition:ΩΩ^T =Ω^TΩ=I – Non-reflection condition:|Ω|= +1

• The||z^T||²=0

• The value of s must not be too small or too large (valid scaling)

Due to noise and other disturbances the matrixΩis not a rotation matrix. The aim is to determine the “closest”

rotation matrix R to matrixΩminimizing||R−Ω||²F such that R^TR−I =0 and|R|= +1, where||||F denotes the vector compatible Frobenius norm.

5.3.2 Solution techniques

The solution can be found factorizing the matrixΩ. The possible (most commonly used) matrix factorization algo-rithms that yield orthogonal matrix are the following [93]:

• Singular Value Dedomposition (SVD) givesΩ=UΛV^T where U and V are orthogonal matrices. The draw-backs of the algorithm are:

– Small perturbation could yield very different orthogonal factorization (though the singular values remain stable).

– Theoretically there are infinite many ways as a rotation matrix can be composed from two other rotations.

• QR decomposition givesΩ=QR, where Q is an orthogonal and R is a lower-triangular matrix, respectively.

The drawback is that the given orthogonal matrix is basis-dependent.

• Polar decomposition givesΩ =RS, where R is an orthogonal and S is a symmetric positive definite matrix (will be discussed later in details). If|R|=−1 (reflection included), then the decomposition can be written into the following form:Ω=(−R)(−I)S

Using the polar decomposition let the original displacement matrix be factorized into the following form:

D4×4=s





 I 0 z^T 1







| {z }





 I t 0^T 1







| {z }







R 0

0^T 1







| {z }







V 0

0^T 1







| {z }

∆





 S 0 0^T 1







| {z }

where s is a scale, the matrices are responsible for perspectivity (P), translation (T), rotation (Θ), mirroring (∆) and stretch (transformed shear,Σ), respectively.

Using the above decomposition the constrains can be expressed as a limit on phisically meaningful quantities.

• The perspectivity must be an identity matrix, this means that||z^T||²=0.

• The mirroring must be an identity matrix: V=I.

• The stretch must be an identity matrix: S=I or in a more general case a diagonal matrix, yielding isotropically or anisotropically scaled Euclidean transformation.

• The value of s should not be too small or too large (valid scaling).

The translational part t is unconstrained.

The output of the algorithm is the Euclidean (metric) transformation between the two reference frames of the objects A and B that describes the relative position and orientation between two recognized objects. This information can directly be used in a robot control system.

5.3.3 Polar decomposition

The polar decomposition problem can be formulated as given the matrix Q∈R^N^×^N, find R∈R^N^×^Nsuch that

minR ||R−Q||²F such that R^TR−I=0 (5.1) where||||^Fdenotes a vector compatible Frobenius norm, A=[aik]→ kAk²F =P

ka²_ik=trace(A^TA).

With the notations R=(^r1r₂r₃) the constraint is equivalent to r^T₁r1 =1, r^T₁r2=0, r^T₁r3 =0, r^T₂r2=1, r^T₂r3=0, r^T₃r3 = 1, therefore six Lagrange multiplicatorsλ₁₁,λ₁₂,λ₁₃,λ₂₂,λ₂₃,λ₃₃ are needed which can be colledted in a symmetric matrixΛ.

The polar decomposition method factors the matrix Q as Q = RS, where R is an orthogonal matrix and S is symmetric positive definite matrix.

Simple computation shows that

(R^TR−I)Λ=







r^T₁r1−1 r^T₁r2 r^T₁r3

r^T₁r2 r^T₂r2−1 r^T₂r3

r^T₁r₃ r^T₂r₃ r^T₃r₃−1













λ₁₁ λ₁₂ λ₁₃ λ12 λ22 λ23

λ₁₃ λ₂₃ λ₃₃













(r^T₁r1−1)λ11+r^T₁r2λ12+r^T₁r3λ13 ⋆ ⋆

⋆ r^T₁r2λ12+(r^T₂r2−1)λ22+r^T₂r3λ23 ⋆

⋆ ⋆ r^T₁r₃λ₁₃+r^T₂r₃λ₂₃+(r^T₃r₃−1)λ₃₃







traceh

(R^TR−I)Λi

= (r^T₁r₁−1)λ₁₁+2λ₁₂r^T₁r₂+2λ₁₃r^T₁r₃+ λ22(r^T₂r2−1)+2λ23r^T₂r3+

λ33(r^T₃r3−1)

Since r^T_irj=0 ⇔ 2r^T_irj=0,∀i, j, hence the constraints can be taken into consideration by using the Langrange multiplicator rule as

traceh

(R^TR−I)Λi andΛis symmetric.

Writing the||R−Q||²Fas trace((R−Q)^T(R−Q)) and applying the Lagrange multiplier rule yields

L=trace((R−Q)^T(R−Q)+(R^TR−I)Λ) (5.2) whereΛis the Lagrange multiplier. Making the derivative operator dL() equal to the zero operator O() yields

O() = dL() = traceh

()^TR+R^T()−Q^T()−()^TQ+()^TRΛ+R^T()Λi

= traceh

2(R^T−Q^T+ΛR^T)()i

= trace [2A()] (5.3)

Since

dL()=O() ⇔ ∀X : trace(AX)=0 (5.4)

and

trace(AX)=X

(AX)(i,i)=X

e^T_i(AX)ei=X

e^T_i X

A(µ, ν)eµe^T_νXei=X

A(µ, ν)e^T_νXeµ

hence choosing X=eie^T_j it follows X

A(µ, ν)e^T_νeie^T_jeν=A( j,i)=0,∀i,j ⇔ A^T=0 and thus

R−Q+RΛ=0 Rearranging the equation

R (I+Λ)

| {z }

symmetric

=RS=Q

In order to express the elements of S with the elements of Q (eliminate the unknowns inΛ), the following form is used:

R^TR=I and R=QS⁻¹ (5.5)

A symmetric matrix has symmetric inverse, therefore

(QS⁻¹)^T(QS⁻¹)=I S⁻¹Q^TQS⁻¹=I This gives

S²=Q^TQ

which is always symmetric and positive definite (note: Q is nonsingular). Using the SVD of S²and applying the fact that S²is symmetrical yields

Q^TQ≕S^{2 S V D}−−−→S²=U_S2Σ_S2U^T_S2 and Σ_S2 =< σ1 σ2 σ3> with σ1≥σ2≥σ3≥0 (5.6) Since S is symmetrical and positive definite hence S is the positive definite square root of S²:

S=U_S2Σ

1 2

S²U^T_S₂ and Σ

1 2

S² =<+√σ1 +√σ2 +√σ3> (5.7) On the other hand the second derivative of the objective function||R−Q||²F is twice the unit matrix, 2I > 0 (posi-tive definite Hessian matrix), hence the sufficient conditon of the global optimum is also satisfied with the above S.

Therefore the global optimal solution is

R=QS⁻¹=QU_S2Σ⁻

1 2

S²U^T_S2 (5.8)

Note: if the SVD of the matrix is already given, then the polar decomposition of Q can be calculated as

Q^TQ−−−→^{S V D} UΣU^T →S=UΣ¹2U^T →R=QS⁻¹=QUΣ⁻¹2U^T (5.9)

Q−−−→^{S V D} UΣV^T →Q=(UV^T)(VΣV^T)=RS (5.10)

5.3.4 Alternative nonlinear method for transformation update

The original implementation of the system uses a nonlinear method to determine the best estimation of the Euclidean transformation that fulfills the constraints (in case of points only). Using the situation depicted in Figure 5.1 the Euclidean coordinates of a point of object B (with reference frame B) in the frame of object A can be expressed in two ways. The first one is the direct application of the collineation H_Athat maps from projective to Euclidean frame. The second one is to transform the coordinates into the frame of object B using HBthen apply D. Using the notations of the Section 4.4 the relation can be written into equation form

H_AX_jDH_BX_j (5.11)

This equation can be rewritten into the form VA = DVB. Rescaling all of the Vi in order to represent Euclidean coordinates the displacement can be calculated in closed form using quaternions, see [61], [60].

In order to put all the results together, a refinement step is also developed. The relation (5.11) can be written as equality

j=1

λH_AX_j−DH_BX_j=0

The unknowns are the elements of D, HA, HBandλ. Constraints must be introduced for D to hold the desired form.

Using the properties of the rotation matrix contained in D, the constraints are the following:

X3 k=1

D( j,k)D(l,k) = 0 j,l=1, . . . ,3 j,l orthogonality of the rows X3

k=1

D(k,j)D(k,l) = 0 j,l=1, . . . ,3 j,l orthogonality of the columns X3

k=1

D²(4,k) = 0 first three elements of last row are zero D(4,4)−1 = 0 scaling is one

These systems of equations can be minimized with Levenberg-Marquardt method. The initial values of the unknown collineations are the output of the verification (Section 4.4) and the calculation is based on quaternions for displace-ment.

K_Cam

K_Block K_m

K_E

K₀

K_Obj

K_B K_Conv

K_B K₀ K_m K_E K_Obj

K_Conv K_Block

K_Cam DBlock,Obj

Figure 5.2: Transformation graph of the robot

In document Budapest University of Technology and Economics Department of Control Engineering and Information Technology (Pldal 117-122)