Shape and Motion from Image Streams using Factorization Method

Erika Chuang and Ulises Robles-Mellin

Occlusion

We frequently encounter situations where features appear and disappear from the image sequence. This problem is known as occlusion. The factorization method described in the previous section cannot be applied, per se, to solve this problem. The image sequences yield a matrix W which has some unknowns values (i.e., it is partially filled). Fortunately, we can explore some information in the sequences that can allow us to find the unknown entries of W. This is done by doing projections of the feature coordinates onto camera positions.

Tomasi and Kanade [1], established a condition for reconstructing an unknown image point pair (u_fpv_fp ) in frame f as follows: we can reconstruct such point pair if for 3 given frames f₁ ,f₂ ,f₃ (not necessarily consecutive) point p is visible and if we can find at least 3 more points p₁ ,p₂ ,p₃ (again, not necessarily consecutive), which are visible in all the four frames (f, f₁ ,f₂ , f₃). We will illustrate the methodology as follows. Suppose W = [U;V] is a 10 x 4 matrix, with U_{5 x 4} and V_{5
x 4} . This matrix has 2 unknown values denoted by X.

         [u₁₁u₁₂   u₁₃ u₁₄]
         [u₂₁u₂₂   u₂₃ u₂₄]
         [u₃₁u₃₂   u₃₃ u₃₄]
         [u₄₁u₄₂   u₄₃ X ]
         [u₅₁u₅₂   u₅₃ u₅₄]
         [v₁₁v₅₂   v₁₃ v₁₄]
         [v₂₁v₂₂   v₂₃ v₂₄]
         [v₃₁v₃₂   v₃₃ v₃₄]
         [v₄₁v₄₂   v₄₃ X ]
         [v₅₁v₅₂   v₅₃ v₅₄]

The first step is to use the factorization method over W_{8 x 4} . This matrix is obtained by eliminating the rows in W whose values are unknown in U and V, i.e.,

         [u₁₁u₁₂   u₁₃ u₁₄]
         [u₂₁u₂₂   u₂₃ u₂₄]
         [u₃₁u₃₂   u₃₃ u₃₄]
         [u₅₁u₅₂   u₅₃ u₅₄]
         [v₁₁v₅₂   v₁₃ v₁₄]
         [v₂₁v₂₂   v₂₃ v₂₄]
         [v₃₁v₃₂   v₃₃ v₃₄]
         [v₅₁v₅₂   v₅₃ v₅₄]

After factoring W_{8 x 4}, we get:

t _{8 x 1} = [a₁ a₂ a₃ a₅ b₁ b₂ b₃ b₅]^t

R _{8x 3} = [i₁^Ti₂^Ti₃^Ti₅^Tj₁^Tj₂^Tj₃^Tj₅^T]^T

S = [s₁s₂s₃s₄]

, which respectively represent the translation, rotation and shape generated from the submatrix W_{8 x
4 .}

Therefore, by the factorization method, the above can be expressed as: W_{8 x 3}= R_{8
x 3}S + t_{8 x 1}e₄^T , where e₄= [1,1,1,1].

In order to have the full R (rotation matrix0, we need to compute i₄and j_4, which are unknown. We first need to make the origins of i₄ and j₄ coincide by referring to the centroid c = 1/3 (s₁ + s₂ + s₃) , where the indexes in s denote the points visible in all the four frames. In frame f , the projection of the centroid c (i.e., in vector t) has its coordinates:

a₄ = 1/3 (u₄₁ + u₄₂ + u₄₃) and b₄= 1/3 (v₄₁ + v₄₂ + v₄₃) .

As can be seen, with these coordinates, we get the full vector t.

Now, we define S_p' ,U_4p' and V_4p' for p=1,2,3 (in this example) by subtracting S, U_4pand V_4p (respectively) by their coordinates with respect of their centroid (i.e., c, a₄ and b₄, respectively). We find i₄and j₄by solving:

i₄^T[ s'₁ s'₂ s'₃ ] = [ u'₄₁ u'₄₂ u'₄₃ ]
j₄^T[ s'₁ s'₂ s'₃ ] = [ v'₄₁ v'₄₂ v'₄₃ ]

From the factorization method, we have that:

u₄₄ = i₄^Ts'₄+ a₄
v₄₄ = j₄^Ts'₄+ b₄

We now have all the missing information.

This method of reconstructing a point is called row wise extension, since we propagated S over the frames. There is another method called column wise extension, in which we propagate the feature points instead.

Next:Results and Discussion Previous: Perspective Approximation Contents:Shape and Motion from Image Streams

Erika Chuang and Ulises Robles-Mellin
Last modified: Tue. Mar 14, 2000