Thinking man

The Forgers

CS-6998 - 3D Photography - Fall 02

Team members: Alexei Masterov - David Smilowitz - Alejandro Troccoli - Sam Neymotin





In this step we perform the model-to-image(s) registration.

Problem Statement

Given a 3D model of the statue, find the rotation, translation, and camera projection parameters that map 3D points to 2D image coordinates. Assume a pinhole camera model.

Approach Taken

We used the original raw range data to perform model-to-texture registration.  The input to the program that computed the transform was the point cloud produced by the laser scanner in the Cyclone "ptx" format, an ASCII file that contains the 3D coordinates of all points in the scanner coordinate frame.  This was necessary because the original scans contained the features around the statue which were used for the registration (the statue itself contained very few easily identifiable features).
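As a concrete illustration (this is not our original loader, and the function name is ours), here is a minimal Python sketch of reading such a file; it assumes the common Cyclone PTX layout of two header lines with the column and row counts, eight lines of scanner registration and transform data, and then one point per line:

    import numpy as np

    def load_ptx_points(path):
        """Read 3D point records from an ASCII Cyclone PTX scan, assuming
        the common layout: column count, row count, scanner position,
        three scanner axes, a 4x4 transform (one row per line), then one
        point per line as "x y z intensity [r g b]". The points are
        returned in the scanner coordinate frame."""
        with open(path) as f:
            cols = int(f.readline())
            rows = int(f.readline())
            for _ in range(8):       # scanner registration + 4x4 transform
                f.readline()
            pts = [[float(v) for v in f.readline().split()[:3]]
                   for _ in range(cols * rows)]
        return np.asarray(pts)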

A user manually selects a set of corresponding points from the point cloud and the 2D image, which are used to compute a projection matrix $P$ that maps world coordinates to image coordinates. The method for computing $P$ is described in [Hartley and Zisserman]. Let $(\mathbf{X}_i, \mathbf{x}_i)$ be a pair of corresponding 3D and 2D homogeneous points, with $\mathbf{X}_i$ and $\mathbf{x}_i$ of the form $(X_i, Y_i, Z_i, W_i)$ and $(x_i, y_i, w_i)$ respectively. Each pair provides the following two equations,

\begin{displaymath}
\left[
\begin{array}{ccc}
\mathbf{0}^T & -w_i\mathbf{X}_i^T & y_i\mathbf{X}_i^T \\
w_i\mathbf{X}_i^T & \mathbf{0}^T & -x_i\mathbf{X}_i^T
\end{array}
\right]
\left(
\begin{array}{c}
\mathbf{P}^1 \\ \mathbf{P}^2 \\ \mathbf{P}^3
\end{array}
\right) = \mathbf{0},
\end{displaymath}

where each $\mathbf{P}^i$ is a row of $P$. By stacking the equations derived from a set of $n$ pairs, a $2n \times 12$ matrix $A$ is obtained. The solution vector $\mathbf{p}$ of the set of equations $A\mathbf{p} = \mathbf{0}$ contains the entries of the matrix $P$. At least 6 point correspondences are required to obtain a unique solution. In practice, an overdetermined system is used, which we solve using the SVD decomposition of the matrix $A$. Prior to solving the system of equations, both the 3D and 2D points are normalized to improve numerical conditioning [Hartley and Zisserman]. This normalization consists of a translation and a scaling step: the points are first translated so that their centroid is at the origin, and then scaled so that their RMS (root-mean-squared) distance to the origin is $\sqrt{2}$ for the 2D points and $\sqrt{3}$ for the 3D points.

Once the projection matrix $P$ is obtained, an occlusion function $V(P, T_i) \to \{0,1\}$, where each $T_i$ is a mesh triangle, is computed. The function evaluates to $1$ when $T_i$ is visible from the camera described by $P$ and to $0$ otherwise. At this point the mesh is textured by calculating the texture coordinates of every visible triangle.
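For illustration, here is a minimal sketch of the normalized DLT in Python with NumPy (this is not our original program; the function names are ours, and the input points are taken with homogeneous weights $W_i = w_i = 1$):

    import numpy as np

    def normalize_points(pts):
        """Translate points (one per row) so their centroid is at the
        origin and scale them so the RMS distance to the origin is
        sqrt(d), d being the dimension (sqrt(2) in 2D, sqrt(3) in 3D).
        Returns the similarity transform and the normalized points."""
        d = pts.shape[1]
        centroid = pts.mean(axis=0)
        rms = np.sqrt(((pts - centroid) ** 2).sum(axis=1).mean())
        s = np.sqrt(d) / rms
        T = np.eye(d + 1)
        T[:d, :d] *= s
        T[:d, d] = -s * centroid
        return T, (pts - centroid) * s

    def dlt_projection(X, x):
        """Estimate the 3x4 projection matrix P from n >= 6 pairs of 3D
        points X (n x 3) and 2D image points x (n x 2) via the normalized
        DLT of [Hartley and Zisserman]."""
        T3, Xn = normalize_points(np.asarray(X, float))
        T2, xn = normalize_points(np.asarray(x, float))
        A = []
        for (Xi, Yi, Zi), (xi, yi) in zip(Xn, xn):
            Xh = [Xi, Yi, Zi, 1.0]                 # homogeneous, W_i = 1
            A.append([0.0] * 4 + [-c for c in Xh] + [yi * c for c in Xh])
            A.append(Xh + [0.0] * 4 + [-xi * c for c in Xh])
        # The solution of A p = 0 is the right singular vector of A
        # associated with the smallest singular value.
        _, _, Vt = np.linalg.svd(np.asarray(A))
        Pn = Vt[-1].reshape(3, 4)
        # Undo the normalization: P = inv(T2) Pn T3.
        return np.linalg.inv(T2) @ Pn @ T3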


Procedure

The program displays the range data and the image side by side using the OpenGL renderer.

The user selects the points in the image and the corresponding points in the model using the mouse:

constraints_add.jpg 

After the minimum of 6 constraints has been added, the program produces the transform.

Then the user may render the mesh with the texture applied to evaluate the transform:

textured_mesh.jpg
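Texturing a visible triangle amounts to projecting its vertices with $P$ and using the normalized image position as a texture coordinate. A minimal sketch follows (function names are ours; the orientation test is only a cheap first pass at the occlusion function $V(P, T_i)$, which in full also requires a depth-buffer or ray test):

    import numpy as np

    def texture_coords(P, vertices, img_w, img_h):
        """Project mesh vertices with the 3x4 matrix P and map the pixel
        positions to [0, 1] texture coordinates (v flipped, assuming a
        top-left image origin and OpenGL's bottom-left texture origin)."""
        Xh = np.hstack([vertices, np.ones((len(vertices), 1))])
        x = (P @ Xh.T).T
        x = x[:, :2] / x[:, 2:3]                  # pixel coordinates
        return np.column_stack([x[:, 0] / img_w, 1.0 - x[:, 1] / img_h])

    def facing_camera(P, vertices, triangles):
        """First pass at V(P, T_i): a triangle can be visible only if it
        faces the camera center (assumes counter-clockwise vertex
        winding); occlusion by other triangles is not checked here."""
        _, _, Vt = np.linalg.svd(P)     # camera center C solves P C = 0
        C = Vt[-1][:3] / Vt[-1][3]      # assumes a finite camera
        visible = []
        for i, j, k in triangles:
            n = np.cross(vertices[j] - vertices[i],
                         vertices[k] - vertices[i])
            visible.append(float(np.dot(n, C - vertices[i])) > 0.0)
        return np.asarray(visible)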

If the mapping is not satisfactory, the user may choose to start the process over or add more constraints to refine the transform.  Once the mapping is satisfactory, all that is left is to compute the transform between the camera position and the model coordinate frame. We achieve this by composing the transform that represents the camera position in relation to a given point cloud with the transform that maps that point cloud to the model coordinate frame (which we obtained in the global registration phase). More on this in part 4 of this report.
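In matrix form this composition is a single multiplication of 4x4 homogeneous transforms; a sketch with hypothetical names:

    import numpy as np

    def camera_in_model_frame(model_from_scan, camera_pose_in_scan):
        """Compose the camera pose expressed in one scan's coordinate
        frame (a 4x4 matrix placing the camera within that point cloud)
        with the scan-to-model transform from the global registration
        phase; the result places the camera in the model frame."""
        return model_from_scan @ camera_pose_in_scan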

Additional Considerations

We used a Kodak DC280 digital camera, a consumer digital camera with practically no manual settings; therefore, we had to post-process the digital images to extract the maximum color information from the surface of the statue.


Figure 3-3: Captured images were manually adjusted with an image processing program to extract more color information. (Left: original image; center: level adjustment dialog; right: image with levels adjusted.)

There are a few things that could be done to improve the results. First, it would be better to use a camera with manual settings for aperture and exposure. Second, before applying more than one texture to a model, one should normalize all textures with respect to one another; one simple normalization is sketched below.
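One simple way to normalize textures against each other (a sketch of one possible approach, not something we implemented for this project) is to match per-channel statistics against a reference texture:

    import numpy as np

    def match_color_stats(src, ref):
        """Match the per-channel mean and standard deviation of one
        texture to those of a reference texture; both are float arrays
        of shape (h, w, 3) with values in [0, 1]. Full histogram
        matching would be a more faithful alternative."""
        out = np.empty_like(src)
        for c in range(3):
            s, r = src[..., c], ref[..., c]
            out[..., c] = ((s - s.mean()) / (s.std() + 1e-8)
                           * r.std() + r.mean())
        return np.clip(out, 0.0, 1.0)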

Results

Here we show the results of the model-to-image registration for different meshes. There are two approaches to achieving model-to-image registration: the first is to register each image with respect to the complete integrated surface; the second is to register each image with respect to the individual scans and then, using the appropriate transforms, map the texture to the integrated surface. We chose the latter method for two reasons.  First, most of our images were taken close to the scanner position for a given scan. Second, our individual scans contain additional background detail which our integrated mesh does not have; this background provides features that we can use as constraints during the registration.


mesh1_img002.jpg
mesh1_img008.jpg
mesh1_img015.jpg
mesh1_img040.jpg
mesh2_img005.jpg
mesh2_img039.jpg
mesh2_img11.jpg
mesh3_img12.jpg
mesh3_img042.jpg
mesh4_img003.jpg
mesh4_img009.jpg
mesh4_img041.jpg
mesh5_img006ls.jpg
mesh5_img38.jpg
mesh7_img007.jpg
mesh7_img16.jpg


[Next - Texture mapping]