[16-889] Learning for 3D Vision, Spring 2022

Assignment 2

1. Practicing with Cameras

1.1. Fitting a voxel grid (5 points)

vox

 

1.2. Fitting a point cloud (10 points)

point

 

1.3. Fitting a mesh (5 points)

mesh

 

 

2. Reconstructing 3D from single view

2.1. Image to voxel grid (15 points)

Example 1

vox_1500iter_0

vox_1500iter_0

 

Example 2

vox_1500iter_20

vox_1500iter_20

 

Example 3

vox_1500iter_40

vox_1500iter_40

 

2.2. Image to point cloud (15 points)

Example 1

point_500iter_0

point_500iter_0

 

Example 2

point_500iter_20

point_500iter_20

 

Example 3

point_500iter_40

point_500iter_40

 

2.3. Image to mesh (15 points)

Example 1

mesh_4000iter_0

 

mesh_4000iter_0

 

Example 2

mesh_4000iter_20

mesh_4000iter_20

 

Example 3

mesh_4000iter_40

mesh_4000iter_40

 

2.4. Quantitative comparisions(10 points)

Voxel: Avg F1@0.05: 3.213

Mesh: Avg F1@0.05: 89.949

Pointcloud: Avg F1@0.05: 89.928

2.5. Analyse effects of hyperparms variations (10 points)

Changing w_chamfer

With increasing chamfer loss, the shape of the object seems to be more well defined while the one with the lowest chamfer loss weight is very noisy (corresponding connectivity of vertices are also not smooth).

mesh_4000iter_0

w_chamfer = 1

mesh_4000iter_wchamfer1

 

w_chamfer = 10

mesh_2000iter_0

w_chamfer = 100

mesh_2000iter_0_wchamfer100

 

2.6. Interpret your model (15 points)

Computing the normals of point sets is a good way to visualize the kind of surfaces the network tries to learn when predicting pointclouds. The normals were estimated based on a 10-neighborhood radius and plotted.

 

point_500iter_0

normals_pcl1

 

point_500iter_10

normals_pcl2

 

point_500iter_20

normals_pcl3