Assignment #2
3D Learning
Target
Result
Target
Result
Target
Result
Voxel F1 at 0.05 threshold: 34.3%
Point F1 at 0.05 threshold: 88.2%
Mesh F1 at 0.05 threshold: 81.5%
The pointcloud reconstruction has the highest F1 score, slightly above mesh reconstruction and both significantly more than the voxel reconstruction. This heirachy makes sense for models that output similar qualitativ results (granted, of different variety). This is because the chamfer loss used to guide pointcloud reconstruction almost exactly reduces the F1 score through it's objective. Thus, it is reasonable that the point cloud F1 score is the highest.
Mesh similarly uses the chamfer loss, although is further guided by a smoothing loss on the surface representation. This additional constraint on the loss explains the slightly deteriorated loss relative to point cloud reconstruction. Additionally, the meshes are reproduced through the deformation of a sphere. This the mesh is not able to accurately reconstruct shapes other than genus 0, which contributes to error in the reconstruction.
Finally, the voxel F1 score has a significantly lower F1 score. This can be ascribed to the relatively coarse nature of the voxel representation relative to the reconstruction domain. Specifically, voxels are predicted at 32x32x32 over a scene in the range [-1,1]^3. This means the edge of the voxels are 0.0625, and the corresponding pointclouds sampled from this voxel mesh are approximately as course (minus smoothing from mcube) . Since the F1 score is being calculated at a threshold distance of 0.05, it is reasonable that the voxel reconstruction performs poorly under this metric, since it is only capable of expressing shape at a coarser resolution.
Ground Truth vs Reconstruction over 19200 training samples