Submitted for 16-889 course project by Aarush Gupta (aarushg3), Ayush Jain (ayushj2) and Mayank Singh (mayanks2).
The leftmost gifs are from the pixelNeRF baseline. The center gifs are from the pixelNeRF model trained with augmented training dataset and RGB loss. The rightmost gifs are from the pixelNeRF model trained with augmented training dataset and RGB + Perceptual loss.
Results on Lego scene:
DietNeRF baseline
Depth consistent DietNeRF
Results on Drums scene:
DietNeRF baseline
Depth consistent DietNeRF
When the DietNeRF model is trained on 8 views, we qualitatively observe that the depth consistency loss helps in learning better depth maps.
SimpleNeRF (baseline)
DistillNeRF (RGB MSE)
DistillNeRF (RGB + Depth MSE)
DistillNeRF (Raw MSE)
DistillNeRF (RGB + Depth + Huber loss)
We notice that DistillNerf with RGB + Depth MSE gives slightly better results in few frames. Please check Fig. 4 in report for more discussion on this.