yenchich
===================================
The visualization of grid and rays:
The visualization of the sample points of the first camera:
The visualization of the spiral rendering:
And the visualization of the depth:
Box center: (0.25, 0.25, 0.00)
Box side lengths: (2.00, 1.50, 1.50)
The visualization of the optimized volume is shown below. Left: TA-image. Right: output from my implementation.
The visualization of the trained NeRF:
NeRF with view dependence:
Trade-offs between increased view dependence and generalization quality: adding view dependece makes the model easier to memorize the pixel values and the corresponding rays. So the model might perform poorly on novel views. For instance, for a ray that directly hitting the front of the lego, model can directly copying the pixels when rendering a novel view from the back of the lego. This is not ideal as the rendering images of front and the back of the lego differ drastically.
NeRF with hierarchical sampling:
Trade-offs between speed and quality: hierarchical sampling increases the quality since we sample more relevant points along each ray when optimizing the volume. However, this requires one more forward pass of the coarse networks and additional sampling during the inverse transform sampling step. Therefore, each forward pass is 2x slower for the exchange of better quality.
NeRF with higher resolution:
I increase the number of the harmonic function for embedding of the viewing directions. I also decrease the chunk_size
to fit the model into the GPU. Other parameters such as hidden_size
of the MLP, dimension of the positional embedding, ..., etc., remains the same.