16-889 Assignment 3

Jianchun Chen jianchuc@andrw.cmu.edu


1. Differentiable Volume Rendering

1.3. Ray sampling (10 points)

xygridrayimg

1.4. Point sampling (10 points)

sampledpts

1.5. Volume rendering (30 points)

Spiral Rendering of Part 1 Spiral Rendering of Part 1

2. Optimizing a basic implicit volume

2.2 Loss and training (5 points)

Box center: 0.25, 0.25, 0.00
Box side lengths: 2.01, 1.50, 1.50

2.3. Visualization

Spiral Rendering of Part 2

3. Optimizing a Neural Radiance Field (NeRF) (30 points)

SimpleNerf

4. NeRF Extras (Choose at least one! More than one is extra credit)

4.1 View Dependence (10 pts)

NeRF w/o ray direction embedding, w/ 2,4,8,32 different frequencies of ray direction embeddings are shown below from left to right.
SimpleNerf2direncNerf4direncNerf8direncNerf32direncNerf

Idealy, more directional embedding helps predict the ray dependent color prediction, such as the specular light. However, with too many dimensions of embedding, the network would overfit the input images and fail to generalize in novel view. However, in practice I don't see a major difference with 0,2,4,8,32 dimension of ray embedding.

4.3 High Resolution Imagery (10 pts)

I render the high-res lego below. The first gif uses default parameter, the second has 4 more layer of MLPs to encode xyz position, and the third has 80 v.s. 128 number of sampled points per ray.

highresNerf
10layerhighresNerf
512ptshighresNerf

Overall, adding layers improves the expression ability of the model, but a neural net that is too deep breaks the smoothness of the implicit function. The second image looks more unsmooth. Training with more sampled points also helps, but it requires significant larger GPU memory in the inference time (e.g. 256 points for 400*400 image). And sampling 80 points per ray in image 3 gives a fair rendering result, while sampling 64 points the training will fail.