16-889 Assignment 4: Neural Surfaces

Author: Zhe Huang (zhehuang@andrew.cmu.edu)

1. Sphere Tracing

Implementation description: the idea is that we iterate through z_near to z_far for every ray. We find the point where the SDF is smaller than some certain threshould $\epsilon$ and mask that ray off from the searching scheme. We repeat this process until we reach to z_far.

2. Optimizing a Neural SDF

Network Structure: harmonic functions + 6 layer of FCs + 1 output layer.

Hyperparameters: I use the default setting provided in the config, which is

  n_harmonic_functions_xyz: 4

  n_layers_distance: 6
  n_hidden_neurons_distance: 128
  append_distance: []

Eikonal loss: $$ \mathcal{L} = \frac{1}{N}\sum_{i}^{N}\|\texttt{grad}_i - 1\|^2 $$

3. VolSDF

Network Structure: the same as in 2 with the last FC layer adapted for color output.

Network hyperparameters: to learn a better model I pump up the model capacity a little bit.

  n_harmonic_functions_xyz: 6

  n_layers_distance: 9
  n_hidden_neurons_distance: 128
  append_distance: []

What the parameters $\alpha$ and $\beta$ are doing?

$\alpha$: basically the learning rate for the density.

$\beta$: basically controls the "sharpness" of the sdf_to_density function.

How does high $\beta$ bias your learned SDF? What about low $\beta$?

High $\beta$ will decrease the "sharpness" of the mesh at different locations, meaning our result will likely have lower resolution due to the insensitivity. On the other hand, low $\beta$ will make the learned SDF more unstable and may result in noncontinuity and artifacts.

Would an SDF be easier to train with volume rendering and low $\beta$ or high $\beta$? Why?

High $\beta$ will make it easier to train as the VolSDF loss will be smaller (i.e. less enforced).

Would you be more likely to learn an accurate surface with high $\beta$ or low $\beta$? Why?

Low $\beta$ will give us more chance to learn an accurate surface as the VolSDF loss is assigned a larger weight during the training.

Visualization: the following visualization results from default $\alpha$ and $\beta$ vaules.

4.2 Fewer Training Views