It uses gradients as the features rather than the raw color data. It constructs an image pyramid, halving the image size until it's less than 400 pixels wide. The low res version is aligned first, searching within 1/8 image size offset. At each subsequent level in the pyramid, it searches within a 2x2 window adjustment from the previous more crude alignment.
It takes about 12 seconds per image.
 
 
 
 
 
 
 
 
 
