Skip to content

Latest commit

 

History

History
191 lines (94 loc) · 7.26 KB

comp_photo2.md

File metadata and controls

191 lines (94 loc) · 7.26 KB

Some more Computational Photography: Merging and Blending Images using Gaussian and Laplacian Pyramids

The following problem appeared as an assignment in the coursera course Computational Photography (by Georgia Tech, 2013). The following description of the problem is taken directly from the assignment's description.

Introduction

In this article, we shall be putting together a pyramid blending pipeline that will allow us to blend any two images with a mask. The blending procedure takes the two images and the mask, and splits them into their red, green and blue channels. It then blends each channel separately.

Then a laplacian pyramid will be constructed for the two images. It will then scale the mask image to the range [0,1] and construct a gaussian pyramid for it. Finally, it will blend the two pyramids and collapse them down to the output image.

Pixel values of 255 in the mask image are scaled to the value 1 in the mask pyramid, and assign the strongest weight to the image labeled ‘white’ during blending. Pixel values of 0 in the mask image are scaled to the value 0 in the mask pyramid, and assign the strongest weight to the image labeled ‘black’ during blending.

The following figures describe the pyramid blending process with couple of sample images.

png

png

Reduce

This function takes an image and subsamples it down to a quarter of the size (dividing the height and width by two). Before we subsample the image, however, we need to first smooth out the image.

The following 5x5 generating kernel will be used for convolution, for the reduce and later for the expand function.

png

Expand

This function takes an image and supersamples it to four times the size (multiplying the height and width by two). After increasing the size, we have to interpolate the missing values by running over it with a smoothing filter.

Gaussian Pyramid

This function takes an image and builds a pyramid out of it. The first layer of this pyramid is the original image, and each subsequent layer of the pyramid is the reduced form of the previous layer. Within the code, these pyramids are represented as lists of arrays, so pyramid = [layer0, layer1, layer2, ...]. The reduce function implemented in the previous part needs to be used in order to implement this function.

Laplacian Pyramid

This function takes a gaussian pyramid constructed by the previous function, and turns it into a laplacian pyramid. Like with gaussian pyramids, laplacian pyramids are represented as lists and the expand function implemented in the previous part needs to be used in order to implement this function.

Blend

In this part, the pipeline will be set up by implementing the actual blend function, and then implementing a collapse function that will allow us to disassemble our laplacian pyramid into an output image.

Blend function takes three pyramids:

  • white - a laplacian pyramid of an image
  • black - a laplacian pyramid of another image
  • mask - a gaussian pyramid of a mask image

It should perform an alphablend of the two laplacian pyramids according to the mask pyramid. So, we need to blend each pair of layers together using the mask of that layer as the weight. Pixels where the mask is equal to 1 should be taken from the white image, pixels where the mask is 0 should be taken from the black image. Pixels with value 0.5 in the mask should be an equal blend of the white and black images, etc.

Collapse

This function is given a laplacian pyramid, and is expected to 'flatten' it to an image. We need to take the top layer, expand it, and then add it to the next layer. This results in a pyramid of one level less than we started with. We continue this process until we are left with only a single layer.

Results (some images are taken from the same Computational Photography Course)

The following figures show a few pairs of input images, the masks, the blended output images, along with the gassian and laplacian pyramids.

1. Input Images

png

Gaussian Pyramids

jpeg

Laplacian Pyramids

jpeg

Blended Laplacian Pyramids

jpeg

Blended Output Image

png

2. Input Images

png

Gaussian Pyramids

jpeg

Laplacian Pyramids

jpeg

Blended Laplacian Pyramids

png

Blended Output Image

png

3. Input Images

png

Gaussian Pyramids

png

Laplacian Pyramids

png

Blended Laplacian Pyramids

png

Blended Output Image

png

4. Input Images

png

Gaussian Pyramids

png

Laplacian Pyramids

png

Blended Laplacian Pyramids

png

Blended Output Image

png

The next example shows how the pyramid pipeline can be used to edit an image background.

5. Input Images

png

Gaussian Pyramids

png

Laplacian Pyramids

png

Blended Laplacian Pyramids

png

Blended Output Image

png

As can be seen from the output image above, the pyramid blending pipleine did a good job in blending the two images.

6. Input Images

png

Gaussian Pyramids

png

Laplacian Pyramids

png

Blended Laplacian Pyramids

png

Blended Output Image

png