- Applied various unsupervised machine learning algorithms to the CIFAR-10 dataset to denoise images.
- Focused on CNN-based approaches that perform unsupervised pre-training and learn good representations via reconstruction.
- To prevent auto-encoders from merely copying their inputs during training, denoising auto-encoders were proposed to learn representations from corrupted data.
There are broadly two types of deep learning algorithms that may be used for denoising:
- Discriminative Learning-Based using CNNs
- Generative Learning-Based using GANs
For the denoising of known noise such as Gaussian noise, CNN-based approaches make it possible to form paired training data and achieve state-of-the-art performance. They fully exploit the capacity of the network architecture to learn from data, breaking through the limitations of prior-based methods and further improving performance. GANs, on the other hand, are used when the noise is a more complex real-world noise and the dataset is small. [1]
Since our experiment uses simple Gaussian noise and the CIFAR-10 dataset has a considerable amount of data (60,000 examples), I preferred discriminative learning-based methods such as:
- Simple Denoising Autoencoders (DAE) - Reference Paper [2]
- Convolutional Auto-encoders with Symmetric Skip Connections - Reference Paper [3]
- Feed-forward Denoising Convolutional Neural Networks (DnCNNs) - Reference Paper [4]
The dataset comprises 60,000 color images in 10 classes, with 6,000 images per class. Each image is 32 x 32 pixels.
I introduced external noise, i.e. pixel-level Gaussian noise, to all input images before feeding them into the models.
We add a Gaussian noise matrix to both the training and test images with a noise factor of 0.1 and clip the result to the range [0, 1]:
# Add pixel-level Gaussian noise, then clamp back to the valid [0, 1] range
noisy_imgs = images + noise_factor * torch.randn(*images.shape)
noisy_imgs = torch.clamp(noisy_imgs, 0., 1.)
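For context, here is a minimal, self-contained sketch of this noising step applied to CIFAR-10 loaded with torchvision; the dataset root path, batch size, and variable names are illustrative assumptions:

```python
import torch
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

# Load CIFAR-10 as tensors scaled to [0, 1]; the root path is an arbitrary choice.
train_set = datasets.CIFAR10(root='data', train=True, download=True,
                             transform=transforms.ToTensor())
train_loader = DataLoader(train_set, batch_size=20, shuffle=True)

noise_factor = 0.1
images, _ = next(iter(train_loader))              # class labels are unused for denoising
noisy_imgs = images + noise_factor * torch.randn(*images.shape)
noisy_imgs = torch.clamp(noisy_imgs, 0., 1.)      # keep pixel values in [0, 1]
```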
- Encoder: 3x3 convolutional layers, with downsampling done by 2x2 max-pooling layers
- Decoder: upsampling via transposed convolutions, with layers symmetric to the encoder
- Each convolutional/deconvolutional layer is followed by a ReLU non-linearity layer
ConvDenoiser(
(conv1): Conv2d(3, 32, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(conv2): Conv2d(32, 32, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
(pool): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
(t_conv1): ConvTranspose2d(32, 32, kernel_size=(2, 2), stride=(2, 2))
(t_conv2): ConvTranspose2d(32, 32, kernel_size=(2, 2), stride=(2, 2))
(convout): Conv2d(32, 3, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
)
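Below is a minimal PyTorch sketch consistent with the printed module above; the forward pass (including the final sigmoid that keeps outputs in [0, 1]) is my reconstruction and may differ from the exact original code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ConvDenoiser(nn.Module):
    def __init__(self):
        super().__init__()
        # Encoder: 3x3 convolutions, 2x2 max-pooling for downsampling
        self.conv1 = nn.Conv2d(3, 32, kernel_size=3, padding=1)
        self.conv2 = nn.Conv2d(32, 32, kernel_size=3, padding=1)
        self.pool = nn.MaxPool2d(2, 2)
        # Decoder: transposed convolutions for upsampling, symmetric to the encoder
        self.t_conv1 = nn.ConvTranspose2d(32, 32, kernel_size=2, stride=2)
        self.t_conv2 = nn.ConvTranspose2d(32, 32, kernel_size=2, stride=2)
        self.convout = nn.Conv2d(32, 3, kernel_size=3, padding=1)

    def forward(self, x):
        x = self.pool(F.relu(self.conv1(x)))       # 32x32 -> 16x16
        x = self.pool(F.relu(self.conv2(x)))       # 16x16 -> 8x8
        x = F.relu(self.t_conv1(x))                # 8x8   -> 16x16
        x = F.relu(self.t_conv2(x))                # 16x16 -> 32x32
        return torch.sigmoid(self.convout(x))      # map back to 3 channels in [0, 1]
```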
Hyperparameters: batch size = 20, number of epochs = 40, learning rate = 0.001.
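For reference, a minimal training-loop sketch consistent with these hyperparameters; the MSE reconstruction loss and Adam optimizer are assumptions, and it reuses the ConvDenoiser and train_loader from the sketches above:

```python
import torch
import torch.nn as nn

model = ConvDenoiser()
criterion = nn.MSELoss()                                     # reconstruction loss (assumed)
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)

for epoch in range(40):
    for images, _ in train_loader:
        noisy = torch.clamp(images + 0.1 * torch.randn_like(images), 0., 1.)
        output = model(noisy)
        loss = criterion(output, images)                     # compare reconstruction to the clean image
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```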
On test data of 10,000 examples:
- Average PSNR: 24.830
- Average SSIM: 0.868
- Encoder: 3x3 convolutional layers, with downsampling done using a stride of 2 instead of pooling, since pooling can be harmful to image restoration tasks
- Decoder: upsampling via deconvolutional layers, with layers symmetric to the encoder
- The corresponding encoder and decoder layers are connected by shortcut connections (see the sketch after this list)
- Each convolutional/deconvolutional layer is followed by a Batch Normalization layer and a ReLU non-linearity layer
- Number of encoder/decoder layers: 15
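Below is an illustrative PyTorch sketch of the symmetric skip connections, reduced to two encoder/decoder stages rather than the 15 layers actually used; the channel width and the additive skips are assumptions in the spirit of [3]:

```python
import torch.nn as nn
import torch.nn.functional as F

class SkipAutoencoderSketch(nn.Module):
    def __init__(self, ch=64):
        super().__init__()
        # Encoder: stride-2 convolutions replace pooling
        self.enc1 = nn.Conv2d(3, ch, 3, stride=2, padding=1)
        self.bn1 = nn.BatchNorm2d(ch)
        self.enc2 = nn.Conv2d(ch, ch, 3, stride=2, padding=1)
        self.bn2 = nn.BatchNorm2d(ch)
        # Decoder: transposed convolutions, symmetric to the encoder
        self.dec2 = nn.ConvTranspose2d(ch, ch, 3, stride=2, padding=1, output_padding=1)
        self.dec1 = nn.ConvTranspose2d(ch, 3, 3, stride=2, padding=1, output_padding=1)

    def forward(self, x):
        e1 = F.relu(self.bn1(self.enc1(x)))        # 32x32 -> 16x16
        e2 = F.relu(self.bn2(self.enc2(e1)))       # 16x16 -> 8x8
        d2 = F.relu(self.dec2(e2) + e1)            # skip: add the matching encoder feature map
        return self.dec1(d2) + x                   # skip: add the input, so the net learns the residual
```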
Hyperparameters: batch size = 32, number of epochs = 40, learning rate = 0.001.
On test data of 10,000 examples:
- Average PSNR: 28.254
- Average SSIM: 0.938
- Network architecture modified from VGG
- Integration of residual learning and batch normalization
- Convolutional filters of size 3 × 3, with all pooling layers removed. Three types of layers are used (a minimal sketch follows this list):
- Conv+ReLU: generates feature maps
- Conv+BN+ReLU: batch normalization is incorporated to speed up training as well as boost denoising performance
- Conv: reconstructs the output
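A minimal DnCNN-style sketch of this layer layout; the depth (17) and width (64) are the common defaults from [4] and are assumptions rather than the exact values used in this experiment:

```python
import torch.nn as nn

def build_dncnn(depth=17, channels=64):
    # First layer: Conv+ReLU generates feature maps
    layers = [nn.Conv2d(3, channels, kernel_size=3, padding=1), nn.ReLU(inplace=True)]
    # Middle layers: Conv+BN+ReLU
    for _ in range(depth - 2):
        layers += [nn.Conv2d(channels, channels, kernel_size=3, padding=1),
                   nn.BatchNorm2d(channels),
                   nn.ReLU(inplace=True)]
    # Last layer: plain Conv reconstructs the output (the predicted noise residual)
    layers.append(nn.Conv2d(channels, 3, kernel_size=3, padding=1))
    return nn.Sequential(*layers)

# Residual learning: the network predicts the noise, so the denoised image is
#   denoised = noisy - build_dncnn()(noisy)
```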
Hyperparameters: batch size = 32, number of epochs = 40, learning rate = 0.001.
On test data of 10,000 examples:
- Average PSNR: 28.992
- Average SSIM: 0.947
SSIM (Structural Similarity Index Measure) estimates the degradation of structural similarity based on the statistical properties of local information between a reference and a distorted image. It combines three local similarity measures based on luminance, contrast, and structure.
PSNR (peak signal-to-noise ratio) is the ratio between the maximum possible value (power) of a signal and the power of the distorting noise that affects the quality of its representation. The main limitation of this metric is that it relies strictly on numeric comparison and does not take into account any biological factors of the human visual system, unlike the structural similarity index (SSIM).
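As a concrete illustration, both metrics can be computed per image with scikit-image (note: the channel_axis argument is called multichannel in older scikit-image releases):

```python
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def evaluate_pair(clean, denoised):
    """PSNR and SSIM for one HxWx3 image pair with pixel values in [0, 1]."""
    psnr = peak_signal_noise_ratio(clean, denoised, data_range=1.0)
    ssim = structural_similarity(clean, denoised, data_range=1.0, channel_axis=-1)
    return psnr, ssim
```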
- [1] J. Chen, J. Chen, H. Chao and M. Yang, "Image Blind Denoising with Generative Adversarial Network Based Noise Modeling," 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, 2018, pp. 3155-3164, doi: 10.1109/CVPR.2018.00333.
- [2] Ali, Irfan, Lashari, Haque, Hassan, Syed, Maitlo, Abdullah and Qureshi, Basit. (2018). "Image Denoising with Color Scheme by Using Autoencoders." 18, 158-161.
- [3] Mao, Xiaojiao, Chunhua Shen and Yu-Bin Yang. "Image restoration using very deep convolutional encoder-decoder networks with symmetric skip connections." Advances in Neural Information Processing Systems, 2016.
- [4] Zhang, K., Zuo, W., Chen, Y., Meng, D. and Zhang, L. (2017). "Beyond a Gaussian denoiser: Residual learning of deep CNN for image denoising." IEEE Transactions on Image Processing, 26(7), 3142-3155.
- Fan, L., Zhang, F., Fan, H. et al. Brief review of image denoising techniques. Vis. Comput. Ind. Biomed. Art 2, 7 (2019). https://doi.org/10.1186/s42492-019-0016-7
- Gu, Shuhang, and Radu Timofte. "A brief review of image denoising algorithms and beyond." Inpainting and Denoising Challenges. Springer, Cham, 2019. 1-21.