CS 754 - Advanced Image Processing (aka Inverse Problems in Image Processing)



Course Information

Course Description

Besides being just two- or three-dimensional arrays containing numbers (pixel intensity values), the images that are seen and acquired in everyday life have a number of very interesting properties. For example, small patches from different spatial regions of an image are quite similar to each other. Another example is that these images when transformed by certain kinds of matrix operations (eg: wavelet or discrete cosine transform) produce coefficients that are very sparse (i.e. most of them are zero or close to zero in value). These properties are a key to developing efficient and accurate algorithms for a variety of applications such as image noise removal, blur removal, image separation, compression and even image-based forensics. Not just that, these properties have also inspired the design of new sensors to acquire images much faster (a technology called compressed sensing) which is really crucial in tasks involving video or some types of images in medicine (such as magnetic resonance imaging - MRI). This course will systematically study several such techniques motivated by these interesting properties of natural images. The course will explore some key theory (including the proofs of some truly beautiful theorems), algorithms, as well as many applications. It will expose students to a broad range of modern, state-of-the-art techniques in image processing. A more detailed (tentative) syllabus can be accessed here. Many of the problems we will deal with in this class fall in the category of inverse problems, i.e. problems where the number of unknowns seems to exceed the number of knowns, at superficial glance, and there is also noise or system matrix contamination.

Need for the course Image Processing Applications

The course will cover principled techniques that can be applied to some interesting image processing problems:

Intended Audience and Pre-requisites

Intended for 3rd, 4th year B Tech students, DDP students, and all PG students (MTech/PhD) from CSE/EE/EP/CSRE/Earth Sciences. You must have taken CS 663 or EE 610 or CS 725, otherwise you must discuss with me whether you have suitable background for this course.

Textbooks

Resources


Grading scheme (tentative)

Webpages of Previous Offerings


Detailed Schedule

Date

Content of the Lecture

Assignments/Readings/Notes

3rd Jan (Thu)
  • Course overview
  • Slides (check moodle)
7th Jan (Mon) Compressed Sensing
  • Compressed sensing (CS): introduction and motivation
  • Review of DFT and DCT, Review of JPEG
  • Sparsity of natural images in transform bases
  • Candes, Romberg, Tao: puzzling experiment; Basic optimization problem for CS involving the total variation
  • Concept of incoherence between sensing matrix and representation basis
  • Slides (check moodle)
  • E. Candes and M. Wakin, "Introduction to Compressive sampling", IEEE Signal Processing Magazine, 2008
10th Jan (Thu)
  • Shannon's sampling theorem and the Whittaker-Shannon Interpolation Formula
  • Basic CS optimization problem: comments on the uniqueness of its solution
  • Basic theorem of compressed sensing involving coherence (due to Candes, Romberg, Tao)
  • Intuition behind role of incoherence in CS
  • Slides (check moodle)
  • E. Candes and M. Wakin, "Introduction to Compressive sampling", IEEE Signal Processing Magazine, 2008
14th Jan (Mon)
  • Restricted isometry property
  • Basic theorems of CS: reconstruction of compressible signals
  • Compressed sensing under noise (theorem 3): choice of regularization parameter under different noise models
  • The role of randomness
  • Basis pursuit and its efficiency - Basis pursuit as a linear programming problem
  • Intuitive explanation: benefits of L1 or L2 penalty for signal sparsity
  • Toy examples of CS results
  • Slides (check moodle)
  • E. Candes and M. Wakin, "Introduction to Compressive sampling", IEEE Signal Processing Magazine, 2008
  • J. Romberg, "Imaging via Compressive sampling", IEEE Signal Processing Magazine, 2008
17th Jan (Thu)
  • Sketch of a conventional camera
  • Rice single pixel camera and its patchwise variant from Stanford
  • Video version of the Rice SPC
  • Conventional hyperspectral camera
  • Architecture of CASSI: Coded aperture snapshot spectral imager
  • Slides (check moodle)
  • Duarte et al, "Single-pixel imaging via Compressive sampling", IEEE Signal processing magazine, 2008
  • Kittle et al, "Multiframe image estimation for coded aperture snapshot spectral imagers", Applied Optics, 2010
21st Jan (Mon)
  • Architecture of CASSI: Coded aperture snapshot spectral imager
  • Concept for color filter arrays (CFA), Comparison of CASSI with color image demosaicing
  • Video compressive sensing using coded snapshots
  • Slides (check moodle)
  • Kittle et al, "Multiframe image estimation for coded aperture snapshot spectral imagers", Applied Optics, 2010
  • Hitomi et al, "Video from single exposure coded snapshots", ICCV 2011
24th Jan (Thu)
  • Video compressive sensing using coded snapshots
  • Compressed sensing in MRI: concept of different MRI trajectories
  • CS algorithms: matching pursuit (MP) and orthogonal matching pursuit (OMP)
  • Slides (check moodle)
28th Jan (Mon)
  • CS algorithms: ISTA (iterative shrinkage/thresholding algorithm)
  • Concept of mutual coherence and its relationship with the restricted isometry property/constant via Gershgorin's disc theorem
  • Comparison between mutual coherence and RIC
  • Theorem for performance bounds using mutual coherence
  • Slides (check moodle)
31st Jan (Thu)
  • RIC interpreted in terms of singular values, relationship between mutual coherence and RIC
  • CS bounds based on mutual coherence (Theorem 5)
  • CS bounds based on RIC of order s (as opposed to 2s): Theorem 6
  • Compressive classification: maximum likelihood classifier, matched filter, smashed filter, relation to RIP
  • Sketch of proof of Theorem 3
  • Slides (check moodle)
  • Candes, "The restricted isometry property and its implications for compressed sensing", CRM 2008
  • Davenport et al, "The smashed filter for compressive classification and target recognition", SPIE 2007
4th Feb (Mon)
  • Theorem 4: CS for piecewise constant signals
  • CS with tight frames
  • Clarification regarding RIP of Random Discrete Fourier Measurement Matrices

Tomography
  • Concept of tomography and tompgraphic projection/Radon transform
  • Concept of backprojection and its limitations
  • Generations of CT (computed tomography) machines
  • Slides (check moodle)
7th Feb (Thur)
  • Fourier slice theorem in tomography
  • Concept of filtered backprojection and its relationship to (unfiltered) backprojection
  • Tomography as a compressed sensing problem
  • Slides (check moodle)
9th Feb (Sat, extra lecture)
  • Tomography as a compressed sensing problem
  • Tomography under unknown angles: motivation in cryo-EM, algorithms for 2D images
  • Moment based approach - Helgason Ludwig consistency conditions (2D images)
  • Concept of micrograph and particle picking in cryo-EM
  • Slides (check moodle)
11th Feb (Mon)
  • Moment-based algorithm (2D images)
  • Ordering-based algorithm: nearest neighbor algorithm (2D images)
  • Slides (check moodle)
15th Feb (Thu)
  • Graph-laplacian algorithm for dimensionality reduction (due to Belkin and Niyogi); its application to tomography under unknown angles
  • PCA-based algorithm for denoising of tomographic projections
  • Slides (check moodle)
21st Feb (Thu)
  • PCA-based algorithm for denoising of tomographic projections; derivation of a Wiener filter in the transform domain
  • Tomography under unknown angles in 3D (i.e. 2D projection images): the concept of common lines, algorithm for finding common lines in spatial or Fourier domain
  • Algorithm for finding unknown angles and underlying 3D structure from 2D projections under unknown angles - case of 3 projections only
  • Orthogonal procrustes algorithm
  • Slides (check moodle)
4th March (Mon)
  • Distribution of midsem papers and discussion of solutions
  • Algorithm for finding unknown angles and underlying 3D structure from 2D projections under unknown angles - case of 3 projections only
  • Orthogonal procrustes algorithm
  • Algorithm for finding unknown angles (based on semi-definite programming) and underlying 3D structure from 2D projections under unknown angles - case of N projections
  • Slides (check moodle)
7th March (Thurs) Dictionary Learning
  • Introduction to overcomplete dictionaries
  • Concept of overcomplete dictionaries: sinusoids and spikes examples
  • Issues of uniqueness and sparsity of representation in overcomplete dictionaries, dictionary learning as a generalization of K-means
  • Principal Components Analysis: Eigenfaces; relationship between DCT and PCA
  • Slides (check moodle)
11th March (Mon)
  • A basic algorithm for simultaneously obtaining dictionary and sparse code - using projected gradient descent with adaptive step-size
  • Method of Optimal Directions (MOD)
  • Union of Orthonormal Bases
  • KSVD method
14th March (Thurs)
  • KSVD method
  • KSVD method for compression, denoising, inpainting
18th March (Mon)
  • Blind compressed sensing
  • Compressive KSVD algorithm
  • Requirement for blind compressed sensing: diversity of measurement matrices
  • Fisher's linear discriminant: case of 2 classes
25th March (Mon)
  • Fisher's linear discriminant: case of 2 classes
  • Fisher's linear discriminant: case of more than 2 classes
  • Limitations of FLD, introduction to mutual information for classification, concept of mutual information and its properties
  • Mutual information for optimal transform learning for classification
  • Kernel density estimation
  • Optimization of quadratic mutual information
28th March (Thu) Low Rank Matrix Recovery and Beyond
  • Ubiquitousness of low rank matrices in image processing and machine learning: recommender systems, image patches assembled to form a matrix, distance matrices, applications in structrue from motion, applications in face recognition
  • The problem of low rank matrix completion
  • Informal statement of the basic theorem, concept of coherence of subspaces, sufficient conditions for successful recovery and pathological cases
  • Formal statement of theorem of low rank matrix completion; extension to the case of noisy matrix completion
  • Empirical results for low rank matrix completion
  • Concept of low rank matrix recovery
  • Basic theorems for low rank matrix recovery, concept of RIP of linear maps/operators, relation between low rank matrix recovery and low rank matrix completion
  • Slides (check moodle)
1st April (Mon)
  • Introduction to Robust Principal Components Analysis (PCA)
  • Motivating examples for RPCA: background subtraction in videos, specularity and shadow removal from face images
  • Basic theorem for RPCA; extension to the case when there is noise in observed matrix
  • Basic theorem for RPCA with incompletely observed entries
  • Singular Value Thresholding Algorithm (SVT) for matrix completion
  • Slides (check moodle)
4th April (Thurs)
  • Introduction to Robust Principal Components Analysis (PCA)
  • Motivating examples for RPCA: background subtraction in videos, specularity and shadow removal from face images
  • Basic theorem for RPCA; extension to the case when there is noise in observed matrix
  • Basic theorem for RPCA with incompletely observed entries
  • Singular Value Thresholding Algorithm (SVT) for matrix completion
  • Slides (check moodle)
8th April (Mon) Inverse Problems using Deep Neural Networks
  • Inverse problems from a Bayesian perspective; limitations of analytical models
  • Deep neural networks for inverse problems: introduction
  • Multi-layer perceptrons; Image denoising using MLPs
  • Motivation for convolutional neural networks; their use for inverse problems
  • Encoder-decoder CNNs
  • Auto-encoders for super-resolution
11th April (Thurs) Low Rank Matrix recovery and Beyond
  • Compressive RPCA: Greedy Algorithm by Waters et al, applications in video and hyperspectral CS, and RPCA with missing entries
  • Basic Theorem for Compressive RPCA

Statistics of Natural Images
  • Power law
  • Correlation of pixel values in natural images
  • Statistics of DCT coefficients of natural images: laplacian models and justification for the same
  • Brief introduction to the Haar wavelet transform
  • Slides (check moodle)
15th April (Mon)
  • Brief introduction to the Haar wavelet transform
  • Revision of basic Bayesian statistics, concept of MAP and MMSE, posterior probabilities with gaussian likelihood and Lapalcian and Gaussian priors
  • Statistics of wavelet coefficients of natural images, use of dependencies between wavelet coefficients as priors for image denoising and in compression
  • Compressed sensing based on statistical priors
  • Slides (check moodle)
18th April (Thu)
  • Statistics of natural image categories: natural, man-made,etc. Image and scene scale
  • A semi-automated method for reflection removal using statistics of image gradients: Iteratively reweighted least squares algorithm
  • Results of the IRLS algorithm for reflection removal, mixture of Laplacian prior for image gradients
  • Slides (check moodle)