Reading Assignment: Advanced CNN Architectures

CNN architecture overview

Start with the high-level overview provided in this blog post by Adit Deshpande. Anything beyond the section on region-based CNNs is optional. (GANs are covered in the “Learning Generative Models” course.)

Next, read Densely connected convolutional networks (2016), Huang et al. There is a lot to learn from this paper as the authors do a very good job pointing out similarities and differences with many related approaches.

Detailed Reading as bonus

As additional reading assignment, here are 6 papers that use (and extend) CNNs in various settings. They are all worth reading, but you don’t need to read all of them.

Here are some questions to guide you during reading:

What is the learning problem? (datasets, representation of inputs and outputs, cost function and evaluation measures) Which specific challenges are addressed?
What does the network design look like? (Try to understand the details as if you were to implement the described architecture!)
What are the main innovations described in the paper? Where does it go beyond the techniques covered in the deep learning book?
What are the main experimental findings?
Does the paper have any weak spots?
Is there any information on performance (throughput) and hardware requirements?

Optional Further Reading

If you would like to dig deeper, here are some more ressources:

on ResNets: Deep residual learning for image recognition (2016), K. He et al.
on Inception architectures: Rethinking the inception architecture for computer vision (2016), C. Szegedy et al.
on 1x1 convolution: One by One [ 1 x 1 ] Convolution - counter-intuitively useful (2016), A. Prakash
on convolution arithmatic A guide to convolution arithmetic for deep learning (2016), V. Dumoulin & F. Visin