Winter 2018

Introduction to Deep Learning

 

Reading Assignment: Advanced CNN Architectures

CNN architecture overview

Start with the high-level overview provided in this blog post by Adit Deshpande. Anything beyond the section on region-based CNNs is optional. (GANs are covered in the “Learning Generative Models” course.)

Next, read Densely connected convolutional networks (2016), Huang et al. There is a lot to learn from this paper as the authors do a very good job pointing out similarities and differences with many related approaches.

Detailed Reading as bonus

As additional reading assignment, here are 6 papers that use (and extend) CNNs in various settings. They are all worth reading, but you don’t need to read all of them.

Here are some questions to guide you during reading:

  1. A convolutional neural network for modeling sentences (2014), N. Kalchbrenner et al.

  2. End-to-end Learning for Music Audio Tagging at Scale (2018), J. Pons et al.

  3. Image style transfer using convolutional neural networks (2016), L.A. Gatys et al.

  4. Image Super-Resolution Using Deep Convolutional Networks (2016), C. Dong et al.

  5. Singing voice separation with deep U-Net convolutional networks (2017), A. Jansson et al.

  6. You only look once: Unified, real-time object detection (2016), J. Redmon et al.

Optional Further Reading

If you would like to dig deeper, here are some more ressources: