Groundbreaking Architectures

ImageNet

Over 1 million images of 1000 classes of images. Here are some models that performed extremely well with this dataset.

AlexNet (2012)

  • Pioneered usage of ReLU and Dropout to avoid overfitting

  • Had large convolutional filters (11x11)

  • By UofT

VGG (2014)

  • VGG-16, VGG-19

  • Long sequence of 3x3 convolutional, 2x2 Pooling, 3 FC layers

  • Pioneered usage of small 3x3 convolutional filters

  • By Oxford University

ResNet (2015)

  • ResNet-152

    • With the enormous # of layers, Vanishing Gradient Problem was discovered

    • They solved this by adding skip layers which shortened the route for gradients to travel (initially, the gradients had to travel the entire 152 layers which is a lot)

  • By Microsoft Research

Last updated