Question: If you have 10 filters that are 3×3×3 in one layer of a neural network, how many parameters does that layer have?

Answer: each filter has 3×3×3 = 27 weights plus one bias, so the layer has (27 + 1) × 10 = 280 parameters. In general, if layer l is a convolution layer with n_c filters of size f×f applied to an input with n_c_prev channels, it has (f × f × n_c_prev + 1) × n_c parameters.
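As a quick sanity check, the same count can be read off a one-layer keras model (a minimal sketch; the 32×32 spatial size is an arbitrary assumption, since only the channel count affects the parameter count):

```r
library(keras)

# One conv layer: 10 filters of size 3x3 over a 3-channel input
m <- keras_model_sequential() %>%
  layer_conv_2d(filters = 10, kernel_size = c(3, 3),
                input_shape = c(32, 32, 3))

count_params(m)  # (3 * 3 * 3 + 1) * 10 = 280
```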
LeNet-5 (LeCun et al., 1998. Gradient-Based Learning Applied to Document Recognition):

| Layer | Activation Shape | Activation Size | # Parameters |
|---|---|---|---|
| Input | (32, 32, 1) | 1024 | 0 |
| CONV1 (f=5, s=1) | (28, 28, 6) | 4704 | (5×5×1+1)×6 = 156 |
| POOL1 (f=2, s=2) | (14, 14, 6) | 1176 | 0 |
| CONV2 (f=5, s=1) | (10, 10, 16) | 1600 | (5×5×6+1)×16 = 2416 |
| POOL2 (f=2, s=2) | (5, 5, 16) | 400 | 0 |
| FC3 | (120, 1) | 120 | 400×120+120 = 48120 |
| FC4 | (84, 1) | 84 | 120×84+84 = 10164 |
| Softmax | (10, 1) | 10 | 84×10+10 = 850 |
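The architecture in the table can be rebuilt in keras to verify the counts (a sketch: average pooling and tanh activations are assumptions in the spirit of the original LeNet-5, and neither affects the parameter counts):

```r
library(keras)

lenet <- keras_model_sequential() %>%
  layer_conv_2d(filters = 6, kernel_size = c(5, 5), activation = "tanh",
                input_shape = c(32, 32, 1)) %>%       # CONV1: 156 params
  layer_average_pooling_2d(pool_size = c(2, 2)) %>%   # POOL1: 0 params
  layer_conv_2d(filters = 16, kernel_size = c(5, 5),
                activation = "tanh") %>%              # CONV2: 2416 params
  layer_average_pooling_2d(pool_size = c(2, 2)) %>%   # POOL2: 0 params
  layer_flatten() %>%                                 # (5, 5, 16) -> 400
  layer_dense(units = 120, activation = "tanh") %>%   # FC3: 48120 params
  layer_dense(units = 84, activation = "tanh") %>%    # FC4: 10164 params
  layer_dense(units = 10, activation = "softmax")     # Softmax: 850 params

summary(lenet)  # per-layer parameter counts match the table
```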
The three building blocks of a CNN:
- Convolution
- Pooling
- Fully Connected
Typical keras workflow: define the model structure, compile it, and then train it by calling the fit() method of your model.

```r
library(keras)

# Assumes input_shape, num_classes, batch_size, epochs, and the training
# arrays x_train / y_train have been defined beforehand

# Define model structure
cnn_model <- keras_model_sequential() %>%
  layer_conv_2d(filters = 32, kernel_size = c(3, 3),
                activation = "relu", input_shape = input_shape) %>%
  layer_max_pooling_2d(pool_size = c(2, 2)) %>%
  layer_conv_2d(filters = 64, kernel_size = c(3, 3), activation = "relu") %>%
  layer_dropout(rate = 0.25) %>%
  layer_flatten() %>%
  layer_dense(units = 128, activation = "relu") %>%
  layer_dropout(rate = 0.5) %>%
  layer_dense(units = num_classes, activation = "softmax")

# Compile model
cnn_model %>% compile(
  loss = loss_categorical_crossentropy,
  optimizer = optimizer_adadelta(),
  metrics = c("accuracy")
)

# Train model, holding out 20% of the training data for validation
cnn_history <- cnn_model %>%
  fit(
    x_train, y_train,
    batch_size = batch_size,
    epochs = epochs,
    validation_split = 0.2
  )
```
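Once fit() returns, the training history can be plotted and the model evaluated (a sketch assuming a held-out x_test / y_test pair exists):

```r
plot(cnn_history)                       # loss and accuracy curves per epoch

cnn_model %>% evaluate(x_test, y_test)  # test-set loss and accuracy
```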
Classic CNN architectures:
- LeNet-5: LeCun et al., 1998. Gradient-Based Learning Applied to Document Recognition
- AlexNet: Krizhevsky et al., 2012. ImageNet Classification with Deep Convolutional Neural Networks
- VGG-16: Simonyan & Zisserman, 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition
- ResNets: He et al., 2015. Deep Residual Learning for Image Recognition
Compute cost of neural architecture search (NAS) methods:
- NASNet: 1800 GPU days (≈5 years on a single GPU)
- AmoebaNet: 3150 GPU days
- DARTS: 4 GPU days
- ENAS: roughly 1000× cheaper than standard NAS