Simple problem:
Predict if Bob will like a movie given Alice's grade
Hypothesis:
A simple threshold
Simple problem:
Predict if Bob will like a movie given Alice's and Carol's grades
Hypothesis:
An affine function
More complicated problem:
Predict if Bob will like a movie given a large user database
Hypothesis:
An affine function
More complicated problem:
Predict if Bob will like a movie given a large user database
Hypothesis:
A non-affine function?
A simple model of the neuron: activation level is the weighted sum of the inputs
Simple problem:
Predict if Bob will like a movie given Alice's and Carol's grades
Hypothesis:
An affine function
It can be modeled with a single neuron: the decision boundary $\{x : w^\top x + b = 0\}$ is a hyperplane of the input space, and the sign of the activation $w^\top x + b$ gives the prediction.
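A minimal sketch of such a neuron in Python (the weights, bias, and grades below are hypothetical values chosen only for illustration):

import numpy as np

def neuron_predict(x, w, b):
    """Single artificial neuron: sign of the weighted sum of the inputs."""
    activation = np.dot(w, x) + b          # weighted sum plus bias
    return 1 if activation >= 0 else -1    # +1: Bob likes the movie, -1: he does not

# hypothetical weights and bias; x = [Alice's grade, Carol's grade]
w = np.array([0.6, 0.4])
b = -2.5
print(neuron_predict(np.array([4.0, 3.0]), w, b))  # -> 1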
The artificial neuron
Previous course on SVM: the SVM was originally formulated using neurons.
Multilayer feed-forward networks with as few as one hidden layer are universal approximators
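As a sketch of such a one-hidden-layer network (the width of 16 hidden units, the tanh nonlinearity, and the random weights are arbitrary choices, not prescribed by these notes):

import numpy as np

def mlp_forward(x, W1, b1, W2, b2):
    """One-hidden-layer feed-forward network: affine map, nonlinearity, affine map."""
    h = np.tanh(W1 @ x + b1)   # hidden layer (tanh is an assumed choice of nonlinearity)
    return W2 @ h + b2         # output layer

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(16, 2)), np.zeros(16)   # 2 inputs -> 16 hidden units
W2, b2 = rng.normal(size=(1, 16)), np.zeros(1)    # 16 hidden units -> 1 output
print(mlp_forward(np.array([4.0, 3.0]), W1, b1, W2, b2))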
Find parameters $w$ such that $f_w(x) = y$ for every movie $(x, y)$ in the database.
Objective
Reach the bottom of the valley
Minimizing an objective function
For every smooth function $f$, the negative gradient $-\nabla f(x)$ points in the direction of steepest descent,
i.e., following the opposite direction of the gradient leads to a local minimum of the function.
An iterative algorithm (gradient descent): $x_{t+1} = x_t - \eta \nabla f(x_t)$, where $\eta$ is the learning rate.
Too high a learning rate: the iterates overshoot and may diverge.
Too low a learning rate: convergence is very slow.
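A minimal gradient descent sketch (the quadratic objective and the learning rate of 0.1 are illustrative choices):

import numpy as np

def gradient_descent(grad_f, x0, lr=0.1, n_steps=100):
    """Iteratively follow the opposite direction of the gradient."""
    x = np.asarray(x0, dtype=float)
    for _ in range(n_steps):
        x = x - lr * grad_f(x)   # x_{t+1} = x_t - lr * grad f(x_t)
    return x

# example objective: f(x) = (x - 3)^2, whose gradient is 2 * (x - 3)
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
print(x_min)  # close to 3, the bottom of the "valley"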
Simple problem:
Predict if Bob will like a movie given Alice's and Carol's grades
Hypothesis:
An affine function
The sign function is: $\operatorname{sign}(a) = +1$ if $a \geq 0$, $-1$ otherwise.
We need a function to compare predictions and ground truth.
A loss function $\ell(\hat{y}, y)$ such that: it is low when the prediction $\hat{y}$ agrees with the ground truth $y$, higher otherwise, and it is differentiable (so that gradient descent applies).
Possibilities include, e.g., the squared error or the cross-entropy discussed later in these notes.
Iteratively compute the output of the network (forward pass).
Iteratively compute the derivatives starting from the output (backward pass).
Update the weights according to the learning rate.
Loss function and derivatives for the single neuron model:
Loss function and derivatives for a two-neuron model:
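The exact formulas are left implicit here; assuming a squared loss and a sigmoid activation $\sigma$ (illustrative choices), the derivatives work out as:
Single neuron: $\hat{y} = \sigma(w^\top x + b)$, loss $L = \tfrac{1}{2}(\hat{y} - y)^2$.
Derivatives (chain rule): $\frac{\partial L}{\partial w} = (\hat{y} - y)\,\sigma'(w^\top x + b)\,x$ and $\frac{\partial L}{\partial b} = (\hat{y} - y)\,\sigma'(w^\top x + b)$.
Two neurons in sequence: $h = \sigma(w_1 x + b_1)$, $\hat{y} = \sigma(w_2 h + b_2)$;
then $\frac{\partial L}{\partial w_1} = (\hat{y} - y)\,\sigma'(w_2 h + b_2)\,w_2\,\sigma'(w_1 x + b_1)\,x$, where each factor comes from one layer: this is exactly what the backward pass accumulates.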
We do not want to explicitly expand the whole loss function and differentiate it by hand.
In practice, we make intensive use of the chain rule: $(g \circ f)'(x) = g'(f(x)) \, f'(x)$,
or, for three functions: $(h \circ g \circ f)'(x) = h'(g(f(x))) \, g'(f(x)) \, f'(x)$.
import numpy as np

class Module:
    """Example module: a fully connected (linear) layer."""
    def __init__(self, in_size, out_size):
        self.weights = np.random.randn(out_size, in_size) * 0.01
    def forward(self, x):
        y = self.weights @ x           # compute output
        self.ctx = x                   # save the input for backward (saves computation time)
        return y
    def backward(self, grad_output):
        # compute gradient w.r.t. parameters
        self.grad_weights = np.outer(grad_output, self.ctx)
        # compute gradient w.r.t. input, returned for use in the previous layer
        grad_input = self.weights.T @ grad_output
        return grad_input
    def update_weights(self, lr):
        # apply gradient descent
        self.weights = self.weights - lr * self.grad_weights
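A hypothetical usage sketch of the Module class above (the input, target, squared loss, and learning rate are made up for illustration):

import numpy as np

layer = Module(in_size=2, out_size=1)
x, y = np.array([4.0, 3.0]), np.array([1.0])   # one (input, target) pair

for step in range(100):
    y_hat = layer.forward(x)                   # forward pass
    grad_output = 2 * (y_hat - y)              # derivative of the squared loss w.r.t. the output
    layer.backward(grad_output)                # backward pass
    layer.update_weights(lr=0.01)              # gradient descent step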
Find parameters $w$ such that $f_w(x) = y$ for all movies $(x, y)$.
No access to the whole set of movies, only a training subset $\{(x_i, y_i)\}_{i=1}^{N}$, with $N$ the number of training samples.
Minimizing over the training set: $\min_w \frac{1}{N} \sum_{i=1}^{N} \ell\big(f_w(x_i), y_i\big)$
Approximate the sum over the training set by picking only one sample at each iteration (stochastic gradient descent).
Is it the same as gradient descent? Not exactly, and convergence is very slow.
Instead, average the gradient over batches.
A batch = a random subset of the training set.
(All neural network libraries handle batches; a mini-batch SGD sketch follows the code below.)
NumPy style, looping over the samples:
batch  # size (B, in_size)
w      # size (out_size, in_size)
b      # size (out_size)
output = []
for i in range(batch.shape[0]):
    temp = w @ batch[i] + b
    output.append(temp)
output = np.stack(output, axis=0)
output  # size (B, out_size)
With batch operations
batch # size (B, in_size)
w  # size (in_size, out_size), note: transposed compared to the loop version
b  # size (out_size)
output = batch @ w + b
output # size (B, out_size)
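A minimal mini-batch SGD sketch (the toy dataset, batch size of 32, squared loss, and learning rate are placeholder choices, not from these notes):

import numpy as np

rng = np.random.default_rng(0)
X, Y = rng.normal(size=(1000, 2)), rng.normal(size=(1000, 1))   # toy training set
w, b = np.zeros((2, 1)), np.zeros(1)
batch_size, lr = 32, 0.01

for step in range(500):
    idx = rng.choice(len(X), size=batch_size, replace=False)  # random subset of the training set
    xb, yb = X[idx], Y[idx]
    y_hat = xb @ w + b                                         # batch forward pass
    grad = 2 * (y_hat - yb) / batch_size                       # averaged squared-loss gradient
    w -= lr * (xb.T @ grad)                                    # gradient w.r.t. w, averaged over the batch
    b -= lr * grad.sum(axis=0)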
Information estimates the number of bits required to encode/transmit an event:
Information: $I(x) = -\log_2 p(x)$
Entropy is the expected number of bits to encode/transmit a random event:
Entropy: $H(p) = -\sum_x p(x) \log_2 p(x)$
Cross-entropy estimates the number of bits needed to transmit events drawn from a distribution $p$ using a code optimized for another distribution $q$.
For one sample: $-\log_2 q(x)$
For a dataset: $H(p, q) \approx -\frac{1}{N} \sum_{i=1}^{N} \log_2 q(x_i)$
(averaged for insensitivity to the dataset size)
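A small numerical check of these formulas (the two distributions below are arbitrary examples):

import numpy as np

def entropy(p):
    """Expected number of bits to encode events drawn from p."""
    return -np.sum(p * np.log2(p))

def cross_entropy(p, q):
    """Expected number of bits when events from p are encoded with a code optimised for q."""
    return -np.sum(p * np.log2(q))

p = np.array([0.5, 0.25, 0.25])
q = np.array([1/3, 1/3, 1/3])
print(entropy(p))           # 1.5 bits
print(cross_entropy(p, q))  # ~1.585 bits, always >= entropy(p)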
For classification, let $p$ be the empirical label distribution (probability 1 on the true class) and $q$ the distribution predicted by the network.
Then the cross-entropy for one sample reduces to $-\log q_y$, the negative log-probability assigned to the true class $y$.
Let $y \in \{0, 1\}$ be the ground truth and let $\hat{y} \in [0, 1]$ be the predicted probability (e.g., the output of a sigmoid); the per-sample loss is then $-\big(y \log \hat{y} + (1 - y) \log(1 - \hat{y})\big)$.
Can we use a single output for multi-class classification?
Predict a vector with one value per class: $\hat{y} \in \mathbb{R}^{C}$.
The highest value gives the selected class: $\hat{c} = \arg\max_c \hat{y}_c$.
What loss can we use?
Seeing the output as a probability distribution allows us to use the cross-entropy.
Let the softmax turn the output vector $z$ into a probability distribution: $s(z)_c = \frac{e^{z_c}}{\sum_k e^{z_k}}$.
What do we gain? The values are positive and sum to one, so they can be read as class probabilities.
Good properties associated with cross-entropy: combined with the softmax, the loss for a sample of true class $y$ is simply $L = -\log s(z)_y$,
and its derivative with respect to the outputs is $\frac{\partial L}{\partial z_c} = s(z)_c - \mathbb{1}[c = y]$.
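A short numerical sketch of the softmax cross-entropy and its gradient (the logits and true class below are arbitrary):

import numpy as np

def softmax(z):
    e = np.exp(z - z.max())        # subtract max for numerical stability
    return e / e.sum()

def cross_entropy_loss(z, true_class):
    return -np.log(softmax(z)[true_class])

z, true_class = np.array([2.0, -1.0, 0.5]), 0
probs = softmax(z)
grad = probs.copy()
grad[true_class] -= 1.0            # gradient w.r.t. the logits: softmax(z) - one_hot(true_class)
print(cross_entropy_loss(z, true_class), grad)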
Neural Network notes