One major area of modern machine learning is neural networks (NNs). NNs are inspired by how human brains function: neurons in the brain communicate information with each other to learn about the world, and machine learning imitates this by using artificial neurons to process and learn from data.
In the previous lecture, the AI took inputs and multiplied them by weights to produce an output. For example, given inputs x1 and x2, the AI learns from data what the weight for each input should be, multiplies the inputs by their weights, and applies an output function to perform either a classification or a regression task. Neural networks share the same idea, represented graphically below.
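In code, such a single unit might look like the following minimal sketch. The weights, bias, and step output function are illustrative values (not taken from the lecture), chosen so that the unit happens to compute logical OR:

```python
# A minimal sketch of a single artificial unit: multiply the inputs by
# learned weights, add a bias, and pass the sum through an output function.
# All numeric values here are made up for illustration.

def step(value):
    """Output function for classification: 1 if value >= 0, else 0."""
    return 1 if value >= 0 else 0

def unit(x1, x2, w1, w2, bias):
    return step(w1 * x1 + w2 * x2 + bias)

# Example: these particular weights make the unit behave like logical OR.
for x1, x2 in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x1, x2, "->", unit(x1, x2, w1=1.0, w2=1.0, bias=-0.5))
```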
Gradient descent is an algorithm for minimizing loss when training neural networks. In calculus, the gradient is the slope of a function at a given input. The gradient descent algorithm uses this mathematical principle to minimize the loss function: it looks at all the data points, calculates the gradient that will lead to decreasing loss, and updates the weights in that direction. By repeating this process, the weights should converge and the loss will be minimized.
However, this algorithm has a limitation. When there is a lot of data, which is most likely the case in real-world problems, the step of looking at all the data points becomes computationally costly and time-inefficient.
- Stochastic Gradient Descent (SGD): Traditional gradient descent takes too much computational effort because it calculates the gradient from all data points. SGD reduces this cost by looking at only one randomly chosen data point each iteration. Though each gradient estimate is less accurate, the algorithm takes much less time to compute.
- Mini-Batch Gradient Descent: While traditional gradient descent takes too much time and SGD is not as accurate, mini-batch gradient descent strikes a balance between the two. This approach uses neither all the data points nor a single random one, but randomly selects a batch of data points (with a customizable batch size) and calculates the gradient from that batch. A sketch covering all three variants follows this list.
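Since the three variants differ only in how many data points feed each gradient estimate, one sketch can cover all of them. The example below assumes linear regression with a squared loss purely so the gradient is easy to write out; setting `batch_size` to the dataset size gives traditional gradient descent, 1 gives SGD, and anything in between gives mini-batch:

```python
import numpy as np

def gradient_descent(X, y, learning_rate=0.1, epochs=100, batch_size=None):
    """Fit linear weights w by gradient descent on a mean-squared-error loss."""
    n, d = X.shape
    batch_size = batch_size or n              # default: full-batch
    w = np.zeros(d)
    for _ in range(epochs):
        indices = np.random.permutation(n)    # shuffle, then walk in batches
        for start in range(0, n, batch_size):
            batch = indices[start:start + batch_size]
            Xb, yb = X[batch], y[batch]
            # Gradient of (1/m) * sum((Xw - y)^2) with respect to w.
            grad = 2 / len(batch) * Xb.T @ (Xb @ w - yb)
            w -= learning_rate * grad         # step against the gradient
    return w

# Example: recover the weights of y = 2*x1 - 3*x2 from noisy samples.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = X @ np.array([2.0, -3.0]) + rng.normal(scale=0.01, size=200)
print(gradient_descent(X, y, batch_size=16))  # approximately [ 2. -3.]
```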
A simple NN connects input neurons directly to output neurons, which can only capture a linear relationship between input and output; however, real-world problems can rarely be solved by linear relationships alone. Therefore, the multilayer neural network (MNN) is introduced to form more complex relationships between input and output neurons.
The middle layers of a neural network are called hidden layers. Mathematically, they serve the purpose of creating complex functions relating input to output. In a real-world sense, they can be interpreted as individual units that each learn a different feature from the input and combine those features to form a more accurate output, as the sketch below illustrates.
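As a hand-built illustration (the weights below are chosen by hand, not learned), a network with one hidden layer and a nonlinear ReLU activation can compute XOR, a standard example of a function that no direct linear connection from inputs to outputs can represent:

```python
import numpy as np

def relu(z):
    return np.maximum(0, z)

# Hidden layer: two units, each a weighted sum of the inputs plus a bias.
W1 = np.array([[1.0, 1.0],
               [1.0, 1.0]])
b1 = np.array([0.0, -1.0])

# Output layer: combines the hidden units' activations.
W2 = np.array([1.0, -2.0])
b2 = 0.0

def forward(x):
    hidden = relu(W1 @ x + b1)   # hidden layer with nonlinearity
    return W2 @ hidden + b2      # output layer

for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, "->", forward(np.array(x, dtype=float)))  # implements XOR
```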
The gradient descent algorithm works for a simple NN because we know what the actual input and output values should be, so we are able to calculate the loss and the gradient. But in an MNN, we have no idea what the actual values of the hidden layers should be; that is where the backpropagation algorithm comes in. Since we only know the loss of the output given the input, we start with the output neurons and backtrack through each hidden layer to the input. The key is that we have information about the weights of each neuron in the hidden layers, so we can estimate what proportion of the loss is due to which neuron.
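Here is a minimal sketch of backpropagation for a one-hidden-layer network trained on XOR. The sigmoid activation, layer sizes, and learning rate are illustrative choices; the key lines are in the backward pass, where the output error is passed back through the output weights to estimate each hidden unit's share of the loss:

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

rng = np.random.default_rng(1)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)     # XOR targets

W1, b1 = rng.normal(size=(2, 4)), np.zeros(4)       # hidden layer (4 units)
W2, b2 = rng.normal(size=(4, 1)), np.zeros(1)       # output layer
lr = 1.0

for _ in range(5000):
    # Forward pass.
    h = sigmoid(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)

    # Backward pass: error at the output...
    d_out = (out - y) * out * (1 - out)
    # ...propagated back through W2 to estimate each hidden unit's share.
    d_h = (d_out @ W2.T) * h * (1 - h)

    # Gradient steps for both layers.
    W2 -= lr * h.T @ d_out
    b2 -= lr * d_out.sum(axis=0)
    W1 -= lr * X.T @ d_h
    b1 -= lr * d_h.sum(axis=0)

print(out.round(2).ravel())   # approaches [0, 1, 1, 0]
```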
Sometimes the number of hidden layers and neurons is too large for a certain problem, which leads to overfitting: the AI learns the given data too closely and so fails to generalize. One way to combat overfitting is dropout. With this technique, during each learning epoch/iteration the NN randomly "hides" some neurons (as if they did not exist) in order to prevent the network from relying too heavily on certain neurons.
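A minimal sketch of the technique follows; the 0.5 rate and the "inverted dropout" scaling convention are common choices, not specifics from the lecture:

```python
import numpy as np

def dropout(activations, rate=0.5, training=True):
    """Randomly zero out a fraction of hidden-unit activations."""
    if not training:
        return activations                 # use all units at test time
    keep = np.random.random(activations.shape) >= rate
    # Scale the survivors up so the expected activation stays the same.
    return activations * keep / (1 - rate)

hidden = np.array([0.2, 0.9, 0.4, 0.7])
print(dropout(hidden))                     # e.g. [0.4, 0. , 0.8, 1.4]
print(dropout(hidden, training=False))     # unchanged at test time
```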
Computer vision encompasses the computational methods for analyzing and understanding digital images, and it is often achieved using NNs. For instance, in Apple's Photos app, the AI is able to recognize which photos contain a certain person's face.
One way the AI could achieve this is by feeding every pixel's colour information into the input neurons and using an MNN to recognize features in the images; however, this method is too computationally expensive, since modern devices capture millions of pixels per image, making the input size enormous.
One way to simplify the input is convolution. Image convolution applies a filter that adds each pixel value of an image to its neighbors, weighted according to a kernel matrix. Doing so alters the image and can help the neural network process it. For instance, in the figure below, multiplying each possible 3x3 region element-wise by the kernel matrix and summing yields a 2x2 matrix, which shrinks the size of the input.
Furthermore, image convolution can do more than shrink the input: by applying certain kernel matrices, we can extract certain features of an image. For instance, by applying the famous matrix in the figure below,
we get a convolved image that contains only the boundaries of the objects in the original, like the figure below.
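A minimal sketch tying these two ideas together: `convolve` slides a kernel over every position of the image, and the kernel shown is the widely used edge-detection kernel (presumably the "famous matrix" the figure shows). On a small image with one sharp boundary, the output is zero in uniform regions and large at the edge:

```python
import numpy as np

def convolve(image, kernel):
    """Slide the kernel over every position, multiply element-wise, sum."""
    kh, kw = kernel.shape
    ih, iw = image.shape
    out = np.zeros((ih - kh + 1, iw - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# Edge-detection kernel: 8x each pixel minus all its neighbors, so
# uniform regions cancel to zero while sharp changes stand out.
edge_kernel = np.array([[-1, -1, -1],
                        [-1,  8, -1],
                        [-1, -1, -1]])

# A 5x5 image with one horizontal boundary between two flat regions.
image = np.array([[10, 10, 10, 10, 10],
                  [10, 10, 10, 10, 10],
                  [10, 10, 10, 10, 10],
                  [80, 80, 80, 80, 80],
                  [80, 80, 80, 80, 80]])
print(convolve(image, edge_kernel))
# [[   0.    0.    0.]
#  [-210. -210. -210.]
#  [ 210.  210.  210.]]
```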
Another way to simplify the image is the method of pooling. In this method, the input size is shrunk even further by sampling from regions of the image. A typical pooling method is max-pooling, where we take the maximum value in each region as its representative and put it in a new matrix. In the figure below, for every 2x2 region of the 4x4 matrix, the maximum value is extracted to form a new 2x2 matrix.
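A minimal sketch of max-pooling matching the figure: a 4x4 matrix is reduced to 2x2 by keeping the maximum of each 2x2 region (the example values are made up):

```python
import numpy as np

def max_pool(image, size=2):
    """Keep only the maximum value from each size x size region."""
    h, w = image.shape
    out = np.zeros((h // size, w // size))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            region = image[i * size:(i + 1) * size,
                           j * size:(j + 1) * size]
            out[i, j] = region.max()
    return out

image = np.array([[1, 5, 2, 0],
                  [3, 2, 1, 4],
                  [0, 1, 8, 6],
                  [2, 2, 3, 1]])
print(max_pool(image))   # [[5. 4.]
                         #  [2. 8.]]
```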
A convolutional neural network (CNN) is a neural network that uses convolution, usually for analyzing images. It starts by applying image convolution and pooling to simplify the image, then feeds the information about each remaining pixel into the input neurons. See the figure below.
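As an illustration, a small CNN might be assembled in TensorFlow/Keras as below; the 28x28 grayscale input, the 10 output classes (e.g. handwritten digits), and the layer sizes are assumptions for this sketch, not details from the lecture:

```python
import tensorflow as tf

model = tf.keras.models.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),          # 28x28 grayscale images
    # Learn 32 different 3x3 kernels (filters) via convolution.
    tf.keras.layers.Conv2D(32, (3, 3), activation="relu"),
    # Shrink the feature maps with 2x2 max-pooling.
    tf.keras.layers.MaxPooling2D(pool_size=(2, 2)),
    # Flatten the pooled feature maps into a vector of input units...
    tf.keras.layers.Flatten(),
    # ...for an ordinary feed-forward network with one hidden layer.
    tf.keras.layers.Dense(128, activation="relu"),
    tf.keras.layers.Dense(10, activation="softmax"),  # 10 classes
])
model.compile(optimizer="adam", loss="categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
```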
Feed-forward neural networks are the type of neural network discussed so far: input data is provided to the network, which eventually produces some output. In recurrent neural networks, by contrast, input and output are not one-to-one; depending on the structure and design of the network, the output may be fed back into the input again, allowing the network to process continuous input such as video or voice.
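A minimal sketch of the recurrent idea (sizes and random weights are illustrative): the hidden state produced at each step is fed back in alongside the next input, so earlier elements of a sequence influence later outputs:

```python
import numpy as np

rng = np.random.default_rng(0)
state_size, input_size = 4, 3
W_state = rng.normal(size=(state_size, state_size))  # state -> state
W_input = rng.normal(size=(state_size, input_size))  # input -> state

def step(state, x):
    # The new state depends on both the previous state and the new input.
    return np.tanh(W_state @ state + W_input @ x)

state = np.zeros(state_size)
for x in rng.normal(size=(5, input_size)):   # a sequence of 5 inputs
    state = step(state, x)
    print(state.round(2))                    # output at each time step
```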
Check out some examples that put these theories into practice.