Category Archives: Deep Learning

September 22, 2022

Differences between Learning Rate and Weight Decay Hyperparameters in Neural Networks

The amount of regularization must be balanced for each dataset and architecture. Recognition of this principle permits the general use of super-convergence. Reducing other forms of regularization and regularizing with very large learning rates makes training significantly more efficient.
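To make the two hyperparameters in the title concrete, here is a minimal sketch of one plain SGD step with L2-style weight decay. The function name `sgd_step` and the default values are illustrative assumptions, not taken from the post. The learning rate scales the whole update, while weight decay adds a pull toward zero that is proportional to each weight.

```python
def sgd_step(weights, grads, learning_rate=0.1, weight_decay=0.01):
    """One SGD update with L2-style weight decay (illustrative sketch).

    learning_rate scales the size of the whole step;
    weight_decay adds a term that shrinks each weight toward zero.
    """
    return [
        w - learning_rate * (g + weight_decay * w)
        for w, g in zip(weights, grads)
    ]


# With a zero gradient, only weight decay moves the weight.
decayed = sgd_step([1.0], [0.0], learning_rate=0.1, weight_decay=0.5)

# With weight decay switched off, only the gradient moves the weight.
stepped = sgd_step([1.0], [2.0], learning_rate=0.1, weight_decay=0.0)
```

Raising the learning rate enlarges both effects at once, which is one reason the two settings usually have to be tuned together.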

June 26, 2021

Explain Pooling layers: Max Pooling, Average Pooling, Global Average Pooling, and Global Max pooling.

admin

Global Average Pooling does something different. It applies average pooling on the spatial dimensions until each spatial dimension is one, and leaves other dimensions unchanged.
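As a rough illustration of the excerpt above, the following NumPy sketch averages over the spatial axes of a feature map. It assumes a channels-last layout of (batch, height, width, channels); the function names are hypothetical helpers, not a library API.

```python
import numpy as np


def global_average_pool(x, keepdims=False):
    """Average over the height and width axes of a (batch, H, W, C) array."""
    return x.mean(axis=(1, 2), keepdims=keepdims)


def global_max_pool(x, keepdims=False):
    """Take the maximum over the height and width axes of a (batch, H, W, C) array."""
    return x.max(axis=(1, 2), keepdims=keepdims)


# Toy feature map: 2 images, 4x4 spatial grid, 3 channels.
x = np.arange(2 * 4 * 4 * 3, dtype=np.float64).reshape(2, 4, 4, 3)

gap = global_average_pool(x)                     # shape (2, 3)
gap_kept = global_average_pool(x, keepdims=True) # shape (2, 1, 1, 3)
gmp = global_max_pool(x)                         # shape (2, 3)
```

With `keepdims=True` each spatial dimension is reduced to one, as the excerpt describes; the batch and channel dimensions are left unchanged either way.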

December 14, 2019

What are batch size, steps, iterations, and epochs in a neural network?

admin

When you put m examples in a mini-batch, you need to do O(m) computation and use O(m) memory, and you reduce the amount of uncertainty in the gradient by a factor of only O(sqrt(m)).
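The square-root relationship can be checked with a small simulation. This is only a sketch under a simplifying assumption: per-example gradients are modelled as independent Gaussian noise around a true value, and the function name and parameters are made up for the illustration.

```python
import numpy as np


def minibatch_gradient_std(m, trials=20000, seed=0):
    """Estimate the standard deviation of a mini-batch gradient of size m.

    Per-example gradients are simulated as i.i.d. Gaussian samples
    (an assumption for illustration, not a model of real training).
    """
    rng = np.random.default_rng(seed)
    per_example = rng.normal(loc=1.0, scale=2.0, size=(trials, m))
    return per_example.mean(axis=1).std()


std_1 = minibatch_gradient_std(1)
std_100 = minibatch_gradient_std(100)

# 100x more computation and memory buys roughly a 10x reduction in noise.
print(f"m=1: {std_1:.3f}  m=100: {std_100:.3f}  ratio: {std_1 / std_100:.1f}")
```

Going from 1 to 100 examples per batch multiplies the cost by 100 but shrinks the noise only by about sqrt(100) = 10, which is the diminishing return the excerpt points out.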

December 9, 2019

How does the embedding layer work in Keras?

admin

An embedding is a matrix in which each row is the vector that corresponds to an item in your vocabulary.
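An embedding lookup is just indexing into that matrix. The NumPy sketch below stores one vector per vocabulary item along the first axis, which is the layout Keras uses for its `Embedding` weights, (vocabulary size, embedding dimension); a column-per-item layout is simply the transpose. The toy vocabulary and random values are assumptions for illustration, and no Keras code is involved.

```python
import numpy as np

# Hypothetical toy vocabulary; index 0 is reserved for padding.
vocab = ["<pad>", "cat", "dog", "fish"]
embedding_dim = 3

# One vector per vocabulary item, shape (vocab_size, embedding_dim).
rng = np.random.default_rng(0)
embedding_matrix = rng.normal(size=(len(vocab), embedding_dim))

# A batch of 2 sequences, each 3 token ids long.
token_ids = np.array([[1, 2, 0],
                      [3, 1, 1]])

# Integer indexing replaces every id with its vector.
embedded = embedding_matrix[token_ids]  # shape (2, 3, 3)
```

In a trained model the matrix entries are learned parameters rather than random values, but the lookup itself works the same way.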

August 15, 2019

How do you split a dataset into train and test in Python?

admin

The common assumption is that you will develop a system using the train and dev data and then evaluate it on test data.
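One way to produce the three sets is a seeded shuffle followed by slicing. This standard-library sketch is an assumption about how such a split might look, not the post's own code; the function name and the 80/10/10 default proportions are illustrative.

```python
import random


def train_dev_test_split(items, dev_fraction=0.1, test_fraction=0.1, seed=0):
    """Shuffle items reproducibly and slice them into train, dev, and test lists."""
    items = list(items)
    random.Random(seed).shuffle(items)  # seeded so the split is repeatable

    n_test = int(len(items) * test_fraction)
    n_dev = int(len(items) * dev_fraction)

    test = items[:n_test]
    dev = items[n_test:n_test + n_dev]
    train = items[n_test + n_dev:]
    return train, dev, test


train, dev, test = train_dev_test_split(range(100))
print(len(train), len(dev), len(test))  # 80 10 10
```

The test list is set aside first and should not be touched until development on the train and dev sets is finished.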

December 9, 2018

Replace your RNN and LSTM with an Attention-based Transformer model for NLP

admin

Train a network with a long-range context that can look back thousands of steps and remember what it saw.

December 5, 2018

TensorFlow BERT for Pre-training Natural Language Processing

admin

It is a method of pre-training language representations.

July 30, 2018

How does batch normalization work?

admin

Batch Normalization is a relatively recent technique that normalizes a layer's inputs over each mini-batch and, as a side effect, acts as a regularizer.
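For a sense of what the operation computes, here is a minimal NumPy sketch of the training-time forward pass for a (batch, features) input. It leaves out the running statistics used at inference and the backward pass; the function name and the `eps` default are assumptions for the illustration.

```python
import numpy as np


def batch_norm_forward(x, gamma, beta, eps=1e-5):
    """Normalize each feature over the batch axis, then scale and shift.

    x:     array of shape (batch, features)
    gamma: learnable scale, shape (features,)
    beta:  learnable shift, shape (features,)
    """
    mean = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mean) / np.sqrt(var + eps)  # zero mean, unit variance per feature
    return gamma * x_hat + beta


rng = np.random.default_rng(0)
x = rng.normal(loc=5.0, scale=3.0, size=(64, 4))

out = batch_norm_forward(x, gamma=np.ones(4), beta=np.zeros(4))
```

Because the mean and variance come from the current mini-batch, each example's output depends slightly on the other examples in the batch; this added noise is commonly cited as the source of the regularizing effect.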
