September 19, 2022
Weight Decay parameter for SGD optimizer in PyTorch
L2 regularization is also referred to as weight decay. The name comes from the SGD update: in backpropagation, the negative gradient of the L2 regularization term with respect to a parameter w_i is -2 * lambda * w_i, so every optimization step shrinks (decays) the weight toward zero. Here lambda is the aforementioned hyperparameter, exposed in PyTorch's SGD optimizer as the weight_decay argument. (Note that PyTorch folds the constant factor into the hyperparameter: its SGD implementation adds weight_decay * w_i directly to each parameter's gradient.)
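As a minimal sketch of how this looks in practice, the coefficient is simply passed as the weight_decay keyword argument to torch.optim.SGD. The model, data, and hyperparameter values below are illustrative, not from any particular experiment:

```python
import torch
from torch import nn

# Tiny illustrative model; any nn.Module works the same way.
model = nn.Linear(10, 1)

# weight_decay is the L2 / weight-decay coefficient: PyTorch's SGD adds
# weight_decay * w to each parameter's gradient before the update step.
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, weight_decay=1e-4)

# One illustrative training step on random data.
x = torch.randn(32, 10)
y = torch.randn(32, 1)
loss = nn.functional.mse_loss(model(x), y)

optimizer.zero_grad()
loss.backward()
optimizer.step()
```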