Training Loop

1. Basics
2. Learning Rate
- 2.1. One cycle policy

1. Basics

A basic training loop can be summarized by the following steps:

Considering a supervised deep learning setting for pedagogical purposes.

Assuming we have the model, the data and weight updating mechanism setup along with performance tracking metrics, it goes as …

initialize weights, setup dataloaders, hyperparams and other initial strategies
for each batch in the training data loader
- run inference on the batch
- compute the loss for the batch
- backpropogate on the loss to obtain gradients for updates
- update the weights according to the chosen strategy
for all data in the validation data loader
- run inference
- computer aggregate performance
- may use this to test out upon hyperparams or other manually chosen strategies
This is one epoch, repeat 2 and 3 for as many epochs as needed:
- concepts like early stopping, not overfitting and convergence need to be taken into account here
Test the model on a previously unseen test set to gauge the performance of the algorithm with your manual choices.

2. Learning Rate

An important aspect of the weight updating mechanisms is the extent to which you choose to do so and how you, if you choose to, vary it over an epoch or the complete training cycle. Accumulating some strategies in this section with external resources.

2.1. One cycle policy

https://www.deepspeed.ai/tutorials/one-cycle/

Training Loop

Table of Contents

1. Basics

2. Learning Rate

2.1. One cycle policy