eva2_mim model, training not converging #1764
Hi, I am retraining the eva2_mim model, but it is not converging: I get 0% test accuracy.
What am I doing wrong?
Replies: 2 comments
@keertika-11 nothing wrong with the model, it has been tested. I can't get into detailed debugging of others' train scripts or hparams, but your LR is 1-2 orders of magnitude too large for that batch size, and I'd never train these models without gradient clipping. Moving to discussions in case anyone else has suggestions, as this isn't a bug...
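For anyone landing here with the same symptom, here is a rough sketch of the two ideas in that reply: scaling the LR with the batch size, and clipping gradients by their global L2 norm. It is plain Python for illustration only and is not taken from the original train script. The linear scaling rule, the reference batch size of 256, and every numeric value are assumptions; `scaled_lr` and `clip_by_global_norm` are hypothetical helper names. In a PyTorch training loop the clipping step is normally done with `torch.nn.utils.clip_grad_norm_` between `loss.backward()` and `optimizer.step()`.

```python
import math


def scaled_lr(base_lr, batch_size, base_batch_size=256):
    """Linear scaling heuristic (an assumption, not a rule from this thread):
    keep the LR proportional to the batch size."""
    return base_lr * batch_size / base_batch_size


def clip_by_global_norm(grads, max_norm):
    """Rescale a flat list of gradient values so that their combined
    L2 norm is at most max_norm.

    Returns (clipped_grads, total_norm_before_clipping).
    """
    total_norm = math.sqrt(sum(g * g for g in grads))
    if total_norm <= max_norm:
        # Already within the limit: leave the gradients unchanged.
        return list(grads), total_norm
    scale = max_norm / total_norm
    return [g * scale for g in grads], total_norm


# Hypothetical values, chosen only to make the arithmetic easy to follow.
lr = scaled_lr(base_lr=1e-3, batch_size=64)            # a quarter of base_lr
clipped, norm = clip_by_global_norm([3.0, 4.0], 1.0)   # norm before clipping is 5.0
print(lr, norm, clipped)
```

Clipping changes only the length of the gradient vector, not its direction, so a single very large gradient early in training cannot throw the weights far off. That fits the reply's point that an overly large LR combined with no clipping can stop the model from converging.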
Gradient clipping did help, thanks for the suggestion. The loss decreased from 7.57 to 5.082 in the first 100 steps of the 1st epoch.