Skip to content

eva2_mim model, training not converging #1764

Answered by rwightman
keertika-11 asked this question in Q&A
Discussion options

You must be logged in to vote

@keertika-11 nothing wrong with the model, has been tested. I can't get into detail debugging other's train scripts or hparams, but you LR is 1-2 orders of magnitude too large for that batch size and I'd never train these models without gradient clipping. Moving to dicussions in case anyone else has suggestions as this isn't a bug...

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Answer selected by keertika-11
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #1759 on April 10, 2023 06:17.