I'm trying to reproduce your results.
I see that none of the train_*.sh scripts set a learning rate, so obtain_train_args() will fall back to its default of 0.001.
However, in all of the released checkpoints I see:
checkpoint['optimizer']['param_groups'][0]['initial_lr']
0.0001
checkpoint['optimizer']['param_groups'][0]['lr']
1.25e-05
Is this just a small oversight? Might there be any other differences?
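
For reference, here is a minimal sketch of how these values can be read out of a checkpoint. It assumes the checkpoint is a plain dict with the 'optimizer' -> 'param_groups' layout shown above; summarize_lr and the example dict are hypothetical names of mine, not part of the repository.

```python
def summarize_lr(checkpoint):
    """Return (initial_lr, lr) for each optimizer param group in a checkpoint dict."""
    groups = checkpoint["optimizer"]["param_groups"]
    # 'initial_lr' is normally only present if an LR scheduler was attached
    # to the optimizer, so fall back to None when it is missing.
    return [(g.get("initial_lr"), g["lr"]) for g in groups]


# Stand-in for a loaded checkpoint, filled with the values reported above.
example = {"optimizer": {"param_groups": [{"initial_lr": 1e-4, "lr": 1.25e-5}]}}

for initial_lr, lr in summarize_lr(example):
    # The ratio shows how far the learning rate has decayed from its start value.
    print(f"initial_lr={initial_lr} lr={lr} ratio={lr / initial_lr:.3f}")
```

With a real file, the dict would come from torch.load(path, map_location="cpu"), where path is wherever the released checkpoint was saved. The presence of 'initial_lr' suggests a scheduler was used, so the starting value of 0.0001 (rather than 0.001) looks like it was passed explicitly at training time.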