Hello, your project is very innovative. But I did not achieve the effect you provided during the reproduction process.
Are you using the file onlydynamic+8conv+captions.yaml?
And what model of GPU are you using?
Perhaps random seeds can also affect the effect. Is it convenient to know which random seed you are using?
It would be great if you could answer my questions. Thank you very much!